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PREFACE 


This  book  is  particularly  intended  for  the  student  with  a  year  (or  a  year  and  a  half) 
of  calculus  who  wants  to  develop,  in  a  short  time,  a  basic  competence  in  each  of  the 
many  areas  of  mathematics  needed  in  junior  to  senior-graduate  courses  in  physics, 
chemistry,  and  engineering.  Thus  it  is  intended  to  be  accessible  to  sophomores  (or 
freshmen  with  AP  calculus  from  high  school).  It  may  also  be  used  effectively  by 
a  more  advanced  student  to  review  half-forgotten  topics  or  learn  new  ones,  either 
by  independent  study  or  in  a  class.  Although  the  book  was  written  especially 
for  students  of  the  physical  sciences,  students  in  any  field  (say  mathematics  or 
mathematics  for  teaching)  may  find  it  useful  to  survey  many  topics  or  to  obtain 
some  knowledge  of  areas  they  do  not  have  time  to  study  in  depth.  Since  theorems 
are  stated  carefully,  such  students  should  not  need  to  unlearn  anything  in  their  later 
work. 

The  question  of  proper  mathematical  training  for  students  in  the  physical  sci¬ 
ences  is  of  concern  to  both  mathematicians  and  those  who  use  mathematics  in  appli¬ 
cations.  Some  instructors  may  feel  that  if  students  are  going  to  study  mathematics 
at  all,  they  should  study  it  in  careful  and  thorough  detail.  For  the  undergradu¬ 
ate  physics,  chemistry,  or  engineering  student,  this  means  either  (1)  learning  more 
mathematics  than  a  mathematics  major  or  (2)  learning  a  few  areas  of  mathematics 
thoroughly  and  the  others  only  from  snatches  in  science  courses.  The  second  alter¬ 
native  is  often  advocated;  let  me  say  why  I  think  it  is  unsatisfactory.  It  is  certainly 
true  that  motivation  is  increased  by  the  immediate  application  of  a  mathematical 
technique,  but  there  are  a  number  of  disadvantages: 

1.  The  discussion  of  the  mathematics  is  apt  to  be  sketchy  since  that  is  not  the 
primary  concern. 

2.  Students  are  faced  simultaneously  with  learning  a  new  mathematical  method 
and  applying  it  to  an  area  of  science  that  is  also  new  to  them.  Frequently  the 
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difficulty  in  comprehending  the  new  scientific  area  lies  more  in  the  distraction 
caused  by  poorly  understood  mathematics  than  it  does  in  the  new  scientific  ideas. 

3.  Students  may  meet  what  is  actually  the  same  mathematical  principle  in  two 
different  science  courses  without  recognizing  the  connection,  or  even  learn  ap¬ 
parently  contradictory  theorems  in  the  two  courses!  For  example,  in  thermody¬ 
namics  students  learn  that  the  integral  of  an  exact  differential  around  a  closed 
path  is  always  zero.  In  electricity  or  hydrodynamics,  they  run  into  f~ n  dd,  which 
is  certainly  the  integral  of  an  exact  differential  around  a  closed  path  but  is  not 
equal  to  zero! 

Now  it  would  be  fine  if  every  science  student  could  take  the  separate  mathematics 
courses  in  differential  equations  (ordinary  and  partial),  advanced  calculus,  linear 
algebra,  vector  and  tensor  analysis,  complex  variables,  Fourier  series,  probability, 
calculus  of  variations,  special  functions,  and  so  on.  However,  most  science  students 
have  neither  the  time  nor  the  inclination  to  study  that  much  mathematics,  yet  they 
are  constantly  hampered  in  their  science  courses  for  lack  of  the  basic  techniques  of 
these  subjects.  It  is  the  intent  of  this  book  to  give  these  students  enough  background 
in  each  of  the  needed  areas  so  that  they  can  cope  successfully  with  junior,  senior, 
and  beginning  graduate  courses  in  the  physical  sciences.  I  hope,  also,  that  some 
students  will  be  sufficiently  intrigued  by  one  or  more  of  the  fields  of  mathematics 
to  pursue  it  futher. 

It  is  clear  that  something  must  be  omitted  if  so  many  topics  are  to  be  compressed 
into  one  course.  I  believe  that  two  things  can  be  left  out  without  serious  harm  at 
this  stage  of  a  student’s  work:  generality,  and  detailed  proofs.  Stating  and  proving 
a  theorem  in  its  most  general  form  is  important  to  the  mathematician  and  to  the 
advanced  student,  but  it  is  often  unnecessary  and  may  be  confusing  to  the  more 
elementary  student.  This  is  not  in  the  least  to  say  that  science  students  have  no 
use  for  careful  mathematics.  Scientists,  even  more  than  pure  mathematicians,  need 
careful  statements  of  the  limits  of  applicability  of  mathematical  processes  so  that 
they  can  use  them  with  confidence  without  having  to  supply  proof  of  their  validity. 
Consequently  I  have  endeavored  to  give  accurate  statements  of  the  needed  theorems, 
although  often  for  special  cases  or  without  proof.  Interested  students  can  easily  find 
more  detail  in  textbooks  in  the  special  fields. 

Mathematical  physics  texts  at  the  senior-graduate  level  are  able  to  assume  a 
degree  of  mathematical  sophistication  and  knowledge  of  advanced  physics  not  yet 
attained  by  students  at  the  sophomore  level.  Yet  such  students,  if  given  simple  and 
clear  explanations,  can  readily  master  the  techniques  we  cover  in  this  text.  (They 
not  only  can,  but  will  have  to  in  one  way  or  another,  if  they  are  going  to  pass 
their  junior  and  senior  physics  courses!)  These  students  are  not  ready  for  detailed 
applications — these  they  will  get  in  their  science  courses — but  they  do  need  and 
want  to  be  given  some  idea  of  the  use  of  the  methods  they  are  studying,  and  some 
simple  applications.  This  I  have  tried  to  do  for  each  new  topic. 

For  those  of  you  familiar  with  the  second  edition,  let  me  outline  the  changes  for 
the  third: 

1.  Prompted  by  several  requests  for  matrix  diagonalization  in  Chapter  3,  I  have 
moved  the  first  part  of  Chapter  10  to  Chapter  3  and  then  have  amplified  the 
treatment  of  tensors  in  Chapter  10.  I  have  also  changed  Chapter  3  to  include 
more  detail  about  linear  vector  spaces  and  then  have  continued  the  discussion  of 
basis  functions  in  Chapter  7  (Fourier  series),  Chapter  8  (Differential  equations), 
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Chapter  12  (Series  solutions)  and  Chapter  13  (Partial  differential  equations). 

2.  Again,  prompted  by  several  requests,  I  have  moved  Fourier  integrals  back  to  the 
Fourier  series  Chapter  7.  Since  this  breaks  up  the  integral  transforms  chapter 
(old  Chapter  15),  I  decided  to  abandon  that  chapter  and  move  the  Laplace 
transform  and  Dirac  delta  function  material  back  to  the  ordinary  differential 
equations  Chapter  8.  I  have  also  amplified  the  treatment  of  the  delta  function. 

3.  The  Probability  chapter  (old  Chapter  16)  now  becomes  Chapter  15.  Here  I  have 
changed  the  title  to  Probability  and  Statistics,  and  have  revised  the  latter  part 
of  the  chapter  to  emphasize  its  purpose,  namely  to  clarify  for  students  the  theory 
behind  the  rules  they  learn  for  handling  experimental  data. 

4.  The  very  rapid  development  of  technological  aids  to  computation  poses  a  steady 
question  for  instructors  as  to  their  best  use.  Without  selecting  any  particular 
Computer  Algebra  System,  I  have  simply  tried  for  each  topic  to  point  out  to 
students  both  the  usefulness  and  the  pitfalls  of  computer  use.  (Please  see  my 
comments  at  the  end  of  ”To  the  Student”  just  ahead.) 

The  material  in  the  text  is  so  arranged  that  students  who  study  the  chapters 
in  order  will  have  the  necessary  background  at  each  stage.  However,  it  is  not 
always  either  necessary  or  desirable  to  follow  the  text  order.  Let  me  suggest  some 
rearrangements  I  have  found  useful.  If  students  have  previously  studied  the  material 
in  any  of  chapters  1,  3,  4,  5,  6,  or  8  (in  such  courses  as  second-year  calculus, 
differential  equations,  linear  algebra),  then  the  corresponding  chapter(s)  could  be 
omitted,  used  for  reference,  or,  preferably,  be  reviewed  briefly  with  emphasis  on 
problem  solving.  Students  may  know  Taylor’s  theorem,  for  example,  but  have  little 
skill  in  using  series  approximations;  they  may  know  the  theory  of  multiple  integrals, 
but  find  it  difficult  to  set  up  a  double  integral  for  the  moment  of  inertia  of  a  spherical 
shell;  they  may  know  existence  theorems  for  differential  equations,  but  have  little 
skill  in  solving,  say,  y"  +  y  =  a;  sin  a;.  Problem  solving  is  the  essential  core  of  a 
course  on  Mathematical  Methods. 

After  Chapters  7  (Fourier  Series)  and  8  (Ordinary  Differential  Equations)  I  like 
to  cover  the  first  four  sections  of  Chapter  13  (Partial  Differential  Equations).  This 
gives  students  an  introduction  to  Partial  Differential  Equations  but  requires  only  the 
use  of  Fourier  series  expansions.  Later  on,  after  studying  Chapter  12,  students  can 
return  to  complete  Chapter  13.  Chapter  15  (Probability  and  Statistics)  is  almost 
independent  of  the  rest  of  the  text;  I  have  covered  this  material  anywhere  from  the 
beginning  to  the  end  of  a  one-year  course. 

It  has  been  gratifying  to  hear  the  enthusiastic  responses  to  the  first  two  editions, 
and  I  hope  that  this  third  edition  will  prove  even  more  useful.  I  want  to  thank  many 
readers  for  helpful  suggestions  and  I  will  appreciate  any  further  comments.  If  you 
find  misprints,  please  send  them  to  me  at  MLBoas@aol.com.  I  also  want  to  thank 
the  University  of  Washington  physics  students  who  were  my  DTJ^X  typists:  Toshiko 
Asai,  Jeff  Sherman,  and  Jeffrey  Frasca.  And  I  especially  want  to  thank  my  son, 
Harold  P.  Boas,  both  for  mathematical  consultations,  and  for  his  expert  help  with 
DTj^X  problems. 

Instructors  who  have  adopted  the  book  for  a  class  should  consult  the  publisher 
about  an  Instructor’s  Answer  Book,  and  about  a  list  correlating  2nd  and  3rd  edition 
problem  numbers  for  problems  which  appear  in  both  editions. 


Mary  L.  Boas 


TO  THE  STUDENT 


As  you  start  each  topic  in  this  book,  you  will  no  doubt  wonder  and  ask  “Just  why 
should  I  study  this  subject  and  what  use  does  it  have  in  applications?”  There  is  a 
story  about  a  young  mathematics  instructor  who  asked  an  older  professor  “What  do 
you  say  when  students  ask  about  the  practical  applications  of  some  mathematical 
topic?”  The  experienced  professor  said  “I  tell  them!”  This  text  tries  to  follow 
that  advice.  However,  you  must  on  your  part  be  reasonable  in  your  request.  It 
is  not  possible  in  one  book  or  course  to  cover  both  the  mathematical  methods 
and  very  many  detailed  applications  of  them.  You  will  have  to  be  content  with 
some  information  as  to  the  areas  of  application  of  each  topic  and  some  of  the 
simpler  applications.  In  your  later  courses,  you  will  then  use  these  techniques  in 
more  advanced  applications.  At  that  point  you  can  concentrate  on  the  physical 
application  instead  of  being  distracted  by  learning  new  mathematical  methods. 

One  point  about  your  study  of  this  material  cannot  be  emphasized  too  strongly: 
To  use  mathematics  effectively  in  applications,  you  need  not  just  knowledge  but  skill. 
Skill  can  be  obtained  only  through  practice.  You  can  obtain  a  certain  superficial 
knowledge  of  mathematics  by  listening  to  lectures,  but  you  cannot  obtain  skill  this 
way.  How  many  students  have  I  heard  say  “It  looks  so  easy  when  you  do  it,”  or  “I 
understand  it  but  I  can’t  do  the  problems!”  Such  statements  show  lack  of  practice 
and  consequent  lack  of  skill.  The  only  way  to  develop  the  skill  necessary  to  use  this 
material  in  your  later  courses  is  to  practice  by  solving  many  problems.  Always  study 
with  pencil  and  paper  at  hand.  Don’t  just  read  through  a  solved  problem — try  to 
do  it  yourself!  Then  solve  some  similar  ones  from  the  problem  set  for  that  section, 
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trying  to  choose  the  most  appropriate  method  from  the  solved  examples.  See  the 
Answers  to  Selected  Problems  and  check  your  answers  to  any  problems  listed  there. 
If  you  meet  an  unfamiliar  term,  look  for  it  in  the  Index  (or  in  a  dictionary  if  it  is 
nontechnical) . 

My  students  tell  me  that  one  of  my  most  frequent  comments  to  them  is  “You’re 
working  too  hard.”  There  is  no  merit  in  spending  hours  producing  a  solution  to 
a  problem  that  can  be  done  by  a  better  method  in  a  few  minutes.  Please  ignore 
anyone  who  disparages  problem-solving  techniques  as  “tricks”  or  “shortcuts.”  You 
will  find  that  the  more  able  you  are  to  choose  effective  methods  of  solving  problems 
in  your  science  courses,  the  easier  it  will  be  for  you  to  master  new  material.  But 
this  means  practice,  practice,  practice!  The  only  way  to  learn  to  solve  problems  is 
to  solve  problems.  In  this  text,  you  will  find  both  drill  problems  and  harder,  more 
challenging  problems.  You  should  not  feel  satisfied  with  your  study  of  a  chapter 
until  you  can  solve  a  reasonable  number  of  these  problems. 

You  may  be  thinking  “I  don’t  really  need  to  study  this — my  computer  will  solve 
all  these  problems  for  me.”  Now  Computer  Algebra  Systems  are  wonderful — as  you 
know,  they  save  you  a  lot  of  laborious  calculation  and  quickly  plot  graphs  which 
clarify  a  problem.  But  a  computer  is  a  tool;  you  are  the  one  in  charge.  A  very 
perceptive  student  recently  said  to  me  (about  the  use  of  a  computer  for  a  special 
project):  “First  you  learn  how  to  do  it;  then  you  see  what  the  computer  can  do 
to  make  it  easier.”  Quite  so!  A  very  effective  way  to  study  a  new  technique  is  to 
do  some  simple  problems  by  hand  in  order  to  understand  the  process,  and  compare 
your  results  with  a  computer  solution.  You  will  then  be  better  able  to  use  the 
method  to  set  up  and  solve  similar  more  complicated  applied  problems  in  your 
advanced  courses.  So,  in  one  problem  set  after  another,  I  will  remind  you  that  the 
point  of  solving  some  simple  problems  is  not  to  get  an  answer  (which  a  computer 
will  easily  supply)  but  rather  to  learn  the  ideas  and  techniques  which  will  be  so 
useful  in  your  later  courses. 


M.  L.  B. 
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Infinite  Series,  Power  Series 


1.  THE  GEOMETRIC  SERIES 

As  a  simple  example  of  many  of  the  ideas  involved  in  series,  we  are  going  to  consider 
the  geometric  series.  You  may  recall  that  in  a  geometric  progression  we  multiply 
each  term  by  some  fixed  number  to  get  the  next  term.  For  example,  the  sequences 


(1.1a) 

(1.1b) 

(1.1c) 


2,  4,  8,  16,  32,..., 

i  2  4  _8_  16 
’  3’  9’  27’  81 5 ' ' ‘ ’ 

a,  ar,  ar2,  ar3 , . . . , 


are  geometric  progressions.  It  is  easy  to  think  of  examples  of  such  progressions. 
Suppose  the  number  of  bacteria  in  a  culture  doubles  every  hour.  Then  the  terms  of 
(1.1a)  represent  the  number  by  which  the  bacteria  population  has  been  multiplied 
after  1  hr,  2  hr,  and  so  on.  Or  suppose  a  bouncing  ball  rises  each  time  to  |  of 
the  height  of  the  previous  bounce.  Then  (1.1b)  would  represent  the  heights  of  the 
successive  bounces  in  yards  if  the  ball  is  originally  dropped  from  a  height  of  1  yd. 

In  our  first  example  it  is  clear  that  the  bacteria  population  would  increase  with¬ 
out  limit  as  time  went  on  (mathematically,  anyway;  that  is,  assuming  that  nothing 
like  lack  of  food  prevented  the  assumed  doubling  each  hour).  In  the  second  example, 
however,  the  height  of  bounce  of  the  ball  decreases  with  successive  bounces,  and  we 
might  ask  for  the  total  distance  the  ball  goes.  The  ball  falls  a  distance  1  yd,  rises 
a  distance  |  yd  and  falls  a  distance  |  yd,  rises  a  distance  |  yd  and  falls  a  distance 
|  yd,  and  so  on.  Thus  it  seems  reasonable  to  write  the  following  expression  for  the 
total  distance  the  ball  goes: 

(1.2)  l  +  2-  |+  2-  |  +  2-  ^H  =1  +  2(|  +  |  +  ^H - ), 

where  the  three  dots  mean  that  the  terms  continue  as  they  have  started  (each  one 
being  |  the  preceding  one),  and  there  is  never  a  last  term.  Let  us  consider  the 
expression  in  parentheses  in  (1.2),  namely 


1 
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This  expression  is  an  example  of  an  infinite  series ,  and  we  are  asked  to  find  its  sum. 
Not  all  infinite  series  have  sums;  you  can  see  that  the  series  formed  by  adding  the 
terms  in  (1.1a)  does  not  have  a  finite  sum.  However,  even  when  an  infinite  series 
does  have  a  finite  sum,  we  cannot  find  it  by  adding  the  terms  because  no  matter 
how  many  we  add  there  are  always  more.  Thus  we  must  find  another  method.  (It 
is  actually  deeper  than  this;  what  we  really  have  to  do  is  to  define  what  we  mean 
by  the  sum  of  the  series.) 

Let  us  first  find  the  sum  of  n  terms  in  (1.3).  The  formula  (Problem  2)  for  the 
sum  of  n  terms  of  the  geometric  progression  (1.1c)  is 


(1.4) 


_  a(l  -  r") 
1  —  r 


Using  (1.4)  in  (1.3),  we  find 


As  n  increases,  (|)n  decreases  and  approaches  zero.  Then  the  sum  of  n  terms 
approaches  2  as  n  increases,  and  we  say  that  the  sum  of  the  series  is  2.  (This  is 
really  a  definition:  The  sum  of  an  infinite  series  is  the  limit  of  the  sum  of  n  terms 
as  n  — >  oo.)  Then  from  (1.2),  the  total  distance  traveled  by  the  ball  is  1  +  2  •  2  =  5. 
This  is  an  answer  to  a  mathematical  problem.  A  physicist  might  well  object  that 
a  bounce  the  size  of  an  atom  is  nonsense!  However,  after  a  number  of  bounces,  the 
remaining  infinite  number  of  small  terms  contribute  very  little  to  the  final  answer 
(see  Problem  1).  Thus  it  makes  little  difference  (in  our  answer  for  the  total  distance) 
whether  we  insist  that  the  ball  rolls  after  a  certain  number  of  bounces  or  whether 
we  include  the  entire  series,  and  it  is  easier  to  find  the  sum  of  the  series  than  to  find 
the  sum  of,  say,  twenty  terms. 

Series  such  as  (1.3)  whose  terms  form  a  geometric  progression  are  called  geo¬ 
metric  series.  We  can  write  a  geometric  series  in  the  form 

(1.6)  a  +  ar  +  ar2-\ - h  ar71-1  H - . 

The  sum  of  the  geometric  series  (if  it  has  one)  is  by  definition 


(1.7) 

S  =  lim  Sn , 

n— ►  oo 

where  Sn  is  the  sum  of  n  terms  of  the  series.  By  following  the  method  of  the  exam¬ 
ple  above,  you  can  show  (Problem  2)  that  a  geometric  series  has  a  sum  if  and  only 
if  |r|  <  1,  and  in  this  case  the  sum  is 


(1.8) 


S  = 


a 


1  —  r 
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The  series  is  then  called  convergent. 

Here  is  an  interesting  use  of  (1.8).  We  can  write  0.3333- ••  =  ^  +  JL  _|_ 
1(1)0  +  ' ' '  =  liffio  =  5  by  (1.8).  Now  of  course  you  knew  that,  but  how  about 
0.785714285714- ••  ?  We  can  write  this  as  0.5  +  0.285714285714  •••  =  \  = 

\  +  999999  =  \  +  |  =  ■  (Note  that  any  repeating  decimal  is  equivalent  to  a  frac¬ 

tion  which  can  be  found  by  this  method.)  If  you  want  to  use  a  computer  to  do  the 
arithmetic,  be  sure  to  tell  it  to  give  you  an  exact  answer  or  it  may  hand  you  back 
the  decimal  you  started  with!  You  can  also  use  a  computer  to  sum  the  series,  but 
using  (1.8)  may  be  simpler.  (Also  see  Problem  14.) 

PROBLEMS,  SECTION  1 

1.  In  the  bouncing  ball  example  above,  find  the  height  of  the  tenth  rebound,  and  the 
distance  traveled  by  the  ball  after  it  touches  the  ground  the  tenth  time.  Compare 
this  distance  with  the  total  distance  traveled. 

2.  Derive  the  formula  (1.4)  for  the  sum  Sn  of  the  geometric  progression  S„  =  a  +  ar  + 
ar2  +  •  •  •  +  ar71-1.  Hint:  Multiply  S„  by  r  and  subtract  the  result  from  Sn',  then 
solve  for  Sn.  Show  that  the  geometric  series  (1.6)  converges  if  and  only  if  |r|  <  1; 
also  show  that  if  |r|  <  1,  the  sum  is  given  by  equation  (1.8). 

Use  equation  (1.8)  to  find  the  fractions  that  are  equivalent  to  the  following  repeating 
decimals: 

3.  0.55555---  4.  0.818181  ••  •  5.  0.583333--- 

6.  0.61111---  7.  0.185185- ••  8.  0.694444- •• 

9.  0.857142857142  •  •  •  10.  0.576923076923076923  •  •  • 

11.  0.678571428571428571  ■  ■  ■ 

12.  In  a  water  purification  process,  one-nth  of  the  impurity  is  removed  in  the  first  stage. 
In  each  succeeding  stage,  the  amount  of  impurity  removed  is  one- nth  of  that  removed 
in  the  preceding  stage.  Show  that  if  n  =  2,  the  water  can  be  made  as  pure  as  you 
like,  but  that  if  n  =  3,  at  least  one-half  of  the  impurity  will  remain  no  matter  how 
many  stages  are  used. 

13.  If  you  invest  a  dollar  at  “6%  interest  compounded  monthly,”  it  amounts  to  (1.005)n 
dollars  after  n  months.  If  you  invest  $10  at  the  beginning  of  each  month  for  10  years 
(120  months),  how  much  will  you  have  at  the  end  of  the  10  years? 

14.  A  computer  program  gives  the  result  1/6  for  the  sum  of  the  series  5)™-  Show 

that  this  series  is  divergent.  Do  you  see  what  happened?  Warning  hint:  Always 
consider  whether  an  answer  is  reasonable,  whether  it’s  a  computer  answer  or  your 
work  by  hand. 

15.  Connect  the  midpoints  of  the  sides  of  an  equilateral  triangle  to  form  4  smaller 
equilateral  triangles.  Leave  the  middle  small  triangle  blank,  but  for  each  of  the 
other  3  small  triangles,  draw  lines  connecting  the  midpoints  of  the  sides  to  create 
4  tiny  triangles.  Again  leave  each  middle  tiny  triangle  blank  and  draw  the  lines  to 
divide  the  others  into  4  parts.  Find  the  infinite  series  for  the  total  area  left  blank 
if  this  process  is  continued  indefinitely.  (Suggestion:  Let  the  area  of  the  original 
triangle  be  1;  then  the  area  of  the  first  blank  triangle  is  1/4.)  Sum  the  series  to  find 
the  total  area  left  blank.  Is  the  answer  what  you  expect?  Hint:  What  is  the  “area” 
of  a  straight  line?  (Comment:  You  have  constructed  a  fractal  called  the  Sierpihski 
gasket.  A  fractal  has  the  property  that  a  magnified  view  of  a  small  part  of  it  looks 
very  much  like  the  original.) 
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16.  Suppose  a  large  number  of  particles  are  bouncing  back  and  forth  between  x  =  0  and 
x  =  1,  except  that  at  each  endpoint  some  escape.  Let  r  be  the  fraction  reflected 
each  time;  then  (1  —  r)  is  the  fraction  escaping.  Suppose  the  particles  start  at  x  =  0 
heading  toward  x  =  1;  eventually  all  particles  will  escape.  Write  an  infinite  series 
for  the  fraction  which  escape  at  x  =  1  and  similarly  for  the  fraction  which  escape  at 
x  =  0.  Sum  both  the  series.  What  is  the  largest  fraction  of  the  particles  which  can 
escape  at  x  =  0?  (Remember  that  r  must  be  between  0  and  1.) 


2.  DEFINITIONS  AND  NOTATION 


There  are  many  other  infinite  series  besides  geometric  series.  Here  are  some  exam¬ 
ples: 


(2.1a) 

(2.1b) 

(2.1c) 


l2  +  22  +  32  +  42  +  •  •  ■ 
12  3  4 

2  +  22  +  23  +  24+' 

X2  X3  XA 

T  +  ‘ 


x  — 


2  3 

In  general,  an  infinite  series  means  an  expression  of  the  form 


(2.2) 


Ol  +  0,2  +  O3  +  •  •  ’  +  CLn  +  •  •  •  , 


where  the  an’s  (one  for  each  positive  integer  n)  are  numbers  or  functions  given  by 
some  formula  or  rule.  The  three  dots  in  each  case  mean  that  the  series  never  ends. 
The  terms  continue  according  to  the  law  of  formation,  which  is  supposed  to  be 
evident  to  you  by  the  time  you  reach  the  three  dots.  If  there  is  apt  to  be  doubt 
about  how  the  terms  are  formed,  a  general  or  nth  term  is  written  like  this: 


(2.3a) 

(2.3b) 


l2  +  22  +  32  H - h  n2  H - , 


(-l)"-1® 
(n  —  1)! 


(The  quantity  n\,  read  n  factorial,  means,  for  integral  n,  the  product  of  all  integers 
from  1  to  n;  for  example,  5!  =  5-  4-  3-  2-  l  =  120.  The  quantity  0!  is  defined  to  be 
1.)  In  (2.3a),  it  is  easy  to  see  without  the  general  term  that  each  term  is  just  the 
square  of  the  number  of  the  term,  that  is,  n2.  However,  in  (2.3b),  if  the  formula  for 
the  general  term  were  missing,  you  could  probably  make  several  reasonable  guesses 
for  the  next  term.  To  be  sure  of  the  law  of  formation,  we  must  either  know  a  good 
many  more  terms  or  have  the  formula  for  the  general  term.  You  should  verify  that 
the  fourth  term  in  (2.3b)  is  —  xA /Q. 

We  can  also  write  series  in  a  shorter  abbreviated  form  using  a  summation  sign 
JO  followed  by  the  formula  for  the  ?rth  term.  For  example,  (2.3a)  would  be  written 


(2.4)  l2  +  22  +  32  +  42  +  •  •  •  = 

n—  1 


(read  “the  sum  of  n2  from  n  =  1  to  00”).  The  series  (2.3b)  would  be  written 


+T-1T 


OO 

E 

n=  1 


(n-  1)! 


Section  2 


Definitions  and  Notation  5 


For  printing  convenience,  sums  like  (2.4)  are  often  written  n2 ■ 

In  Section  1,  we  have  mentioned  both  sequences  and  series.  The  lists  in  (1.1) 
are  sequences;  a  sequence  is  simply  a  set  of  quantities,  one  for  each  n.  A  series  is 
an  indicated  sum  of  such  quantities,  as  in  (1.3)  or  (1.6).  We  will  be  interested  in 
various  sequences  related  to  a  series:  for  example,  the  sequence  an  of  terms  of  the 
series,  the  sequence  Sn  of  partial  sums  [see  (1.5)  and  (4.5)],  the  sequence  Rn  [see 
(4.7)],  and  the  sequence  pn  [see  (6.2)].  In  all  these  examples,  we  want  to  find  the 
limit  of  a  sequence  as  n  — >  oo  (if  the  sequence  has  a  limit) .  Although  limits  can  be 
found  by  computer,  many  simple  limits  can  be  done  faster  by  hand. 


Example  1.  Find  the  limit  as  n  — >  oo  of  the  sequence 

(2 n  -  l)4  +  yT  +  9ns 
1  —  n3  —  7  n4 

We  divide  numerator  and  denominator  by  n4  and  take  the  limit  as  n  — >  oo.  Then 
all  terms  go  to  zero  except 

24  +  vT)  19 
-7  “  _T' 


Example  2.  Find  liirpi^oc  By  L’Hopital’s  rule  (see  Section  15) 


,im  Ura  iZn  =  o. 

n—>  oo  77,  n — >-oo  1 


Comment:  Strictly  speaking,  we  can’t  differentiate  a  function  of  n  if  n  is  an  integer, 
but  we  can  consider  f{x)  =  (In  x)  /x,  and  the  limit  of  the  sequence  is  the  same  as 
the  limit  of  f(x). 


Example  3.  Find  linin^oo  (i)1/".  We  first  find 


In 


l/r 


= - In  n. 

n 


Then  by  Example  2,  the  limit  of  (In  n)/n  is  0,  so  the  original  limit  is  e°  =  1. 


PROBLEMS,  SECTION  2 


In  the  following  problems,  find  the  limit  of  the  given  sequence  as  n  — >  oo. 


1. 

n2  +  5n3 

2. 

(n+1)2 

3. 

(-i)"v^TT 

2  n3  +  3V  4  +  n6 

\/3  +  5n2  +  4n4 

n 

4. 

2n 

10" 

6. 

nn 

— 

5. 

— 

— 

n2 

n! 

n\ 

7. 

/-i  ,  2\l/lnn 

(1  +  n  )  ' 

8. 

(n!)2 
(2  n)! 

9. 

nsin(l/n) 
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3.  APPLICATIONS  OF  SERIES 

In  the  example  of  the  bouncing  ball  in  Section  1,  we  saw  that  it  is  possible  for  the 
sum  of  an  infinite  series  to  be  nearly  the  same  as  the  sum  of  a  fairly  small  number  of 
terms  at  the  beginning  of  the  series  (also  see  Problem  1.1).  Many  applied  problems 
cannot  be  solved  exactly,  but  we  may  be  able  to  find  an  answer  in  terms  of  an 
infinite  series,  and  then  use  only  as  many  terms  as  necessary  to  obtain  the  needed 
accuracy.  We  shall  see  many  examples  of  this  both  in  this  chapter  and  in  later 
chapters.  Differential  equations  (see  Chapters  8  and  12)  and  partial  differential 
equations  (see  Chapter  13)  are  frequently  solved  by  using  series.  We  will  learn 
how  to  find  series  that  represent  functions;  often  a  complicated  function  can  be 
approximated  by  a  few  terms  of  its  series  (see  Section  15). 

But  there  is  more  to  the  subject  of  infinite  series  than  making  approximations. 
We  will  see  (Chapter  2,  Section  8)  how  we  can  use  power  series  (that  is,  series 
whose  terms  are  powers  of  x)  to  give  meaning  to  functions  of  complex  numbers, 
and  (Chapter  3,  Section  6)  how  to  define  a  function  of  a  matrix  using  the  power 
series  of  the  function.  Also  power  series  are  just  a  first  example  of  infinite  series.  In 
Chapter  7  we  will  learn  about  Fourier  series  (whose  terms  are  sines  and  cosines).  In 
Chapter  12,  we  will  use  power  series  to  solve  differential  equations,  and  in  Chapters 
12  and  13,  we  will  discuss  other  series  such  as  Legendre  and  Bessel.  Finally,  in 
Chapter  14,  we  will  discover  how  a  study  of  power  series  clarifies  our  understanding 
of  the  mathematical  functions  we  use  in  applications. 


4.  CONVERGENT  AND  DIVERGENT  SERIES 

We  have  been  talking  about  series  which  have  a  finite  sum.  We  have  also  seen  that 
there  are  series  which  do  not  have  finite  sums,  for  example  (2.1a).  If  a  series  has  a 
finite  sum,  it  is  called  convergent.  Otherwise  it  is  called  divergent.  It  is  important 
to  know  whether  a  series  is  convergent  or  divergent.  Some  weird  things  can  happen 
if  you  try  to  apply  ordinary  algebra  to  a  divergent  series.  Suppose  we  try  it  with 
the  following  series: 

(4.1)  5  =  l  +  2  +  4  +  8  +  16+---. 

Then, 


25  =  2  +  4  +  8  + 16-| - =  5  —  1, 

5  =  — 1. 


This  is  obvious  nonsense,  and  you  may  laugh  at  the  idea  of  trying  to  operate  with 
such  a  violently  divergent  series  as  (4.1).  But  the  same  sort  of  thing  can  happen  in 
more  concealed  fashion,  and  has  happened  and  given  wrong  answers  to  people  who 
were  not  careful  enough  about  the  way  they  used  infinite  series.  At  this  point  you 
probably  would  not  recognize  that  the  series 


(4.2) 


is  divergent,  but  it  is;  and  the  series 


(4.3) 
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is  convergent  as  it  stands,  but  can  be  made  to  have  any  sum  you  like  by  combining 
the  terms  in  a  different  order!  (See  Section  8.)  You  can  see  from  these  examples 
how  essential  it  is  to  know  whether  a  series  converges,  and  also  to  know  how  to 
apply  algebra  to  series  correctly.  There  are  even  cases  in  which  some  divergent 
series  can  be  used  (see  Chapter  11),  but  in  this  chapter  we  shall  be  concerned  with 
convergent  series. 

Before  we  consider  some  tests  for  convergence,  let  us  repeat  the  definition  of 
convergence  more  carefully.  Let  us  call  the  terms  of  the  series  an  so  that  the  series 
is 

(4.4)  oi  +  +  03  +  04  +  ■  ■  ■  +  an  +  •  •  •  . 

Remember  that  the  three  dots  mean  that  there  is  never  a  last  term;  the  series  goes 
on  without  end.  Now  consider  the  sums  Sn  that  we  obtain  by  adding  more  and 
more  terms  of  the  series.  We  define 

51  =  ai, 

52  =  a  1  +  o2, 

(4.5)  S3  =  ai  +  a2  +  a3, 


Sn  —  0,1  +  a2  +  0,3  +  ■  •  '  +  On- 

Each  Sn  is  called  a  partial  sum,-,  it  is  the  sum  of  the  first  n  terms  of  the  series.  We 
had  an  example  of  this  for  a  geometric  progression  in  (1.4).  The  letter  n  can  be 
any  integer;  for  each  n,  Sn  stops  with  the  nth  term.  (Since  Sn  is  not  an  infinite 
series,  there  is  no  question  of  convergence  for  it.)  As  n  increases,  the  partial  sums 
may  increase  without  any  limit  as  in  the  series  (2.1a).  They  may  oscillate  as  in  the 
series  1  —  2  +  3  —  4+5— •  ••  (which  has  partial  sums  1,  —  1,  2,  —  2, 3,  •  •  • )  or  they  may 
have  some  more  complicated  behavior.  One  possibility  is  that  the  Sn's  may,  after 
a  while,  not  change  very  much  any  more;  the  an’ s  may  become  very  small,  and  the 
Sn  s  come  closer  and  closer  to  some  value  S.  We  are  particularly  interested  in  this 
case  in  which  the  Sn  s  approach  a  limiting  value,  say 


(4.6)  lim  Sn  =  S. 

n— ►  00 

(It  is  understood  that  S'  is  a  finite  number.)  If  this  happens,  we  make  the  following 
definitions. 

a.  If  the  partial  sums  Sn  of  an  infinite  series  tend  to  a  limit  S,  the  series  is  called 
convergent.  Otherwise  it  is  called  divergent. 

b.  The  limiting  value  S  is  called  the  sum  of  the  series. 

c.  The  difference  Rn  =  S  —  Sn  is  called  the  remainder  (or  the  remainder  after  n 
terms).  From  (4.6),  we  see  that 

(4.7)  lim  Rn  =  lim  (S  —  Sn)  =  S  —  S  =  0. 
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Example  1.  We  have  already  (Section  1)  found  Sn  and  S  for  a  geometric  series.  From  (1.8) 
and  (1.4),  we  have  for  a  geometric  series,  Rn  =  yzy  which  — >  0  as  n  — >  oo  if  |r|  <  1. 


Example  2.  By  partial  fractions,  we  can  write  n?_1  =  —  +j-.  Let’s  write  out  a 

number  of  terms  of  the  series 

_L _ i_\  n  1 

n  —  1  n+l)  yn  n  + 2 

11111111111 

"3+2_4+3_5+4~6+5~7+6~8+ 
111  111 

n  —  2  n  n—  1  n+l  n  n  +  2 


Note  the  cancellation  of  terms;  this  kind  of  series  is  called  a  telescoping  series. 
Satisfy  yourself  that  when  we  have  added  the  nth  term  (L  —  +_).  the  only  terms 
which  have  not  cancelled  are  1.  1  — and  — so  we  have 

’  2  ’  n+l  ’  n+2 ’ 


Sn 


3  1  1 

2  n+l  n  +  2’ 


Rn 


1 

n+l 


1 

n  +  2 


Example  3.  Another  interesting  series  is 

OO  /  \  OO 

ln  (^Ti)  =  [ln n  - ln(n  + 1)] 

=  In  1  —  In  2  +  In  2  —  In  3  +  In  3  —  In  4  +  •  •  •  +  In  n  —  ln(n  +  1)  •  •  •  . 

Then  Sn  =  —  ln(n  +  1)  which  — »  — oo  as  n  — »  oo,  so  the  series  diverges.  However, 
note  that  an  =  ln  +-j-  — >  ln  1  =  0  as  n  — *  oo,  so  we  see  that  even  if  the  terms  tend 
to  zero,  a  series  may  diverge. 


PROBLEMS,  SECTION  4 


For  the  following  series,  write  formulas  for  the  sequences  a„, 
limits  of  the  sequences  as  n  — >  oo  (if  the  limits  exist). 


1. 


3. 


1  1 
8  +  16 


oo  1 

2-  Eff 


Sn 


and  R„,  and  find  the 


OO 

4.  ^+-7lln3  Hint:  What  is  e_ln3? 

l 


5.  ^e2"in»mh/3)  Hint:  Simplify  this, 

o 


6. 


E 

i 


i 

n(n  +  1) 


Hint: 


1 

n(n  +  1) 


1  1 

n  n+l 


7. 


_3 _ 5_  _7 _ 9_ 

1-2  2-3  +  3-  4  4-5  + 
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5.  TESTING  SERIES  FOR  CONVERGENCE;  THE  PRELIMINARY  TEST 

It  is  not  in  general  possible  to  write  a  simple  formula  for  Sn  and  find  its  limit  as 
n  — >  oo  (as  we  have  done  for  a  few  special  series),  so  we  need  some  other  way  to  find 
out  whether  a  given  series  converges.  Here  we  shall  consider  a  few  simple  tests  for 
convergence.  These  tests  will  illustrate  some  of  the  ideas  involved  in  testing  series 
for  convergence  and  will  work  for  a  good  many,  but  not  all,  cases.  There  are  more 
complicated  tests  which  you  can  find  in  other  books.  In  some  cases  it  may  be  quite 
a  difficult  mathematical  problem  to  investigate  the  convergence  of  a  complicated 
series.  However,  for  our  purposes  the  simple  tests  we  give  here  will  be  sufficient. 

First  we  discuss  a  useful  preliminary  test.  In  most  cases  you  should  apply  this 
to  a  series  before  you  use  other  tests. 


Preliminary  test.  If  the  terms  of  an  infinite  series  do  not  tend  to  zero  (that  is, 
if  limn — .qo  an  ^  0),  the  series  diverges.  If  linin^oo  an  =  0,  we  must  test  further. 


This  is  not  a  test  for  convergence;  what  it  does  is  to  weed  out  some  very  badly 
divergent  series  which  you  then  do  not  have  to  spend  time  testing  by  more  com¬ 
plicated  methods.  Note  carefully:  The  preliminary  test  can  never  tell  you  that  a 
series  converges.  It  does  not  say  that  series  converge  if  an  — >  0  and,  in  fact,  often 
they  do  not.  A  simple  example  is  the  harmonic  series  (4.2);  the  nth  term  certainly 
tends  to  zero,  but  we  shall  soon  show  that  the  series  Vn  ^  divergent.  On 

the  other  hand,  in  the  series 

12  3  4 

2  +  3  +  4+5  +  "' 

the  terms  are  tending  to  1,  so  by  the  preliminary  test,  this  series  diverges  and  no 
further  testing  is  needed. 


PROBLEMS,  SECTION  5 

Use  the  preliminary  test  to  decide  whether  the  following  series  are  divergent  or  require 
further  testing.  Careful:  Do  not  say  that  a  series  is  convergent;  the  preliminary  test  cannot 
decide  this. 


1. 

1  4  9 

2  ~~  5  +  TO  ~~ 

16 

17 

25 

+  26 

36 

37+"' 

2. 

V2  +  f +  • 

3. 

00  l  O 

71  +  0 

4. 

00  (  1  \n~>2 
y-  (-1)  n 

n2  +  10 n 

n= 1 

+  (n  +  l)2 

n= 1 

5. 

n! 

6. 

oo  , 

Y^  n\ 

n!  +  1 

n= 1 

+  (n  +  1)! 

n= 1 

7. 

+  (— l)"n 

8. 

oo  , 

inn 
^  n 

n= 1 

,  Vn3  +  1 

n= 1 

9. 

00  on 

y 

'  2n  +  3n 

n= 1 

10. 

n= 2  ^ 

11. 

Using  (4.6),  give  t 

i  proof  of  the  preliminary  test.  Hint:  S, 

Sn  —  l  —  a 


n  • 
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6.  CONVERGENCE  TESTS  FOR  SERIES  OF  POSITIVE  TERMS; 

ABSOLUTE  CONVERGENCE 

We  are  now  going  to  consider  four  useful  tests  for  series  whose  terms  are  all  positive. 
If  some  of  the  terms  of  a  series  are  negative,  we  may  still  want  to  consider  the  related 
series  which  we  get  by  making  all  the  terms  positive;  that  is,  we  may  consider  the 
series  whose  terms  are  the  absolute  values  of  the  terms  of  our  original  series.  If 
this  new  series  converges,  we  call  the  original  series  absolutely  convergent.  It  can  be 
proved  that  if  a  series  converges  absolutely,  then  it  converges  (Problem  7.9).  This 
means  that  if  the  series  of  absolute  values  converges,  the  series  is  still  convergent 
when  you  put  back  the  original  minus  signs.  (The  sum  is  different,  of  course.)  The 
following  four  tests  may  be  used,  then,  either  for  testing  series  of  positive  terms,  or 
for  testing  any  series  for  absolute  convergence. 

A.  The  Comparison  Test 

This  test  has  two  parts,  (a)  and  (b). 


(a)  Let 


TO  1  +  TO2  +  TO-3  +  TO4  +  •  •  • 


be  a  series  of  positive  terms  which  you  know  converges.  Then  the  series  you  are 
testing,  namely 

(Jl  +  (12  +  tt3  +  (J4  H - 

is  absolutely  convergent  if  |an|  <  mn  (that  is,  if  the  absolute  value  of  each  term  of 
the  a  series  is  no  larger  than  the  corresponding  term  of  the  to  series)  for  all  n  from 
some  point  on,  say  after  the  third  term  (or  the  millionth  term).  See  the  example 
and  discussion  below. 


(b)  Let 


di  +  c?2  +  cfe  +  C?4  +  •  •  • 


be  a  series  of  positive  terms  which  you  know  diverges.  Then  the  series 


|ai|  +  |a.2 1  +  |^3 1  +  I&4J  +  •  •  • 


diverges  if  |an|  >  dn  for  all  n  from  some  point  on. 

Warning:  Note  carefully  that  neither  |an|  >  mn  nor  |a„|  <  dn  tells  us  anything. 
That  is,  if  a  series  has  terms  larger  than  those  of  a  convergent  series,  it  may  still 
converge  or  it  may  diverge — we  must  test  it  further.  Similarly,  if  a  series  has  terms 
smaller  than  those  of  a  divergent  series,  it  may  still  diverge,  or  it  may  converge. 


°°  1  111 
Example.  Test  £  -  =  1  +  -  +  -  +  - 


for  convergence. 


As  a  comparison  series,  we  choose  the  geometric  series 


v--  1  111  1 

\  -  =  -  4 - 1 - 1 - 


2 15 

n—  1 


8  16 


Notice  that  we  do  not  care  about  the  first  few  terms  (or,  in  fact,  any  finite  number 
of  terms)  in  a  series,  because  they  can  affect  the  sum  of  the  series  but  not  whether 


Section  6 


Convergence  Tests  for  Series  of  Positive  Terms;  Absolute  Convergence  11 


it  converges.  When  we  ask  whether  a  series  converges  or  not,  we  are  asking  what 
happens  as  we  add  more  and  more  terms  for  larger  and  larger  n.  Does  the  sum 
increase  indefinitely,  or  does  it  approach  a  limit?  What  the  first  five  or  hundred  or 
million  terms  are  has  no  effect  on  whether  the  sum  eventually  increases  indefinitely 
or  approaches  a  limit.  Consequently  we  frequently  ignore  some  of  the  early  terms 
in  testing  series  for  convergence. 

In  our  example,  the  terms  of  Vn-  are  smaller  than  the  corresponding 

terms  of  Y^Li  l/2ra  for  all  n  >  3  (Problem  1).  We  know  that  the  geometric  series 
converges  because  its  ratio  is  Therefore  Vn-  converges  also. 


PROBLEMS,  SECTION  6 


1.  Show  that  n\  >  2™  for  all  n  >  3.  Hint:  Write  out  a  few  terms;  then  consider  what 
you  multiply  by  to  go  from,  say,  5!  to  6!  and  from  25  to  26. 

2.  Prove  that  the  harmonic  series  l/n  is  divergent  by  comparing  it  with  the 

series 


1+2  + 


1111 

8  +  8  +  8  +  8 


+  (  8  terms  each  equal  to 


16 


,  .  ,  .  ,1111 

which  is  13 - 1 - 1 - 1 - 1-  ■ 

1  o  1  o  1  9  1  o  1 


3.  Prove  the  convergence  of  l/u2  by  grouping  terms  somewhat  as  in  Problem  2. 

4.  Use  the  comparison  test  to  prove  the  convergence  of  the  following  series: 

OO  CO 

(a)  V - - -  (b)  V  — — 

'  '  ^  2"  +  3n  ^  n  2n 

n= 1  n=l 

5.  Test  the  following  series  for  convergence  using  the  comparison  test. 

CO  ^  OO  ^ 

(a)  — —  Hint:  Which  is  larger,  n  or  Jn  ?  (b)  - - 

\fn  Inn 

n= 1  v  n= 2 

6.  There  are  9  one-digit  numbers  (1  to  9),  90  two-digit  numbers  (10  to  99).  How  many 
three-digit,  four-digit,  etc.,  numbers  are  there?  The  first  9  terms  of  the  harmonic 
series  l  +  §  +  §  +  '''  +  §  are  all  greater  than  A ;  similarly  consider  the  next  90 
terms,  and  so  on.  Thus  prove  the  divergence  of  the  harmonic  series  by  comparison 
with  the  series 


[A  -f  i  -f  . . .  (9  terms  each  =  y^)]  +  [  90  terms  each  =  y^]  +  •  •  • 

_  9  |  90  |  _  9  |  9  | 

—  10  '  100  '  '  ’  '  —  10  '  10  '  *  *  *  * 

The  comparison  test  is  really  the  basic  test  from  which  other  tests  are  derived. 
It  is  probably  the  most  useful  test  of  all  for  the  experienced  mathematician  but  it 
is  often  hard  to  think  of  a  satisfactory  m  series  until  you  have  had  a  good  deal  of 
experience  with  series.  Consequently,  you  will  probably  not  use  it  as  often  as  the 
next  three  tests. 


B.  The  Integral  Test 

We  can  use  this  test  when  the  terms  of  the  series  are  positive  and  not  increasing, 
that  is,  when  an_|_i  <  an.  (Again  remember  that  we  can  ignore  any  finite  number  of 
terms  of  the  series;  thus  the  test  can  still  be  used  even  if  the  condition  an+i  <  an 
does  not  hold  for  a  finite  number  of  terms.)  To  apply  the  test  we  think  of  an  as  a 
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function  of  the  variable  n,  and,  forgetting  our  previous  meaning  of  n,  we  allow  it  to 
take  all  values,  not  just  integral  ones.  The  test  states  that: 


If  0  <  an+i  <  an  for  n  >  N,  then  an  converges  if  f°°  an  dn  is  finite  and 
diverges  if  the  integral  is  infinite.  (The  integral  is  to  be  evaluated  only  at  the 
upper  limit;  no  lower  limit  is  needed.) 


To  understand  this  test,  imagine  a  graph  sketched  of  an  as  a  function  of  n.  For 
example,  in  testing  the  harmonic  series  l/n>  we  consider  the  graph  of  the 

function  y  =  1/n  (similar  to  Figures  6.1  and  6.2)  letting  n  have  all  values,  not  just 
integral  ones.  Then  the  values  of  y  on  the  graph  at  n  =  1,2,3,-  •  - ,  are  the  terms 
of  the  series.  In  Figures  6.1  and  6.2,  the  areas  of  the  rectangles  are  just  the  terms 
of  the  series.  Notice  that  in  Figure  6.1  the  top  edge  of  each  rectangle  is  above 
the  curve,  so  that  the  area  of  the  rectangles  is  greater  than  the  corresponding  area 
under  the  curve.  On  the  other  hand,  in  Figure  6.2  the  rectangles  lie  below  the 
curve,  so  their  area  is  less  than  the  corresponding  area  under  the  curve.  Now  the 
areas  of  the  rectangles  are  just  the  terms  of  the  series,  and  the  area  under  the  curve 
is  an  integral  of  y  dn  or  an  dn.  The  upper  limit  on  the  integrals  is  oo  and  the  lower 
limit  could  be  made  to  correspond  to  any  term  of  the  series  we  wanted  to  start 
with.  For  example  (see  Figure  6.1),  f3  an  dn  is  less  than  the  sum  of  the  series  from 
03  on,  but  (see  Figure  6.2)  greater  than  the  sum  of  the  series  from  0.4  on.  If  the 
integral  is  finite,  then  the  sum  of  the  series  from  04  on  is  finite,  that  is,  the  series 
converges.  Note  again  that  the  terms  at  the  beginning  of  a  series  have  nothing  to 
do  with  convergence.  On  the  other  hand,  if  the  integral  is  infinite,  then  the  sum  of 
the  series  from  03  on  is  infinite  and  the  series  diverges.  Since  the  beginning  terms 
are  of  no  interest,  you  should  simply  evaluate  J  an  dn.  (Also  see  Problem  16.) 


Example.  Test  for  convergence  the  harmonic  series 


(6.1) 


Using  the  integral  test,  we  evaluate 


dn  =  In  n 


OO 


=  OO. 


(We  use  the  symbol  In  to  mean  a  natural  logarithm,  that  is,  a  logarithm  to  the  base 
e.)  Since  the  integral  is  infinite,  the  series  diverges. 
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PROBLEMS,  SECTION  6 

Use  the  integral  test  to  find  whether  the  following  series  converge  or  diverge.  Hint  and 
warning:  Do  not  use  lower  limits  on  your  integrals  (see  Problem  16). 


7. 

10. 

13. 


15. 


E 

n= 2 
oo 

E 

n= 1 
oo 

E 


i 


n  In  n 


e2n  +  9 


n3  +  1 


n2  +  4 


«•  E 

71=1 

oo  1 

n(l  +  In  n)3/2 

oo 

i4.  y^E= 

\/n2  +  9 


9-  E 


n2  —  4 


12-  E 


?  (™2  +  !)2 


Use  the  integral  test  to  prove  the  following  so-called  p-series  test.  The  series 


1, 

1. 

Caution:  Do  p  =  1  separately. 

16.  In  testing  JC 1/n 2  for  convergence,  a  student  evaluates  JQ°°  n~2dn  =  —  n-1|o°  = 
0  +  oo  =  oo  and  concludes  (erroneously)  that  the  series  diverges.  What  is  wrong? 
Hint:  Consider  the  area  under  the  curve  in  a  diagram  such  as  Figure  6.1  or  6.2. 
This  example  shows  the  danger  of  using  a  lower  limit  in  the  integral  test. 

17.  Use  the  integral  test  to  show  that  e_n  converges.  Hint:  Although  you  cannot 

evaluate  the  integral,  you  can  show  that  it  is  finite  (which  is  all  that  is  necessary) 
by  comparing  it  with  f°°  e~ndn. 


oo  ,  ( 

v — "v  1  .  I  co 

y  rC  1S  jdi 

71  =  1  L 


j  convergent  if  p  > 
divergent  if  p  < 


C.  The  Ratio  Test 

The  integral  test  depends  on  your  being  able  to  integrate  andn ;  this  is  not  always 
easy!  We  consider  another  test  which  will  handle  many  cases  in  which  we  cannot 
evaluate  the  integral.  Recall  that  in  the  geometric  series  each  term  could  be  obtained 
by  multiplying  the  one  before  it  by  the  ratio  r,  that  is,  an+i  =  ran  or  an+i/ara  =  r. 
For  other  series  the  ratio  an+i/an  is  not  constant  but  depends  on  n;  let  us  call 
the  absolute  value  of  this  ratio  pn.  Let  us  also  find  the  limit  (if  there  is  one)  of 
the  sequence  pn  as  n  — >  oo  and  call  this  limit  p.  Thus  we  define  pn  and  p  by  the 
equations 


(6.2) 


Pn  — 


^n+1 


p  =  lim  pn. 

n— ►  oo 


If  you  recall  that  a  geometric  series  converges  if  |r|  <  1,  it  may  seem  plausible  that 
a  series  with  p  <  1  should  converge  and  this  is  true.  This  statement  can  be  proved 
(Problem  30)  by  comparing  the  series  to  be  tested  with  a  geometric  series.  Like  a  ge¬ 
ometric  series  with  |r|  >  1,  a  series  with  p  >  1  also  diverges  (Problem  30).  However, 
if  p  =  1,  the  ratio  test  does  not  tell  us  anything;  some  series  with  p  =  1  converge 
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and  some  diverge,  so  we  must  find  another  test  (say  one  of  the  two  preceding  tests). 
To  summarize  the  ratio  test: 


(6.3) 

If  i 

[pel, 

1  p  =  1, 

the  series  converges; 
use  a  different  test; 

[p  >  !, 

the  series  diverges. 

Example  1.  Test  for  convergence  the  series 


1  + 


1 

2! 


1 

3! 


Using  (6.2),  we  have 


1  .  1 

"  (n  + 1)!  n! 

n\  n(n  —  1)  •  •  •  3  •  2  •  1  1 

(n  +  1)!  (n  +  l)(n)(n  —  1)  •  •  •  3  •  2  •  1  n  +  1’ 

p  =  lim  pn  =  lim  - -  =  0. 

n— ►  oo  n — >-oo  77,  — |—  1 


Since  p  <  1,  the  series  converges. 


Example  2.  Test  for  convergence  the  harmonic  series 


1  + 


1 

2 


1 

3 


We  find 


Pn  — 

P  = 


1 


1 

n+1  n 
lim 

n — >oo  n+1 


n 

n  +  1  ’ 

=  lim  - y 

n — >oo  1  +  — 
n 


=  1. 


Here  the  test  tells  us  nothing  and  we  must  use  some  different  test.  A  word  of 
warning  from  this  example:  Notice  that  pn  =  n/[n  +  1)  is  always  less  than  1.  Be 
careful  not  to  confuse  this  ratio  with  p  and  conclude  incorrectly  that  this  series 
converges.  (It  is  actually  divergent  as  we  proved  by  the  integral  test.)  Remember 
that  p  is  not  the  same  as  the  ratio  pn  =  \an+i/an\,  but  is  the  limit  of  this  ratio  as 
n  — +  oo. 


PROBLEMS,  SECTION  6 

Use  the  ratio  test  to  find  whether  the  following  series  converge  or  diverge: 


18. 


n=l 


19. 


T  — 

/  -y  2 2n 

n=0 


20. 


OO 


E 

n=0 


n\ 

(2n)i 
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21. 

5"(n!)2 
^  (2  n)\ 

71=0  V  ’ 

22. 

^  10" 

^  (n!)2 

71=1  V  ’ 

23. 

OO  . 

n\ 

100™ 

71=1 

24. 

00  Q2n 

V  — 

/  ^  23n 

71=0 

25. 

o°  n 

F  — 

26. 

^  (n!)3e3" 
^ (3n)! 

71=0  v  ’ 

27. 

^  100" 

2 E  n200 

71=0 

28. 

^  n!(2n)! 
(3 n)\ 

71=0  V  1 

29. 

Y'  VftnV- 

2-^  n! 

71=0 

30. 

Prove  the  ratio  test. 

Hint: 

If  |&n+l  / Cln  | 

-*/)<!,  take 

a  so  that  p 

Then  |an+i/an|  <  a  if  n  is  large,  say  n  >  N.  This  means  that  we  have  |ajv+i|  < 
cr|ajv|,  | a jv+2 |  <  a|ajv+i|  <  a2|ajv|,  and  so  on.  Compare  with  the  geometric  series 

OO 

71  =  1 

Also  prove  that  a  series  with  p  >  1  diverges.  Hint:  Take  p  >  a  >  1,  and  use  the 
preliminary  test. 

D.  A  Special  Comparison  Test 

This  test  has  two  parts:  (a)  a  convergence  test,  and  (b)  a  divergence  test.  (See 
Problem  37.) 


(a)  If  J2n=i  b 71  a  convergent  series  of  positive  terms  and  an  >  0  and  an/bn 
tends  to  a  (finite)  limit,  then  Jin'Ll  an  converges. 

(b)  If  dn  is  a  divergent  series  of  positive  terms  and  an  >  0  and  an/dn 

tends  to  a  limit  greater  than  0  (or  tends  to  +oo),  then  an  diverges. 


There  are  really  two  steps  in  using  either  of  these  tests,  namely,  to  decide  on  a 
comparison  series,  and  then  to  compute  the  required  limit.  The  first  part  is  the  most 
important;  given  a  good  comparison  series  it  is  a  routine  process  to  find  the  needed 
limit.  The  method  of  finding  the  comparison  series  is  best  shown  by  examples. 

Example  1.  Test  for  convergence 


OO 


E 


V2 n2  -  5n  +  1 
4n3  -  7 n2  +  2  ' 


Remember  that  whether  a  series  converges  or  diverges  depends  on  what  the 
terms  are  as  n  becomes  larger  and  larger.  We  are  interested  in  the  nth  term  as 
n  — >  oo.  Think  of  n  =  1010  or  lO100,  say;  a  little  calculation  should  convince  you 
that  as  n  increases,  2 n2  —  5n  +  1  is  2 n2  to  quite  high  accuracy.  Similarly,  the 
denominator  in  our  example  is  nearly  4n3  for  large  n.  By  Section  9,  fact  1,  we  see 
that  the  factor  \/2/4  in  every  term  does  not  affect  convergence.  So  we  consider  as 
a  comparison  series  just 


OO 


E 


n° 


'  n 2 

71= 3 
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which  we  recognize  (say  by  integral  test)  as  a  convergent  series.  Hence  we  use  test 
(a)  to  try  to  show  that  the  given  series  converges.  We  have: 


lim  — 

n— >oo  bn 


=  lim 

n— »■  oo 


V2 n2  -  5n  +  1  .  1 
4n3  —  7  n2  +  2  n2 


=  lim 

n—>  oo 


n2\/2ri2  —  5n  +  1 
4n3  -  7n2  +  2 


=  lim 

n—>oo 


V2 

4  ' 


Since  this  is  a  finite  limit,  the  given  series  converges.  (With  practice,  you  won’t 
need  to  do  all  this  algebra!  You  should  be  able  to  look  at  the  original  problem  and 
see  that,  for  large  n,  the  terms  are  essentially  1/n2,  so  the  series  converges.) 


Example  2.  Test  for  convergence 


OO 


E 


3"  -  n3 
n5  —  5  n2 


Here  we  must  first  decide  which  is  the  important  term  as  n  — *  oo;  is  it  3”  or 
n3?  We  can  find  out  by  comparing  their  logarithms  since  In  N  and  N  increase  or 
decrease  together.  We  have  ln3ra  =  ?rln3,  and  Inn3  =  3 Inn.  Now  ln?r  is  much 
smaller  than  n,  so  for  large  n  we  have  ?rln3  >  3 Inn,  and  3”  >  n3.  (You  might  like 
to  compute  1003  =  106,  and  3100  >  5  x  104'.)  The  denominator  of  the  given  series 
is  approximately  n5.  Thus  the  comparison  series  is  Y^=  2  3"/n5.  It  is  easy  to  prove 
this  divergent  by  the  ratio  test.  Now  by  test  (b) 


lim 

n—>  00 


/  3n  -  n3 
\n5  —  5n2 


lim 


1  - 


1-4 


=  1 


which  is  greater  than  zero,  so  the  given  series  diverges. 


PROBLEMS,  SECTION  6 

Use  the  special  comparison  test  to  find  whether  the  following  series  converge  or  diverge. 


31.  £ 


(2n  +  l)(3w-5) 
Vn2  -  73 


32.  £ 


i(n  +  1) 


rj  (n  +  2)2(n  +  3) 


33.  V  — 

/  J  On  _  , 


35.  y. 


n= 3 


(n  —  Inn)2 
5n4  —  3n2  +  1 


34.  ^ 


36.  Y 


n2  +  3n  +  4 
i4  +  7  n3  +  6n  —  3 

\/n3  +  5n  —  1 
n2  —  sin  n3 


37.  Prove  the  special  comparison  test.  Hint  (part  a):  If  an/bn  L  and  M  >  L,  then 
a„  <  Mbn  for  large  n.  Compare  an  with  Mbn  ■ 
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7.  ALTERNATING  SERIES 


So  far  we  have  been  talking  about  series  of  positive  terms  (including  series  of  abso¬ 
lute  values).  Now  we  want  to  consider  one  important  case  of  a  series  whose  terms 
have  mixed  signs.  An  alternating  series  is  a  series  whose  terms  are  alternately  plus 
and  minus;  for  example, 


(7.1) 


(_l)n+t 

n 


is  an  alternating  series.  We  ask  two  questions  about  an  alternating  series.  Does  it 
converge?  Does  it  converge  absolutely  (that  is,  when  we  make  all  signs  positive)? 
Let  us  consider  the  second  question  first.  In  this  example  the  series  of  absolute 
values 


,  111  1 

1  +  —  +  —  +  y  +  --  --| - b  '  ’  ’ 

2  3  4  n 

is  the  harmonic  series  (6.1),  which  diverges.  We  say  that  the  series  (7.1)  is  not 
absolutely  convergent.  Next  we  must  ask  whether  (7.1)  converges  as  it  stands.  If  it 
had  turned  out  to  be  absolutely  convergent,  we  would  not  have  to  ask  this  question 
since  an  absolutely  convergent  series  is  also  convergent  (Problem  9).  However,  a 
series  which  is  not  absolutely  convergent  may  converge  or  it  may  diverge;  we  must 
test  it  further.  For  alternating  series  the  test  is  very  simple: 


Test  for  alternating  series.  An  alternating  series  converges  if  the  absolute 
value  of  the  terms  decreases  steadily  to  zero,  that  is,  if  |an+i|  <  |a„|  and 
limn_>00  an  =  0. 


In  our  example  -  <  — ,  and  lim  —  =  0,  so  (7.1)  converges. 

n  +  1  n  n—>  oo  n 


PROBLEMS,  SECTION  7 

Test  the  following  series  for  convergence. 


1. 

V'  (-1)" 

\  \/n 

n= 1 

2. 

OO 

E 

71=1 

(-2)" 

n2 

3. 

(-1)" 

^  n2 

71=1 

4. 

^  (-3)" 
n\ 

71=1 

5. 

OO 

E 

71=2 

(-1)" 

In  n 

6. 

y'  (— l)nn 
n  +  5 

71=1 

7. 

Y'  (-l)"n 
2-f.  1  +  n2 

71=0 

8. 

oo 

E 

71=1 

(-l)nvT0n 
n  +  2 

9. 

10. 

Prove  that  an  absolutely  convergent  series  a"  's  convergent.  Hint:  Put  b„  = 

an  +  |an|.  Then  the  6„  are  nonnegative;  we  have  |6„|  <  2|an|  and  an  =  b„  —  |on|. 

The  following  alternating  series  are  divergent  (but  you  are  not  asked  to  prove  this) . 

Show  that  an  — >  0.  Why  doesn’t  the  alternating  series  test  prove  (incorrectly)  that 
these  series  converge? 


/U\  JLJLJLJLJLJLJLJL 

(  }  V2  ~  2  +  7i~  3  +  7t~  4  +  7I”  5"' 
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8.  CONDITIONALLY  CONVERGENT  SERIES 


A  series  like  (7.1)  which  converges,  but  does  not  converge  absolutely,  is  called  con¬ 
ditionally  convergent.  You  have  to  use  special  care  in  handling  conditionally  con¬ 
vergent  series  because  the  positive  terms  alone  form  a  divergent  series  and  so  do 
the  negative  terms  alone.  If  you  rearrange  the  terms,  you  will  probably  change  the 
sum  of  the  series,  and  you  may  even  make  it  diverge!  It  is  possible  to  rearrange  the 
terms  to  make  the  sum  any  number  you  wish.  Let  us  do  this  with  the  alternating 
harmonic  series  1  —  |  +  1  —  ^  +  •  •  • .  Suppose  we  want  to  make  the  sum  equal  to 
1.5.  First  we  take  enough  positive  terms  to  add  to  just  over  1.5.  The  first  three 
positive  terms  do  this: 


1  +  ^  +  ^1^>L5- 


Then  we  take  enough  negative  terms  to  bring  the  partial  sum  back  under  1.5;  the 
one  term  —  1  does  this.  Again  we  add  positive  terms  until  we  have  a  little  more  than 
1.5,  and  so  on.  Since  the  terms  of  the  series  are  decreasing  in  absolute  value,  we  are 
able  (as  we  continue  this  process)  to  get  partial  sums  just  a  little  more  or  a  little  less 
than  1.5  but  always  nearer  and  nearer  to  1.5.  But  this  is  what  convergence  of  the 
series  to  the  sum  1.5  means:  that  the  partial  sums  should  approach  1.5.  You  should 
see  that  we  could  pick  in  advance  any  sum  that  we  want,  and  rearrange  the  terms 
of  this  series  to  get  it.  Thus,  we  must  not  rearrange  the  terms  of  a  conditionally 
convergent  series  since  its  convergence  and  its  sum  depend  on  the  fact  that  the 
terms  are  added  in  a  particular  order. 

Here  is  a  physical  example  of  such  a  series  which  emphasizes  the  care  needed 
in  applying  mathematical  approximations  in  physical  problems.  Coulomb’s  law 
in  electricity  says  that  the  force  between  two  charges  is  equal  to  the  product  of 
the  charges  divided  by  the  square  of  the  distance  between  them  (in  electrostatic 
units;  to  use  other  units,  say  SI,  we  need  only  multiply  by  a  numerical  constant). 
Suppose  there  are  unit  positive  charges  at  x  =  0,  -\/2,  \f\.  \/6,  v/8,  •  •  • ,  and  unit 
negative  charges  at  x  =  1,  y/3,  x/5,  y/7,  •  •  • .  We  want  to  know  the  total  force  acting 
on  the  unit  positive  charge  at  x  =  0  due  to  all  the  other  charges.  The  negative 
charges  attract  the  charge  at  x  =  0  and  try  to  pull  it  to  the  right;  we  call  the 
forces  exerted  by  them  positive,  since  they  are  in  the  direction  of  the  positive  x 
axis.  The  forces  due  to  the  positive  charges  are  in  the  negative  x  direction,  and  we 
call  them  negative.  For  example,  the  force  due  to  the  positive  charge  at  x  =  \/2  is 
—  (1  -  1)  /  (a/2)  =  —1/2.  The  total  force  on  the  charge  at  x  =  0  is,  then, 


(8.1) 


Now  we  know  that  this  series  converges  as  it  stands  (Section  7).  But  we  have  also 
seen  that  its  sum  (even  the  fact  that  it  converges)  can  be  changed  by  rearranging 
the  terms.  Physically  this  means  that  the  force  on  the  charge  at  the  origin  depends 
not  only  on  the  size  and  position  of  the  charges,  but  also  on  the  order  in  which  we 
place  them  in  their  positions!  This  may  very  well  go  strongly  against  your  physical 
intuition.  You  feel  that  a  physical  problem  like  this  should  have  a  definite  answer. 
Think  of  it  this  way.  Suppose  there  are  two  crews  of  workers,  one  crew  placing  the 
positive  charges  and  one  placing  the  negative.  If  one  crew  works  faster  than  the 
other,  it  is  clear  that  the  force  at  any  stage  may  be  far  from  the  F  of  equation  (8.1) 
because  there  are  many  extra  charges  of  one  sign.  The  crews  can  never  place  all  the 
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charges  because  there  are  an  infinite  number  of  them.  At  any  stage  the  forces  which 
would  arise  from  the  positive  charges  that  are  not  yet  in  place,  form  a  divergent 
series;  similarly,  the  forces  due  to  the  unplaced  negative  charges  form  a  divergent 
series  of  the  opposite  sign.  We  cannot  then  stop  at  some  point  and  say  that  the 
rest  of  the  series  is  negligible  as  we  could  in  the  bouncing  ball  problem  in  Section 

1.  But  if  we  specify  the  order  in  which  the  charges  are  to  be  placed,  then  the  sum 
S  of  the  series  is  determined  ( S  is  probably  different  from  F  in  (8.1)  unless  the 
charges  are  placed  alternately).  Physically  this  means  that  the  value  of  the  force 
as  the  crews  proceed  comes  closer  and  closer  to  S,  and  we  can  use  the  sum  of  the 
(properly  arranged)  infinite  series  as  a  good  approximation  to  the  force. 

9.  USEFUL  FACTS  ABOUT  SERIES 

We  state  the  following  facts  for  reference: 

1.  The  convergence  or  divergence  of  a  series  is  not  affected  by  multiplying  every 
term  of  the  series  by  the  same  nonzero  constant.  Neither  is  it  affected  by 
changing  a  finite  number  of  terms  (for  example,  omitting  the  first  few  terms). 

2.  Two  convergent  series  an  and  may  be  added  (or  subtracted) 

term  by  term.  (Adding  “term  by  term”  means  that  the  nth  term  of  the  sum 
is  an  +  bn .)  The  resulting  series  is  convergent,  and  its  sum  is  obtained  by 
adding  (subtracting)  the  sums  of  the  two  given  series. 

3.  The  terms  of  an  absolutely  convergent  series  may  be  rearranged  in  any  order 
without  affecting  either  the  convergence  or  the  sum.  This  is  not  true  of 
conditionally  convergent  series  as  we  have  seen  in  Section  8. 

PROBLEMS,  SECTION  9 


Test  the  following  series  for  convergence  or  divergence.  Decide  for  yourself  which  test  is 
easiest  to  use,  but  don’t  forget  the  preliminary  test.  Use  the  facts  stated  above  when  they 
apply. 
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19. 

1  1 
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°°  1 

(b)  E^nr 


(c)  For  what  values  of  k  is  ^  convergent? 


10.  POWER  SERIES;  INTERVAL  OF  CONVERGENCE 

We  have  been  discussing  series  whose  terms  were  constants.  Even  more  important 
and  useful  are  series  whose  terms  are  functions  of  x.  There  are  many  such  series, 
but  in  this  chapter  we  shall  consider  series  in  which  the  nth  term  is  a  constant  times 
xn  or  a  constant  times  ( x  —  a)n  where  a  is  a  constant.  These  are  called  power  series, 
because  the  terms  are  multiples  of  powers  of  x  or  of  (x  —  a).  In  later  chapters  we 
shall  consider  Fourier  series  whose  terms  involve  sines  and  cosines,  and  other  series 
(Legendre,  Bessel,  etc.)  in  which  the  terms  may  be  polynomials  or  other  functions. 
By  definition,  a  power  series  is  of  the  form 


El  anxn  =  ao  +  a±x  +  CL2X2  +  03a:3  +  •  •  •  or 


(10.1) 

n= 0 

00 

E  an{x  —  a)n  =  ao  +  a i(x  - 

Tl— 0 

-  a)  +  a2(x  —  a)2  +  a3(x  -  a)3  + 

where  the 

coefficients  an  are  constants. 

Here  are  some  examples: 

(10.2a) 

9  “T 

_  X  X  X° 

‘-2  +  T-T  +  - 

( -x)n 

+  +  •  •  • 

+  2n  +  ’ 

(10.2b) 

X2  X 3  X4 

X  2"  +  _3 

(-l)n+1xn 

•  +  - — 1 - +  •••  , 

n 

(10.2c) 

X3  X5  X7 

X  3!  +  5!  “  7!  +  " 

(- l)n+1x2n -1 
•  +  4-  •  •  • 

(2n  —  1)!  ’ 

(10. 2d) 

(x  +  2)  (x  +  2)2 

V2  V3 

(x  +  2)n 

■  +  •••+  - -  +  ■  ■  ■  . 

yjn\-  1 

Whether  a  power  series  converges  or  not  depends  on  the  value  of  x  we  are 
considering.  We  often  use  the  ratio  test  to  find  the  values  of  x  for  which  a  series 
converges.  We  illustrate  this  by  testing  each  of  the  four  series  (10.2).  Recall  that 
in  the  ratio  test  we  divide  term  n  + 1  by  term  n  and  take  the  absolute  value  of  this 
ratio  to  get  pn,  and  then  take  the  limit  of  pn  as  n  — >  00  to  get  p. 

Example  1.  For  (10.2a),  we  have 


(— a;)”+1  .  ( -x)n 

X 

2n+l  *  2n 

2 

x 
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The  series  converges  for  p  <  1,  that  is,  for  \x/2\  <  1  or  \x\  <  2,  and  it  diverges 
for  |ar|  >2  (see  Problem  6.30).  Graphically  we  consider  the  interval  on  the  x  axis 
between  x  =  —2  and  x  =  2;  for  any  x  in  this  interval  the  series  (10.2a)  converges. 
The  endpoints  of  the  interval,  x  =  2  and  x  =  —2,  must  be  considered  separately. 
When  x  =  2,  (10.2a)  is 

1  —  1  +  1  —  1  +  •  •  •  , 

which  is  divergent;  when  x  =  —2,  (10.2a)  is  1  +  1  +  1  +  1  +  --  -,  which  is  divergent. 
Then  the  interval  of  convergence  of  (10.2a)  is  stated  as  —2  <  x  <  2. 


Example  2.  For  (10.2b)  we  find 


.j.n+1  j,n 

nx 

n  +  1  n 

n  +  1 

p  =  lim 

n—*oo 


nx 

71+1 


The  series  converges  for  \x\  <  1.  Again  we  must  consider  the  endpoints  of  the 
interval  of  convergence,  x  =  1  and  x  =  —1.  For  x  =  1,  the  series  (10.2b)  is 
1—  |  +  i  —  !  +  •••;  this  is  the  alternating  harmonic  series  and  is  convergent.  For 

x  =  —1,  (10.2b)  is  —  1  —  \  —  |  j - ;  this  is  the  harmonic  series  (times  —1)  and 

is  divergent.  Then  we  state  the  interval  of  convergence  of  (10.2b)  as  —  1  <  x  <  1. 
Notice  carefully  how  this  differs  from  our  result  for  (10.2a).  Series  (10.2a)  did  not 
converge  at  either  endpoint  and  we  used  only  <  signs  in  stating  its  interval  of 
convergence.  Series  (10.2b)  converges  at  x  =  1,  so  we  use  the  sign  <  to  include 
x  =  1.  You  must  always  test  a  series  at  its  endpoints  and  include  the  results  in  your 
statement  of  the  interval  of  convergence.  A  series  may  converge  at  neither,  either 
one,  or  both  of  the  endpoints. 


Example  3.  In  (10.2c),  the  absolute  value  of  the  nth  term  is  \x2n~l  /  {2n  —  1)!|.  To  get 
term  n  +  1  we  replace  n  by  n  +  1;  then  2n  —  1  is  replaced  by  2(n  +  1)  —  1  =  2n  +  1, 
and  the  absolute  value  of  term  n  +  1  is 


Thus  we  get 


a;2"-1-1 

(2n+l)! 

x2n+1 

a;2""1 

x1 

(2n  +  1)!  ' 

(2n  —  1)! 

(2  n  +  l)(2n) 

p  =  lim 

n — »oo 


X 2 

(2  n  +  l)(2n) 


=  0. 


Since  p  <  1  for  all  values  of  x,  this  series  converges  for  all  x. 


Example  4.  In  (10. 2d),  we  find 


Pn  — 

P  = 


{x  +  2)n+1 
y/  n  +  2 


lim 

n—>  oo 


(x  +  2) 


(x  +  2)n 
y/n  1 
\]n  +  1 

Tl  +  2 


\x  +  2| . 
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The  series  converges  for  |rr  +  2|  <  1;  that  is,  for  —1  <  x  +  2  <  1,  or  —  3  <  x  <  —  1. 
If  x  =  -3,  (10.2d)  is 

,  1  1  1 

1 - H - -j= - -j=  +  •  •  • 

V2  V3  V4 

which  is  convergent  by  the  alternating  series  test.  For  x  =■  —  1,  the  series  is 

1  1  -V  1 

+  V2  +  V3  +  "'  ^0v^n 

which  is  divergent  by  the  integral  test.  Thus,  the  series  converges  for  —  3  <  x  <  1. 
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Find  the  interval  of  convergence  of  each  of  the  following  power  series;  be  sure  to  investigate 


the  endpoints  of  the 

interval  in  each  case. 

1. 

OO 

n=0 

2. 

OO 

E 

71=0 

(2  x)n 

3" 

3. 

y  ( -i)nxn 
“  n(n  +  1) 

71=1 

4. 

00 

2"n2 

n= 1 

5. 

OO 

E 

71=1 

xn 

(n!)2 

6. 

y  (_l)"xn 

Z-  (2n)i 

71=1  v  7 

7. 

00  ~3n 

V  ^  J-! 

2^  n 

n= 1 

8. 

OO 

E 

71=1 

(-l)"xn 

y/n 

9. 

OO 

E 

71=1 

10. 

y  (-I)™*2- 
(2  n)3/2 

11. 

OO 

E 

71=1 

1  /x\n 

n  V5/ 

12. 

OO 

En(_2*)n 

71=1 

13. 

y,  n(-x)n 

14. 

OO 

E 

71=1 

n  (x 

D’  «• 

y  (X  -  2)™ 

n2  +  1 

71=1 

n  +  1  V  3 

Z_-/  Qti 

71=1 

16. 

y  (x  - 1)" 

17. 

OO 

E 

71=1 

(-l)"(x 

+  l)71 

1  >  IQ 

(— 2)n(2x  +  l)n 

2^  2n 

71=1 

n 

±0. 

Z^  n2 

71=1 

The  following  series  are  not  power  series,  but  you  can  transform  each  one  into  a  power 
series  by  a  change  of  variable  and  so  find  out  where  it  converges. 

19. 

Eo^s-'V- 

for  \y\  <  8,  so 

l)n  Method:  Let  y  =  x2  —  1.  The  power  series  Eo°  ®  "y"  converges 
the  original  series  converges  for  \x2  —  1 1  <8,  which  means  \x\  <  3. 

20. 

x  C\n 

0 

2  +  l)2" 

21. 

y  (-l)"x”/2 

/  n  Inn 

2 

22. 

y,n!(-l)n 

23. 

y  3n(n+l) 

/  rpTl 

0  * 

OO 

,  77,  C\Tl 

V  (*  +  !)" 

OO 

24. 

E  (v%2  + 1 

0 

\'L  Z 

/  3 n  +  n3 

25. 

E(sin*)"(— 

0 

l)n2n 
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11.  THEOREMS  ABOUT  POWER  SERIES 

We  have  seen  that  a  power  series  X)^°=o  an%n  converges  in  some  interval  with  center 
at  the  origin.  For  each  value  of  x  (in  the  interval  of  convergence)  the  series  has  a 
finite  sum  whose  value  depends,  of  course,  on  the  value  of  x.  Thus  we  can  write  the 
sum  of  the  series  as  S(x)  =  anXn-  We  see  then  that  a  power  series  (within 

its  interval  of  convergence)  defines  a  function  of  x,  namely  S(x).  In  describing  the 
relation  of  the  series  and  the  function  S(x),  we  may  say  that  the  series  converges 
to  the  function  S(x),  or  that  the  function  S(x)  is  represented  by  the  series,  or  that 
the  series  is  the  power  series  of  the  function.  Here  we  have  thought  of  obtaining 
the  function  from  a  given  series.  We  shall  also  (Section  12)  be  interested  in  finding 
a  power  series  that  converges  to  a  given  function.  When  we  are  working  with 
power  series  and  the  functions  they  represent,  it  is  useful  to  know  the  following 
theorems  (which  we  state  without  proof;  see  advanced  calculus  texts) .  Power  series 
are  very  useful  and  convenient  because  within  their  interval  of  convergence  they  can 
be  handled  much  like  polynomials. 

1.  A  power  series  may  be  differentiated  or  integrated  term  by  term;  the  resulting 
series  converges  to  the  derivative  or  integral  of  the  function  represented  by  the 
original  series  within  the  same  interval  of  convergence  as  the  original  series 
(that  is,  not  necessarily  at  the  endpoints  of  the  interval) . 

2.  Two  power  series  may  be  added,  subtracted,  or  multiplied;  the  resultant  series 
converges  at  least  in  the  common  interval  of  convergence.  You  may  divide  two 
series  if  the  denominator  series  is  not  zero  at  x  =  0,  or  if  it  is  and  the  zero 
is  canceled  by  the  numerator  [as,  for  example,  in  (sin®)/®;  see  (13.1)].  The 
resulting  series  will  have  some  interval  of  convergence  (which  can  be  found 
by  the  ratio  test  or  more  simply  by  complex  variable  theory — see  Chapter  2, 
Section  7). 

3.  One  series  may  be  substituted  in  another  provided  that  the  values  of  the 
substituted  series  are  in  the  interval  of  convergence  of  the  other  series. 

4.  The  power  series  of  a  function  is  unique,  that  is,  there  is  just  one  power  series 

of  the  form  anXn  which  converges  to  a  given  function. 

12.  EXPANDING  FUNCTIONS  IN  POWER  SERIES 

Very  often  in  applied  work,  it  is  useful  to  find  power  series  that  represent  given 
functions.  We  illustrate  one  method  of  obtaining  such  series  by  finding  the  series 
for  sin®.  In  this  method  we  assume  that  there  is  such  a  series  (see  Section  14  for 
discussion  of  this  point)  and  set  out  to  find  what  the  coefficients  in  the  series  must 
be.  Thus  we  write 

(12.1)  sin®  =  do  +  a\X  +  a^x2  +  •  •  •  +  anxn  +  ■  ■  ■ 

and  try  to  find  numerical  values  of  the  coefficients  an  to  make  (12.1)  an  identity 
(within  the  interval  of  convergence  of  the  series).  Since  the  interval  of  convergence 
of  a  power  series  contains  the  origin,  (12.1)  must  hold  when  x  =  0.  If  we  substitute 
x  =  0  into  (12.1),  we  get  0  =  ao  since  sinO  =  0  and  all  the  terms  except  a0  on  the 
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right-hand  side  of  the  equation  contain  the  factor  x.  Then  to  make  (12.1)  valid  at 
x  =  0,  we  must  have  ao  =  0.  Next  we  differentiate  (12.1)  term  by  term  to  get 

(12.2)  cos  x  =  a  i  +  2a,2X  +  3a32T  +  ■  ■  ■  . 


(This  is  justified  by  Theorem  1  of  Section  11.)  Again  putting  x  =  0,  we  get  1  =  Oi. 
We  differentiate  again,  and  put  x  =  0  to  get 


(12.3) 


—  sin  x  =  2a2  +  3  •  203a;  +  4  •  3a42;2  +  •  •  •  , 
0  =  2&2' 


Continuing  the  process  of  taking  successive  derivatives  of  (12.1)  and  putting  x  =  0, 
we  get 


—  3  •  2a3  ~\~  4  •  3  •  ‘la^x 

=  3!  a3,  a3  =  -l; 

=  4  •  3  •  2  •  <24  +  5-  4-  3-  2a$x  +  ■  ■  ■  , 

=  a4; 

—  5  •  4  •  3  •  2ci5  -(-•••, 

=  5!  a5,  ■  ■  ■  . 

We  substitute  these  values  back  into  (12.1)  and  get 

(12.5)  sinx  =  a;-|i-  +  |y - . 

You  can  probably  see  how  to  write  more  terms  of  this  series  without  further  com¬ 
putation.  The  sin  a:  series  converges  for  all  x\  see  Example  3,  Section  10. 

Series  obtained  in  this  way  are  called  Maclaurin  series  or  Taylor  series  about 
the  origin.  A  Taylor  series  in  general  means  a  series  of  powers  of  (x  —  a),  where  a  is 
some  constant.  It  is  found  by  writing  (x  —  a)  instead  of  x  on  the  right-hand  side  of 
an  equation  like  (12.1),  differentiating  just  as  we  have  done,  but  substituting  x  =  a 
instead  of  x  =  0  at  each  step.  Let  us  carry  out  this  process  in  general  for  a  function 
f(x).  As  above,  we  assume  that  there  is  a  Taylor  series  for  f(x),  and  write 

(12.6)  f(x)  =  ao  +  a4(x  —  a)  +  0,2(2:  —  a)2  +  a3(a:  —  a)3  +  04(2:  —  a)4  +  ••• 

+  an{x  —  a)n  +  •  •  •  , 

f'(  x)  =  ai  +  202(2:  —  o)  +  3  as(x  —  a)2  +  404(2:  —  a)3  +  •  •  • 

+  nan(x  —  a)n_1  +  •  •  •  , 

f"( x)  =  2a2  +  3  •  2a3(2;  —  a)  +  4  •  304(2:  —  o)2  H - 

+  n(n  -  l)a„(2;  -  a)n~2  +  •  •  •  , 
f"'(x)  =  3!  a3  +  4  •  3  •  204(2:  —  a)  +  •  •  • 

+  n(n  —  1  )(n—  2)an(x  —  a)"-3  +  •  •  •  , 

/<-n)( 2’)  =  n(n  —  l)(n  —  2)  •  •  •  1  •  an  +  terms  containing  powers  of  (2 ;  —  o). 


—  cos  x 
-1 

(12.4)  sin  x 

0 

COS  X 
1 
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[The  symbol  / (x)  means  the  nth  derivative  of  /(#).]  We  now  put  x  =  a  in  each 
equation  of  (12.6)  and  obtain 


f(a)  =  a0,  f'(a)  =  a\,  f"(a)  =  2a2, 

(12.7)  ,  v 

=  3!  a3,  •••,  /{n)  (a)  =  n!  an. 

[Remember  that  /'(a)  means  to  differentiate  /( x)  and  then  put  x  =  a;  f"(a)  means 
to  find  f"(x)  and  then  put  x  =  a,  and  so  on.] 

We  can  then  write  the  Taylor  series  for  f(x)  about  x  =  a: 

(12.8)  f(x)  =  /(a)  +  (x-a)f(a)  +  ^(x-a)2f"(a)  +  ---  +  ^(x- a)"/(n) (a)  +  •  •  •  . 

The  Maclaurin  series  for  f(x)  is  the  Taylor  series  about  the  origin.  Putting 
a  =  0  in  (12.8),  we  obtain  the  Maclaurin  series  for  f(x): 

2  3  n 

(12.9)  f(x)  =  /( 0)  +  xf\  0)  +  X-f’\  0)  +  X-f"\  0)  +  •  •  •  +  ^-/<">(0)  +  •  •  •  . 

We  have  written  this  in  general  because  it  is  sometimes  convenient  to  have  the 
formulas  for  the  coefficients.  However,  finding  the  higher  order  derivatives  in  (12.9) 
for  any  but  the  simplest  functions  is  unnecessarily  complicated  (try  it  for,  say,  etana:). 
In  Section  13,  we  shall  discuss  much  easier  ways  of  getting  Maclaurin  and  Taylor 
series  by  combining  a  few  basic  series.  Meanwhile,  you  should  verify  (Problem  1, 
below)  the  basic  series  (13.1)  to  (13.5)  and  memorize  them. 


PROBLEMS,  SECTION  12 


1.  By  the  method  used  to  obtain  (12.5)  [which  is  the  series  (13.1)  below],  verify  each 
of  the  other  series  (13.2)  to  (13.5)  below. 


13.  TECHNIQUES  FOR  OBTAINING  POWER  SERIES  EXPANSIONS 

There  are  often  simpler  ways  for  finding  the  power  series  of  a  function  than  the 
successive  differentiation  process  in  Section  12.  Theorem  4  in  Section  11  tells  us 
that  for  a  given  function  there  is  just  one  power  series,  that  is,  just  one  series  of  the 
form  anXn ■  Therefore  we  can  obtain  it  by  any  correct  method  and  be  sure 

that  it  is  the  same  Maclaurin  series  we  would  get  by  using  the  method  of  Section 
12.  We  shall  illustrate  a  variety  of  methods  for  obtaining  power  series.  First  of  all, 
it  is  a  great  timesaver  for  you  to  verify  (Problem  12.1)  and  then  memorize  the  basic 
series  (13.1)  to  (13.5).  We  shall  use  these  series  without  further  derivation  when  we 
need  them. 
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When  we  use  a  series  to  approximate  a  function,  we  may  want  only  the  first  few 
terms,  but  in  derivations,  we  may  want  the  formula  for  the  general  term  so  that  we 
can  write  the  series  in  summation  form.  Let’s  look  at  some  methods  of  obtaining 
either  or  both  of  these  results. 


A.  Multiplying  a  Series  by  a  Polynomial  or  by  Another  Series 

Example  1.  To  find  the  series  for  (a;  +  1)  sin  x,  we  multiply  (x  +  1)  times  the  series  (13.1) 
and  collect  terms: 


(a;  +  1)  sin  a: 


(x  +  1) 

x  +  x2 


You  can  see  that  this  is  easier  to  do  than  taking  the  successive  derivatives  of  the 
product  ( x  +  1)  sin  a:,  and  Theorem  4  assures  us  that  the  results  are  the  same. 
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Example  2.  To  find  the  series  for  e^cosa:,  we  multiply  (13.2)  by  (13.3): 


e*  COS  X  =  (1  +  X  + 


=1  +  x  - 


X 

21 

x2 

2!" 


x “ 

3!" 

2! 


x 

4!  4 
x4 

¥" 

x4 

2!2! 

x4 

u 


2  4 

„  x  X * 

~  2[  +  ¥ 


=1  +  x  +  Ox2  -  — - - - = 


a:3  x4 


There  are  two  points  to  note  here.  First,  as  you  multiply,  line  up  the  terms 
involving  each  power  of  x  in  a  column;  this  makes  it  easier  to  combine  them.  Second, 
be  careful  to  include  all  the  terms  in  the  product  out  to  the  power  you  intend  to 
stop  with,  but  don’t  include  any  higher  powers.  In  the  above  example,  note  that 
we  did  not  include  the  x3  ■  x2  terms;  if  we  wanted  the  x5  term  in  the  answer,  we 
would  have  to  include  all  products  giving  a:5  (namely,  x  ■  x4,  x3  ■  x2,  and  a;5  •  1). 

Also  see  Chapter  2,  Problem  17.30,  for  a  simple  way  of  getting  the  general  term 
of  this  series. 


B.  Division  of  Two  Series  or  of  a  Series  by  a  Polynomial 

Example  1.  To  find  the  series  for  (1/a:)  ln(l  +  x),  we  divide  (13.4)  by  x.  You  should  be 
able  to  do  this  in  your  head  and  just  write  down  the  answer. 


1 ,  ,  „  x  x2 

-i„(i+*)  =  i--  +  T 


To  obtain  the  summation  form,  we  again  just  divide  (13.4)  by  x.  We  can  simplify 
the  result  by  changing  the  limits  to  start  at  n  =  0,  that  is,  replace  n  by  n  +  1. 


1  00  /  1  \n+l  n-l  °°  ( _ 1  \n~.n 

1  in(i + x)  =  y  — 1  rc x 

n= 1  n=0 


n 


n  +  1 


28  Infinite  Series,  Power  Series 


Chapter  1 


Example  2.  To  find  the  series  for  tana:,  we  divide  the  series  for  sin  a:  by  the  series  for 
cos  a:  by  long  division: 


C.  Binomial  Series 

If  you  recall  the  binomial  theorem,  you  may  see  that  (13.5)  looks  just  like  the 
beginning  of  the  binomial  theorem  for  the  expansion  of  (a  +  b)n  if  we  put  a  =  1, 
b  =  x,  and  n  =  p.  The  difference  here  is  that  we  allow  p  to  be  negative  or  fractional, 
and  in  these  cases  the  expansion  is  an  infinite  series.  The  series  converges  for  \x\  <  1 
as  you  can  verify  by  the  ratio  test.  (See  Problem  1.) 

From  (13.5),  we  see  that  the  binomial  coefficients  are: 


(13.6) 


'p\  =  pjp  -  1) 

2 )  2!  ’ 

'p\  _  pip-  l)(p-  2) 

3J~  3! 

p\  =  p(p-l)(p-2)---(p-n+l) 
n  J  n\ 


Example  1.  To  find  the  series  for  1/(1  +  a;),  we  use  the  binomial  series  (13.5)  to  write 


1 

1  +  x 


(1  +  x)  1  =  1  -  X  + 


(—!)(— 2)  9  ,  (—!)(— 2)(-3) 


=  1  —  a;  +  a:2  —  a;3H - -  ^(-a:)71. 

n= 0 
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Example  2.  The  series  for  \/l  +  x  is  (13.5)  with  p  =  1/2. 


y/l  +  x  =  (1  +  x )1^2  = 


OO 

E 

n=0 


1/2' 

n 


=  !  +  -* 

=  !  +  !*- 


If-lf 
21  2>-x2 


H-  h)(- l\xs  ,  IH)(— §)(—§) 


2! 


3! 

1  3  5  4 

16 X  128 X 


4! 


2V 


From  (13.6)  we  can  see  that  the  binomial  coefficients  when  n  =  0  and  n  =  1  are 
(l/2)  =  1  and  (1{2)  =  1/2.  For  n  >  2,  we  can  write 

/±\  =  (£)(-§)(-§)•••(£ -  n+1)  =  (— l)n~13  •  5  •  7 ■  •  •  (2n  —  3) 

\n J  n\  n\  2” 


(— l)"_1(2n  —  3)!! 
(2n)!! 


where  the  double  factorial  of  an  odd  number  means  the  product  of  that  number 
times  all  smaller  odd  numbers,  and  a  similar  definition  for  even  numbers.  For 
example,  7!!  =  7-5-3,  and  8!!  =  8  •  6  •  4  •  2. 


PROBLEMS,  SECTION  13 

1.  Use  the  ratio  test  to  show  that  a  binomial  series  converges  for  |a:|  <  1. 

2.  Show  that  the  binomial  coefficients  (//)  =  (— l)n. 

3.  Show  that  if  p  is  a  positive  integer,  then  ((()  =  0  when  n  >  p,  so  (1  +  x)p  =  5/  (£)x" 
is  just  a  sum  of  p+  1  terms,  from  n  =  0  to  n  =  p.  For  example,  (1  +  x)2  has  3  terms, 
(1  +  a;)3  has  4  terms,  etc.  This  is  just  the  familiar  binomial  theorem. 

4.  Write  the  Maclaurin  series  for  1  /\f\  +  x  in  J/  form  using  the  binomial  coefficient 
notation.  Then  find  a  formula  for  the  binomial  coefficients  in  terms  of  n  as  we  did 
in  Example  2  above. 


D.  Substitution  of  a  Polynomial  or  a  Series 
for  the  Variable  in  Another  Series 

Example  1.  Find  the  series  for  e~x  .  Since  we  know  the  series  (13.3)  for  ex,  we  simply 
replace  the  x  there  by  — x 2  to  get 


1-x2 

1-x2 


(-a;2)2  (-£2)3 


Example  2.  Find  the  series  for  etanx.  Here  we  must  replace  the  x  in  (13.3)  by  the  series 
of  Example  2  in  method  B.  Let  us  agree  in  advance  to  keep  terms  only  as  far  as  x4; 
we  then  write  only  terms  which  can  give  rise  to  powers  of  x  up  to  4,  and  neglect 
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any  higher  powers: 


gtanz  =  1  + 


777  X 


=  1  +  X 


X 

2!" 


7^  x 


+  1+ 


+  ...)4  +  , 


2a;4 

3  •  2! 


3! 


4! 


=  1  +  x  +  T  +  T  +  -x 


E.  Combination  of  Methods 

Example.  Find  the  series  for  arc  tan x.  Since 


dt 


1  +  t2 


=  arc  tan  t 


X 


=  arc  tan  x, 


o 

2\-l 


we  first  write  out  (as  a  binomial  series)  (1  +  t2)  1  and  then  integrate  term  by  term: 
(l  +  t2)-1  =  l-t2  +  ti -t6  -\ - ; 


dt  t 3 

r+f2  =  y 


7 


Thus,  we  have 


(13.7) 


/v»3  ^5  rpT 

arc  tan  x  =  x  —  — —  — 


Compare  this  simple  way  of  getting  the  series  with  the  method  in  Section  12  of 
finding  successive  derivatives  of  arc  tan  x. 


F.  Taylor  Series  Using  the  Basic  Maclaurin  Series 

In  many  simple  cases  it  is  possible  to  obtain  a  Taylor  series  using  the  basic  memo¬ 
rized  Maclaurin  series  instead  of  the  formulas  or  method  of  Section  12. 


Example  1.  Find  the  first  few  terms  of  the  Taylor  series  for  In  a:  about  x  =  1.  [This  means 
a  series  of  powers  of  (x  —  1)  rather  than  powers  of  x.\  We  write 

In  a:  =  ln[l  +  (x  —  1)] 

and  use  (13.4)  with  x  replaced  by  (x  —  1): 

In  a;  =  ln[l  +  (ar  —  1)]  =  (x  -  1)  -  ^(x  -  l)2  +  i(a:  -  l)3  -  i(a:  -  l)4  •  •  •  . 
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Example  2.  Expand  cos  a:  about  x  =  3n/2.  We  write 


cos  x  =  cos 


37T 

T 


using  (13.1)  with  x  replaced  by  (x  —  37r/2). 


G.  Using  a  Computer 

You  can  also  do  problems  like  these  using  a  computer.  This  is  a  good  method 
for  complicated  functions  where  it  saves  you  a  lot  of  algebra.  However,  you’re  not 
saving  time  if  it  takes  longer  to  type  a  problem  into  the  computer  than  to  do  it 
in  your  head!  For  example,  you  should  be  able  to  just  write  down  the  first  few 
terms  of  (sin  x) /x  or  (1  —  cosx)/x2.  A  good  method  of  study  is  to  practice  doing 
problems  by  hand  and  also  check  your  results  using  the  computer.  This  will  turn 
up  errors  you  are  making  by  hand,  and  also  let  you  discover  what  the  computer 
will  do  and  what  it  won’t  do!  It  is  very  illuminating  to  computer  plot  the  function 
you  are  expanding,  along  with  several  partial  sums  of  the  series,  in  order  to  see  how 
accurately  the  partial  sums  represent  the  function — see  the  following  example. 


Example.  Plot  the  function  ex  cos  x  together  with  several  partial  sums  of  its  Maclaurin 
series.  Using  Example  2  in  13A  or  a  computer,  we  have 


x3  x4 

ex  cos  x  =  1  +  x  —  - - - — 

3  6 


Figure  13.1  shows  plots  of  the  function  along  with  each  of  the  partial  sums  S\  =  1+x, 
S3  =  1  +  x  -  S4  =  1  +  x  -  £  -  *£,  S5  =  1  +  x  -  £  -  ^  -  §£.  We  can  see 
from  the  graphs  the  values  of  x  for  which  an  approximation  is  fairly  good.  Also  see 
Section  14. 
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PROBLEMS,  SECTION  13 

Using  the  methods  of  this  section: 

(a)  Find  the  first  few  terms  of  the  Maclaurin  series  for  each  of  the  following  functions. 

(b)  Find  the  general  term  and  write  the  series  in  summation  form. 

(c)  Check  your  results  in  (a)  by  computer. 

(d)  Use  a  computer  to  plot  the  function  and  several  approximating  partial  sums  of  the 
series. 


8. 


11. 


x2  ln(l  —  x) 

6. 

xVl  +  x 

7.  —  sin  x 

X 

1 

9. 

1  +  X 

10.  sinx2 

•s/1  —  x 2 

1  —  X 

Sirl^,  x  >  0 

12. 

PX 

/  cos  t2  dt 

rx  2 

13.  /  e~l  dt 

VT 

Jo 

Jo 

14.  In 


1  +  x 


dt 


16.  cosh  a:  = 


1  -  *  J0 

ex  +e 


1-t2 


15.  arc  sin  x  = 

1  +  x 


r  dt 

'o  vT  —  t2 


17.  In  ■ 


1  —  x 


18. 


r  si 

J  0 


sin  t  dt 


t 


nx 

19.  ln(x  +  sj l  +  x2  )  = 

Jo 


dt 


vi  +  t2 


Find  the  first  few  terms  of  the  Maclaurin  series  for  each  of  the  following  functions  and 
check  your  results  by  computer. 


20.  e^sinx 
1 


23. 

26. 


1  +  x  +  x2 

1 

^/cosx 


21.  tan  x 


24.  secx  = 


22. 

25. 


1  —  x 
2x 


27.  es 


29.  x/l  +  ln(l  +  x) 
32.  ln(l  +  xex) 

35.  -A- 


30. 

33. 

36. 


1  —  x 
1  +  x 
1  —  sin  x 
1  —  x 

fu  sinx  dx 
Jo  Vl  —  x2 


e2x  -  1 
28.  sin[ln(l  +  x)] 

31.  cos(ex  —  1) 

34.  ln(2  —  e~x) 


37.  In  cos  x  Hints:  Method  1:  Write  cosx  =  1  +  (cosx  —  1)  =  1  +  u;  use  the  series  you 
know  for  ln(l  +  -u);  replace  u  by  the  Maclaurin  series  for  (cosx  —  1).  Method  2: 
In  cosx  =  —  fg  tanu  du.  Use  the  series  of  Example  2  in  method  B. 

OO  COS  X  TT-  ,  COS  X  COS  Of  — 1 

38.  e  hint:  e  =  e  •  e 

Using  method  F  above,  find  the  first  few  terms  of  the  Taylor  series  for  the  following 
functions  about  the  given  points. 

(x)  =  -i,  a  =  1 
x 


39. 

f(x) 

=  sinx, 

a  =  7t/2 

40. 

41. 

f(x) 

X 

=  e  ,  a 

=  3 

42. 

43. 

/(*) 

=  cot  X, 

a  =  7r/2 

44. 
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14.  ACCURACY  OF  SERIES  APPROXIMATIONS 

The  thoughtful  student  might  well  be  disturbed  about  the  mathematical  manipula¬ 
tions  we  have  been  doing.  How  do  we  know  whether  these  processes  we  have  shown 
really  give  us  series  that  approximate  the  functions  being  expanded?  Certainly  some 
functions  cannot  be  expanded  in  a  power  series;  since  a  power  series  becomes  just 
cio  when  x  =  0,  it  cannot  be  equal  to  any  function  (like  1/x  or  In  a:)  which  is  infinite 
at  the  origin.  So  we  might  ask  whether  there  are  other  functions  (besides  those  that 
become  infinite  at  the  origin)  which  cannot  be  expanded  in  a  power  series.  All  we 
have  done  so  far  is  to  show  methods  of  finding  the  power  series  for  a  function  if  it 
has  one.  Now  is  there  a  chance  that  there  might  be  some  functions  which  do  not 
have  series  expansions,  but  for  which  our  formal  methods  would  give  us  a  spurious 
series?  Unfortunately,  the  answer  is  “Yes” ;  fortunately,  this  is  not  a  very  common 
difficulty  in  practice.  However,  you  should  know  of  the  possibility  and  what  to  do 
about  it.  You  may  first  think  of  the  fact  that,  say,  the  equation 

- - =  1  —  x  +  x2  —  x3  +  •  •  • 

1  +  x 

is  not  valid  for  |x|  >  1.  This  is  a  fairly  easy  restriction  to  determine;  from  the 
beginning  we  recognized  that  we  could  use  our  series  expansions  only  when  they 
converged.  But  there  is  another  difficulty  which  can  arise.  It  is  possible  for  a 
series  found  by  the  above  methods  to  converge  and  still  not  represent  the  function 
being  expanded!  A  simple  example  of  this  is  e~^!x  1  for  which  the  formal  series  is 
0  +  0  +  0+  ••  •  because  e~d+  )  and  a]j  its  derivatives  are  zero  at  the  origin  (Problem 
15.26).  It  is  clear  that  1  is  not  zero  for  x2  >  0,  so  the  series  is  certainly  not 

correct.  You  can  startle  your  friends  with  the  following  physical  interpretation  of 
this.  Suppose  that  at  t  =  0  a  car  is  at  rest  (zero  velocity),  and  has  zero  acceleration, 
zero  rate  of  change  of  acceleration,  etc.  (all  derivatives  of  distance  with  respect  to 
time  are  zero  at  t  =  0) .  Then  according  to  Newton’s  second  law  (force  equals  mass 
times  acceleration),  the  instantaneous  force  acting  on  the  car  is  also  zero  (and,  in 
fact,  so  are  all  the  derivatives  of  the  force).  Now  we  ask  “Is  it  possible  for  the  car 
to  be  moving  immediately  after  t  =  0?”  The  answer  is  “Yes”!  For  example,  let  its 
distance  from  the  origin  as  a  function  of  time  be  e”*'1/*  \ 

This  strange  behavior  is  really  the  fault  of  the  function  itself  and  not  of  our 
method  of  finding  series.  The  most  satisfactory  way  of  avoiding  the  difficulty  is  to 
recognize,  by  complex  variable  theory,  when  functions  can  or  cannot  have  power 
series.  We  shall  consider  this  in  Chapter  14,  Section  2.  Meanwhile,  let  us  consider 
two  important  questions:  (1)  Does  the  Taylor  or  Maclaurin  series  in  (12.8)  or  (12.9) 
actually  converge  to  the  function  being  expanded?  (2)  In  a  computation  problem,  if 
we  know  that  a  series  converges  to  a  given  function,  how  rapidly  does  it  converge? 
That  is,  how  many  terms  must  we  use  to  get  the  accuracy  we  require?  We  take  up 
these  questions  in  order. 

The  remainder  Rn{x)  in  a  Taylor  series  is  the  difference  between  the  value  of 
the  function  and  the  sum  of  n  +  1  terms  of  the  series: 

f{a)  +  (x  -  a)f'(a)  +  ^( x  -  a)2f\a) 

+  ---  +  -Ax-a)nf<-n\a)  . 
n\ 


(14.1)  Rn(x)  =  f(x) 
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Saying  that  the  series  converges  to  the  function  means  that  lim^oo  |i?n(a;)|  =  0. 
There  are  many  different  formulas  for  Rn(x)  which  are  useful  for  special  purposes; 
you  can  find  these  in  calculus  books.  One  such  formula  is 


(14.2) 


Rn  (*t') 


(x  —  a)w+1/(n+1)(c) 
(n+  1)! 


where  c  is  some  point  between  a  and  x.  You  can  use  this  formula  in  some  simple 
cases  to  prove  that  the  Taylor  or  Maclaurin  series  for  a  function  does  converge  to 
the  function  (Problems  11  to  13). 


Error  in  Series  Approximations  Now  suppose  that  we  know  in  advance  that 
the  power  series  of  a  function  does  converge  to  the  function  (within  the  interval 
of  convergence),  and  we  want  to  use  a  series  approximation  for  the  function.  We 
would  like  to  estimate  the  error  caused  by  using  only  a  few  terms  of  the  series. 

There  is  an  easy  way  to  estimate  this  error  when  the  series  is  alternating  and 
meets  the  alternating  series  test  for  convergence  (Section  7).  In  this  case  the  error 
is  (in  absolute  value)  less  than  the  absolute  value  of  the  first  neglected  term  (see 
Problem  1). 


OO 

If  S  =  }  an  is  an  alternating  series  with  |an+i|  <  an  | , 

(14.3) 

and  lim  an  =  0,  then  15  —  (ai  +  02  +  •  •  •  +  an)\  <  |an+i|. 

n — ►oo 


Example  1.  Consider  the  series 

!  1  1  1  1  1  1 
-2  +  4_8+16~32  +  64"'' 

The  sum  of  this  series  [see  (1.8),  a  =  1,  r  =  —  5]  is  S  =  |  =  0.666  •  •  •  .  The  sum  of 
the  terms  through  is  0.656+,  which  differs  from  S  by  about  0.01.  This  is  less 
than  the  next  term  =  ^  =  0.015+. 

Estimating  the  error  by  the  first  neglected  term  may  be  quite  misleading  for 
convergent  series  that  are  not  alternating. 


► 


Example  2.  Suppose  we  approximate  1  /n'2  by  the  sum  of  the  first  five  terms;  the 

error  is  then  about  0.18  [see  problem  2(a)].  But  the  first  neglected  term  is  1/62  = 
0.028  which  is  much  less  than  the  error.  However,  note  that  we  are  finding  the  sum 
of  the  power  series  xn /n2  when  x  =  1,  which  is  the  largest  x  for  which  the 

series  converges.  If,  instead,  we  ask  for  the  sum  of  the  series  when  x  =  1/2,  we  find 
[see  Problem  2(b)]: 


=  0.5822+  . 


The  sum  of  the  first  five  terms  of  the  series  is  0.5815+,  so  the  error  is  about  0.0007. 
The  next  term  is  (^)2/62  =  0.0004,  which  is  less  than  the  error  but  still  of  the 
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same  order  of  magnitude.  We  can  state  the  following  theorem  [Problem  2(c)]  which 
covers  many  practical  problems. 


That  is,  as  in  (14.3),  the  error  may  be  estimated  by  the  first  neglected  term,  but 
here  the  error  may  be  a  few  times  as  large  as  the  first  neglected  term  instead  of 
smaller.  In  the  example  of  '}^xn/n2  with  x  =  we  have  1  —  x  =  1,  so  (14.4) 
says  that  the  error  is  less  than  two  times  the  next  term.  We  observe  that  the  error 
0.0007  is  less  than  2(0.0004)  as  (14.4)  says. 

For  values  of  \x\  much  less  than  1,  1  —  |cc|  is  about  1,  so  the  next  term  gives  a 
good  error  estimate  in  this  case.  If  the  interval  of  convergence  is  not  |®|  <  1,  but, 
for  example,  |ar|  <2  as  in 


we  can  easily  let  x/2  =  y,  and  apply  the  theorem  in  terms  of  y. 

PROBLEMS,  SECTION  14 

1.  Prove  theorem  (14.3).  Hint:  Group  the  terms  in  the  error  as  (an+i  +a„+2)  +  (fl»+3  + 
ttn+4)  +  •  •  •  to  show  that  the  error  has  the  same  sign  as  a„+i-  Then  group  them  as 
an+ 1  +  (an+2  +  a-n+3 )  +  (an+4  +  a„+ 5)  +  •  •  •  to  show  that  the  error  has  magnitude 
less  than  |a„+i|- 

2.  (a)  Using  computer  or  tables  (or  see  Chapter  7,  Section  11),  verify  that  l/w2  = 

7r2/6  =  1.6449+,  and  also  verify  that  the  error  in  approximating  the  sum  of 
the  series  by  the  first  five  terms  is  approximately  0.1813. 

(b)  By  computer  or  tables  verify  that  5D[(!_1(l/n2)(l/2)”  =  7r2/12  —  (l/2)(ln 2)2  = 
0.5822+,  and  that  the  sum  of  the  first  five  terms  is  0.5815+. 

(c)  Prove  theorem  (14.4).  Hint:  The  error  is  |  X);v+i  anXn\.  Use  the  fact  that  the 
absolute  value  of  a  sum  is  less  than  or  equal  to  the  sum  of  the  absolute  values. 
Then  use  the  fact  that  |a„+i|  <  \an\  to  replace  all  a„  by  ajv+i,  and  write  the 
appropriate  inequality.  Sum  the  geometric  series  to  get  the  result. 

In  Problems  3  to  7,  assume  that  the  Maclaurin  series  converges  to  the  function. 

3.  If  0  <  x  <  1,  show  [using  theorem  (14.3)]  that  ■s/T+lc  =  1  +  I®  with  an  error  less 

than  0.032.  Hint:  Note  that  the  series  is  alternating  after  the  first  term. 

4.  Show  that  sin®  =  x  with  an  error  less  than  0.021  for  0  <  x  <  1,  and  with  an  error 

less  than  0.0002  for  0  <  x  <  0.1.  Hint:  Use  theorem  (14.3)  and  note  that  the  “next” 
term  is  the  x3  term. 

5.  Show  that  1  —  cos®  =  ®2/2  with  an  error  less  than  0.003  for  |®|  <  |. 
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6.  Show  that  ln(l  —  x)  =  — x  with  an  error  less  than  0.0056  for  \x\  <  0.1.  Hint:  Use 
theorem  (14.4). 

7.  Show  that  2 / \/A  —  x  =  1  +  |*  with  an  error  less  than  ^  for  0  <  *  <  1.  Hint:  Let 
x  =  4 y,  and  use  theorem  (14.4). 

8.  Estimate  the  error  if  xn/n3  is  approximated  by  the  sum  of  its  first  three  terms 

for  |a:|  < 

9.  Consider  the  series  in  Problem  4.6  and  show  that  the  remainder  after  n  terms  is 
Rn  =  l/(n  +  1).  Compare  the  value  of  term  n  +  1  with  Rn  for  n  =  3,  n  =  10,  n  = 
100,  n  =  500  to  see  that  the  first  neglected  term  is  not  a  useful  estimate  of  the  error. 

10.  Show  that  the  interval  of  convergence  of  the  series  xn  /  (n2  +n)  is  \x\  <  1.  (For 

x  =  1,  this  is  the  series  of  Problem  9.)  Using  theorem  (14.4),  show  that  for  x  = 
four  terms  will  give  two  decimal  place  accuracy. 

11.  Show  that  the  Maclaurin  series  for  sin*  converges  to  sin*.  Hint:  If  /(*)  =  sin*, 
f(n+1\x)  =  ±sin*  or  ±cos*,  and  so  \f(n+1'>  (*)|  <  1  for  all  *  and  all  n.  Let  n  —>  oo 
in  (14.2). 

12.  Show  as  in  Problem  11  that  the  Maclaurin  series  for  ex  converges  to  ex. 

13.  Show  that  the  Maclaurin  series  for  (1  +  x)p  converges  to  (1  +  x)p  when  0  <  *  <  1. 

15.  SOME  USES  0FSERIES 

In  this  chapter  we  are  going  to  consider  a  few  rather  straightforward  uses  of  series. 

In  later  chapters  there  will  also  be  many  other  cases  where  we  need  them. 

Numerical  Computation  With  computers  and  calculators  so  available,  you  may 

wonder  why  we  would  ever  want  to  use  series  for  numerical  computation.  Here  is 

an  example  to  warn  you  of  the  pitfalls  of  blind  computation. 


Example  1.  Evaluate  f(x)  =  In  i/(l  +  x)/(l  —  x)  —  tan*  at  *  =  0.0015. 

Here  are  answers  from  several  calculators  and  computers:  —9  x  10~16,  3  x 
1CT10,  6.06  X  10"16  ,  5.5  x  10  16.  All  of  these  are  wrong!  Let’s  use  series  to  see 
what’s  going  on.  By  Section  13  methods  we  find,  for  x  =  0.0015: 

3  5  7 

In  s/(l  +  x)/(l-x)  =  x+^-  +  ^-  +  ^----  =  0.001500001125001518752441, 

o  O  i 

qr 3  Zx^  17 nrj 

tanx  =  z  +  —  + - +  — -  =  0.001500001125001012500922, 

3  15  315 

f{x)  =  ill  +  -  •  =  5-0625  >< 10-16 

with  an  error  of  the  order  of  *7  or  10~21.  Now  we  see  that  the  answer  is  the 
difference  of  two  numbers  which  are  identical  until  the  16th  decimal  place,  so  any 
computer  carrying  fewer  digits  will  lose  all  accuracy  in  the  subtraction.  It  may  also 
be  necessary  to  tell  your  computer  that  the  value  of  x  is  an  exact  number  and  not 
a  4  decimal  place  approximation.  The  moral  here  is  that  a  computer  is  a  tool — a 
very  useful  tool,  yes-  but  you  need  to  be  constantly  aware  of  whether  an  answer 
is  reasonable  when  you  are  doing  problems  either  by  hand  or  by  computer.  A  final 
point  is  that  in  an  applied  problem  you  may  want,  not  a  numerical  value,  but  a 
simple  approximation  for  a  complicated  function.  Here  we  might  approximate  /(*) 
by  *5/15  for  small  x. 
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Example  2.  Evaluate 

(I  .  2\ 

dx5  [xSmX  )  x=0- 

We  can  do  this  by  computer,  but  it’s  probably  faster  to  use  sin  a;2  =  x2  —  (a:2)3/3!  •  •  • , 
and  observe  that  when  we  divide  this  by  x  and  take  5  derivatives,  the  x2  term  is 
gone.  The  second  term  divided  by  x  is  an  x 5  term  and  the  fifth  derivative  of  x5  is 
5!.  Any  further  terms  will  have  a  power  of  x  which  is  zero  at  x  =  0.  Thus  we  have 

d 5  (1  ~(x2)3\  =_5!  n 

dx 5  \x  3!  J  x=0  3! 

Summing  series  We  have  seen  a  few  numerical  series  which  we  could  sum  exactly 
(see  Sections  1  and  4)  and  we  will  see  some  others  later  (see  Chapter  7,  Section  11). 
Here  it  is  interesting  to  note  that  if  f{x)  =  ^  an xn,  and  we  let  x  have  a  particular 
value  (within  the  interval  of  convergence),  then  we  get  a  numerical  series  whose 
sum  is  the  value  of  the  function  for  that  x.  For  example,  if  we  substitute  x  =  1  in 
(13.4),  we  get 

ln(l  +  I)  =  ln2  =  l-  i  +  i-  i-  -- 

so  the  sum  of  the  alternating  harmonic  series  is  In  2. 

We  can  also  find  sums  of  series  from  tables  or  computer,  either  the  exact  sum 
if  that  is  known,  or  a  numerical  approximation  (see  Problems  20  to  22,  and  also 
Problems  14.2,  16.1,  16.30,  and  16.31). 

Integrals  By  Theorem  1  of  Section  11,  we  may  integrate  a  power  series  term 
by  term.  Then  we  can  find  an  approximation  for  an  integral  when  the  indefinite 
integral  cannot  be  found  in  terms  of  elementary  functions.  As  an  example,  consider 
the  Fresnel  integrals  (integrals  of  sinar  and  cos  a:2)  which  occur  in  the  problem  of 
Fresnel  diffraction  in  optics.  We  find 


t3  t7  t 11 

3  7  •  3!  +  11  •  5! 


, 3  .7 

so  for  t  <  1,  the  integral  is  approximately  y  —  y  with  an  error  <  0.00076  since 
this  is  an  alternating  series  (see  (14.3)). 


Evaluation  of  Indeterminate  Forms  Suppose  we  want  to  find 


lim 


1  -  ex 
x 


If  we  try  to  substitute  x  =  0,  we  get  0/0.  Expressions  that  lead  us  to  such  mean¬ 
ingless  results  when  we  substitute  are  called  indeterminate  forms.  You  can  evaluate 
these  by  computer,  but  simple  ones  can  often  be  done  quickly  by  series.  For  example, 


lim 

x— »0 


1  -ex 
x 


lim 

x— »0 


1  -  (1  +  x  +  (x2/2!)  H - ) 

x 


lim 

x— »0 


=  -l. 
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You  may  recall  L’Hopital’s  rule  which  says  that 


iim  44  =  ,ira  a*> 


<P(x) 


<y(x) 


when  /(a)  and  4>{a)  are  both  zero,  and  f  /(j)'  approaches  a  limit  or  tends  to  infinity 
(that  is,  does  not  oscillate)  as  x  — >  a.  Let’s  use  power  series  to  see  why  this  is  true. 
We  consider  functions  f{x)  and  <j>{x)  which  are  expandable  in  a  Taylor  series  about 
x  =  a,  and  assume  that  <j/(a)  yf  0.  Using  (12.8),  we  have 


lim 


f(x) 

<t>(x) 


/(a)  +  (x 


lim 

x->a  <j)(a)  +  (x 


a)f'(a)  +  (x 
a)<//(a)  +  (x 


a)2f"(a)/ 2!  +  •  •  • 

a)2  4>"  (a)  /  2\  +  •  •  •  ’ 


If  /(a)  =  0  and  <j>{a)  =  0,  and  we  cancel  one  (x  —  a)  factor,  this  becomes 

lim  />)  +  /  ~  +  '  •  •  =  ^TT 

x^a  (j)'(a)  +  (x  —  a)(j>"{a)/2\  +  ■  ■  ■  </>'(a)  x^aft^x) 

as  L’Hopital’s  rule  says.  If  f'(a )  =  0  and  (j>'(a)  =  0,  and  </>"(a)  0,  then  a  repetition 

of  the  rule  gives  the  limit  as  /"(a)/4>"(a),  and  so  on. 

There  are  other  indeterminate  forms  besides  0/0,  for  example,  oo/oo,  0  •  oo, 
etc.  L’Hopital’s  rule  holds  for  the  oo/oo  form  as  well  as  the  0/0  form.  Series  are 
most  useful  for  the  0/0  form  or  others  which  can  easily  be  put  into  the  0/0  form. 
For  example,  the  limit  limx_,o(l/a;)  sinx  is  an  oo  •  0  form,  but  is  easily  written  as 
lim^^o (sin  x)/x  which  is  a  0/0  form.  Also  note  carefully:  Series  (of  powers  of  x)  are 
useful  mainly  in  finding  limits  as  x  — >  0,  because  for  x  =  0  such  a  series  collapses 
to  the  constant  term;  for  any  other  value  of  x  we  have  an  infinite  series  whose  sum 
we  probably  do  not  know  (see  Problem  25,  however). 


Series  Approximations  When  a  problem  in,  say,  differential  equations  or  physics 
is  too  difficult  in  its  exact  form,  we  often  can  get  an  approximate  answer  by  replac¬ 
ing  one  or  more  of  the  functions  in  the  problem  by  a  few  terms  of  its  infinite  series. 
We  shall  illustrate  this  idea  by  two  examples. 


Example  3.  In  elementary  physics  we  find  that  the  equation  of  motion  of  a  simple  pen¬ 
dulum  is  (see  Chapter  11,  Section  8,  or  a  physics  textbook): 


d20 

dt2 


6. 


This  differential  equation  cannot  be  solved  for  9  in  terms  of  elementary  functions 
(see  Chapter  11,  Section  8),  and  you  may  recall  that  what  is  usually  done  is  to 
approximate  sinf?  by  9.  Recall  the  infinite  series  (13.1)  for  sin0;  9  is  simply  the  first 
term  of  the  series  for  sin  9.  (Remember  that  9  is  in  radians;  see  discussion  in  Chapter 
2,  end  of  Section  3.)  For  small  values  of  9  (say  9  <  \  radian  or  about  30°),  this 
series  converges  rapidly,  and  using  the  first  term  gives  a  good  approximation  (see 
Problem  14.4).  The  solutions  of  the  differential  equation  are  then  9  =  A  sin  \Jg/l  t 
and  9  =  B  cos  \  fgjl  t  (A  and  B  constants)  as  you  can  verify;  we  say  that  the 
pendulum  is  executing  simple  harmonic  motion  (see  Chapter  7,  Section  2). 
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Example  4.  Let  us  consider  a  radioactive  substance  containing  Nq  atoms  at  t  =  0.  It  is 
known  that  the  number  of  atoms  remaining  at  a  later  time  t  is  given  by  the  formula 
(see  Chapter  8,  Section  3): 


(15.1) 


N  =  N0e 


-At 


where  A  is  a  constant  which  is  characteristic  of  the  radioactive  substance.  To  find  A 
for  a  given  substance,  a  physicist  measures  in  the  laboratory  the  number  of  decays 
AN  during  the  time  interval  At  for  a  succession  of  At  intervals.  It  is  customary 
to  plot  each  value  of  AN/ At  at  the  midpoint  of  the  corresponding  time  interval 
At.  If  A  At  is  small,  this  graph  is  a  good  approximation  to  the  exact  dN/dtt  graph. 
A  better  approximation  can  be  obtained  by  plotting  AN / At  a  little  to  the  left  of 
the  midpoint.  Let  us  show  that  the  midpoint  does  give  a  good  approximation  and 
also  find  the  more  accurate  t  value.  (An  approximate  value  of  A,  good  enough  for 
calculating  the  correction,  is  assumed  known  from  a  rough  preliminary  graph.) 

What  we  should  like  to  plot  is  the 
graph  of  dN/dt,  that  is,  the  graph  of  the 
slope  of  the  curve  in  Figure  15.1.  What 
we  measure  is  the  value  of  AN/ At  for 
each  At  interval.  Consider  one  such  At 
interval  in  Figure  15.1,  from  t\  to  t2. 

To  get  an  accurate  graph  we  should  plot 
the  measured  value  of  the  quotient 
AN / At  at  the  point  between  t\  and  t2 
where  AN / At  =  dN/dt.  Let  us  write 
this  condition  and  find  the  t  which  sat¬ 
isfies  it.  The  quantity  AN  is  the  change 
in  N,  that  is,  .ZV(t2)  —  N(t±);  the  value 
of  dN/dt  we  get  from  (15.1).  Then 
dN/dt  =  AN/  At  becomes 


(15.2) 


—\Noe~xt  = 


N0e~xt2  -  N0e 

A T 


— At  i 


Multiplying  this  equation  by  (A t/N0)ex(-tl+t2^2,  we  get 

(15  3)  —A  At  g-A[t-(*i+*2)/2]  _  g— A(t2— ti)/2  _  gA(t2-ti)/2  _  g— A  At/2  _  gAAt/2 

since  t2  —  ti  =  At.  Since  we  assumed  A  At  to  be  small,  we  can  expand  the  expo¬ 
nentials  on  the  right-hand  side  of  (15.3)  in  power  series;  this  gives 


(15.4) 


-A  At  e-A[t-(tl+*2)/21  =  -A  At  - 


1  fXAt.Y 

2 -) 


or,  canceling  (—A  At), 

(15.5)  e-A[t-(t1+t2)/2]  =1  +  _L(A  At)2 


Suppose  A  At  is  small  enough  so  that  we  can  neglect  the  term  A-(AAt)2.  Then 
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(15.5)  reduces  to 


e— -M*- (ti+ta)/2]  _  j 

,  _  ti  +  t2 

-  2  ' 

Thus  we  have  justified  the  usual  practice  of  plotting  AN/ At  at  the  midpoint  of  the 
interval  At. 

Next  consider  a  more  accurate  approximation.  From  (15.5)  we  get 


Since  (A  At)2  <C  1,  we  can  expand  the  logarithm  by  (13.4)  to  get 


-A  t  - 


1 1  +  t2 


=  ^(AAt)2. 


Then  we  have 


V+t^  _  1 

2  24AV  ; 


Thus  the  measured  AN/ At  should  be  plotted  a  little  to  the  left  of  the  midpoint  of 
At,  as  we  claimed. 


PROBLEMS,  SECTION  15 


In  Problems  1  to  4,  use  power  series  to  evaluate  the  function  at  the  given  point.  Compare 
with  computer  results,  using  the  computer  to  find  the  series,  and  also  to  do  the  problem 
without  series.  Resolve  any  disagreement  in  results  (see  Example  1). 

— — — ^  at  x  =  0.0003 

2.  1  —  cos  x2  at  x  =  0.012 

Vl  +X4 

3.  In  (x  +  \J l  +  x2  ^  —  sin*  at  *  =  0.001 

4.  e™1  —  (l/*3)  ln(l  +  x3ex)  at  x  =  0.00035 


Use  Maclaurin  series  to  evaluate  each  of  the  following.  Although  you  could  do  them  by 
computer,  you  can  probably  do  them  in  your  head  faster  than  you  can  type  them  into  the 
computer.  So  use  these  to  practice  quick  and  skillful  use  of  basic  series  to  make  simple 
calculations. 


5. 


6. 


7. 


d//_ 

dx4 

dx3  V  1  —  * 


ln(l  +  *3)  at  x  =  0 


at  *  =  0 


d 


10 


dx10 


(ar  tan"  *) 


at  *  =  0 
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8.  lim 

x — >-0 


11.  lim 


1  —  COS  X 


X * 


sin2  2x 


9.  lim 

x — >-0 


12. 


x—0  X 


lim 

x — >-0 


sin  x  —  x 

X3 

tan  x  —  x 
x3 


10.  lim 


1  —  ex 


x^O  xa 


13. 


lim 

x — >-0 


ln(l  —  x) 


x 


Find  a  two  term  approximation  for  each  of  the  following  integrals  and  an  error  bound  for 

the  given  t  interval, 
rt 


14. 


dx,  0  <  t  <  0.1 


15. 


f  y/xe  x  dx,  0  <  t  <  0.01 


Jo  Jo 

Find  the  sum  of  each  of  the  following  series  by  recognizing  it  as  the  Maclaurin  series  for  a 
function  evaluated  at  a  point. 


16. 

18. 


n=l 


17. 

19. 


V  (-I)n  (*\2n 
^  (2 n)\  V  2  / 


20. 


By  computer  or  tables,  find  the  exact  sum  of  each  of  the  following  series. 


(a)  I] 


n 

(4 n2  -  l)2 


(b)  E^r 


(c)  E 


n(n  +  1) 
3™ 


21.  By  computer,  find  a  numerical  approximation  for  the  sum  of  each  of  the  following 
series. 

OO  CO  .  CO  1 

<•>  (b)  <•> 

n= 1  v  ’  n= 2  n= 1 

22.  The  series  l/nSi  s  >  1 ,  is  called  the  Riemann  Zeta  function,  £(s).  (In  Prob¬ 

lem  14.2(a)  you  found  £(2)  =  7r2/6.  When  n  is  an  even  integer,  these  series  can  be 
summed  exactly  in  terms  of  n.)  By  computer  or  tables,  find 

oo  1  °°  .  /*r\  °°  1 

<»>  (b)  <«) 

n=l  n= 1  '  '  n= 1 


23.  Find  the  following  limits  using  Maclaurin  series  and  check  your  results  by  computer. 
Hint:  First  combine  the  fractions.  Then  find  the  first  term  of  the  denominator  series 
and  the  first  term  of  the  numerator  series. 


(a) 

lim 

x — >-0 

(c) 

lim 

X - b0 

w  ii„,  (i-gai 

x — *0  V  X2  Sill  X 


(d) 


lim 

x — >•() 


ln(l  +  x)  1 


24.  Evaluate  the  following  indeterminate  forms  by  using  L’Hopital’s  rule  and  check  your 
results  by  computer.  (Note  that  Maclaurin  series  would  not  be  useful  here  because 
x  does  not  tend  to  zero,  or  because  a  function  (ln.x,  for  example)  is  not  expandable 
in  a  Maclaurin  series.) 


(a) 

. .  x  sin  x 

lim  - 

(b) 

lim  ^ 

a 

1 

1 

x — >-7t/2  ln(l 

(c) 

limln(2-*) 

(d) 

..  lnx 
lim  —-= 

a->i  *  —  1 

x— >00  yJX 

(e) 

lim  x  In  2x 

X - b0 
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(f)  lim  xne  x  (n  not  necessarily  integral) 

x — >-oo 

25.  In  general,  we  do  not  expect  Maclaurin  series  to  be  useful  in  evaluating  indeter¬ 
minate  forms  except  when  x  tends  to  zero  (see  Problem  24).  Show,  however,  that 
Problem  24(f)  can  be  done  by  writing  xne~x  =  xn /ex  and  using  the  series  (13.3) 
for  ex .  Hint:  Divide  numerator  and  denominator  by  xn  before  you  take  the  limit. 
What  is  special  about  the  ex  series  which  makes  it  possible  to  know  what  the  limit 
of  the  infinite  series  is? 

26.  Find  the  values  of  several  derivatives  of  e~  '  at  t  =  0.  Hint:  Calculate  a  few 
derivatives  (as  functions  of  t);  then  make  the  substitution  x  =  1/t2,  and  use  the 
result  of  Problem  24(f)  or  25. 

27.  The  velocity  v  of  electrons  from  a  high  energy  accelerator  is  very  near  the  velocity  c 
of  light.  Given  the  voltage  V  of  the  accelerator,  we  often  want  to  calculate  the  ratio 
v/c.  The  relativistic  formula  for  this  calculation  is  (approximately,  for  V  1) 

V  =  number  of  million  volts. 

Use  two  terms  of  the  binomial  series  (13.5)  to  find  1  —  v/c  in  terms  of  V.  Use  your 
result  to  find  1  —  v/c  for  the  following  values  of  V.  Caution:  V  =  the  number  of 
million  volts. 

(a)  V  =  100  million  volts 

(b)  V  =  500  million  volts 

(c)  V  =  25,  000  million  volts 

(d)  V  =  100  gigavolts  (100x10®  volts  =  10s  million  volts) 

28.  The  energy  of  an  electron  at  speed  v  in  special  relativity  theory  is  mc2(l— u2/c2)-1^2, 
where  m  is  the  electron  mass,  and  c  is  the  speed  of  light.  The  factor  me2  is  called 
the  rest  mass  energy  (energy  when  v  =  0).  Find  two  terms  of  the  series  expansion 
of  (1  —  u2/c2)-1/2,  and  multiply  by  me2  to  get  the  energy  at  speed  v.  What  is  the 
second  term  in  the  energy  series?  (If  v/c  is  very  small,  the  rest  of  the  series  can  be 
neglected;  this  is  true  for  everyday  speeds.) 

29.  The  figure  shows  a  heavy  weight  suspended  by  a  cable  and  pulled 
to  one  side  by  a  force  F.  We  want  to  know  how  much  force  F  is 
required  to  hold  the  weight  in  equilibrium  at  a  given  distance  x  to 
one  side  (say  to  place  a  cornerstone  correctly).  From  elementary 
physics,  T  cos  6  =  W,  and  Tsind  =  F. 

(a)  Find  F/W  as  a  series  of  powers  of  8. 

(b)  Usually  in  a  problem  like  this,  what  we  know  is  not  8,  but  x 
and  l  in  the  diagram.  Find  F/W  as  a  series  of  powers  of  x/l. 

30.  Given  a  strong  chain  and  a  convenient  tree,  could  you  pull  your  car  out  of  a  ditch 
in  the  following  way?  Fasten  the  chain  to  the  car  and  to  the  tree.  Pull  with  a 
force  F  at  the  center  of  the  chain  as  shown  in  the  figure.  From  mechanics,  we  have 
F  =  2 T  sin#,  or  T  =  F/ (2  sin  8),  where  T  is  the  tension  in  the  chain,  that  is,  the 
force  exerted  on  the  car. 

Tree 


F 
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(a)  Find  T  as  x  1  times  a  series  of  powers  of  x. 

(b)  Find  T  as  #_1  times  a  series  of  powers  of  #. 

31.  A  tall  tower  of  circular  cross  section  is  reinforced  by  horizontal  circular  disks  (like 
large  coins),  one  meter  apart  and  of  negligible  thickness.  The  radius  of  the  disk  at 
height  n  is  l/(nlnn)  (n  >  2). 


Assuming  that  the  tower  is  of  infinite  height: 


(a)  Will  the  total  area  of  the  disks  be  finite  or  not?  Hint:  Can  you  compare  the 
series  with  a  simpler  one? 

(b)  If  the  disks  are  strengthened  by  wires  going  around  their  circumferences  like 
tires,  will  the  total  length  of  wire  required  be  finite  or  not? 

(c)  Explain  why  there  is  not  a  contradiction  between  your  answers  in  (a)  and  (b). 
That  is,  how  is  it  possible  to  start  with  a  set  of  disks  of  finite  area,  remove  a 
little  strip  around  the  circumference  of  each,  and  get  an  infinite  total  length 
of  these  strips?  Hint:  Think  about  units — you  can’t  compare  area  and  length. 
Consider  two  cases:  (1)  Make  the  width  of  each  strip  equal  to  one  percent  of 
the  radius  of  the  disk  from  which  you  cut  it.  Now  the  total  length  is  infinite 
but  what  about  the  total  area?  (2)  Try  to  make  the  strips  all  the  same  width; 
what  happens?  Also  see  Chapter  5,  Problem  3.31(b). 

32.  Show  that  the  “doubling  time”  (time  for  your  money  to  double)  is  n  periods  at 
interest  rate  i%  per  period  with  ni  =  69,  approximately.  Show  that  the  error  in  the 
approximation  is  less  than  10%  if  i%  <  20%.  (Note  that  n  does  not  have  to  be  the 
number  of  years;  it  can  be  the  number  of  months  with  i  =  interest  rate  per  month, 
etc.)  Hint:  You  want  (1  +  i/100)n  =  2;  take  In  of  both  sides  of  this  equation  and 
use  equation  (13.4).  Also  see  theorem  (14.3). 

33.  If  you  are  at  the  top  of  a  tower  of  height  h  above  the  surface 
of  the  earth,  show  that  the  distance  you  can  see  along  the 
surface  of  the  earth  is  approximately  s  =  y/2R.h,  where  R  is 
the  radius  of  the  earth.  Hints:  See  figure.  Show  that  h/R  = 
sec#  —  1;  find  two  terms  of  the  series  for  sec#  =  1/cos#, 
and  use  s  =  R8.  Thus  show  that  the  distance  in  miles  is 
approximately  ^Jih/2  with  h  in  feet. 
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16.  MISCELLANEOUS  PROBLEMS 

1.  (a)  Show  that  it  is  possible 


Show  that  it  is  possible  to  stack  a  pile  of  identical  books  so  that  the  top  book 
is  as  far  as  you  like  to  the  right  of  the  bottom  book.  Start  at  the  top  and 
each  time  place  the  pile  already  completed  on  top  of  another  book  so  that 
the  pile  is  just  at  the  point  of  tipping.  (In  practice,  of  course,  you  can’t  let 
them  overhang  quite  this  much  without  having  the  stack  topple.  Try  it  with 


a  deck  of  cards.)  Find  the  distance  from  the  right-hand  end  of  each  book  to 
the  right-hand  end  of  the  one  beneath  it.  To  find  a  general  formula  for  this 
distance,  consider  the  three  forces  acting  on  book  n,  and  write  the  equation  for 
the  torque  about  its  right-hand  end.  Show  that  the  sum  of  these  setbacks  is  a 
divergent  series  (proportional  to  the  harmonic  series).  [See  “Leaning  Tower  of 
The  Physical  Reviews,”  Am.  J.  Phys.  27,  121-122  (1959).] 

(b)  By  computer,  find  the  sum  of  N  terms  of  the  harmonic  series  with  N  =  25, 

100,  200,  1000,  10s,  10100. 

(c)  From  the  diagram  in  (a),  you  can  see  that  with  5  books  (count  down  from  the 
top)  the  top  book  is  completely  to  the  right  of  the  bottom  book,  that  is,  the 
overhang  is  slightly  over  one  book.  Use  your  series  in  (a)  to  verify  this.  Then 
using  parts  (a)  and  (b)  and  a  computer  as  needed,  find  the  number  of  books 
needed  for  an  overhang  of  2  books,  3  books,  10  books,  100  books. 

2.  The  picture  is  a  mobile  constructed  of 
dowels  (or  soda  straws)  connected  by 
thin  threads.  Each  thread  goes  from 
the  left-hand  end  of  a  rod  to  a  point 

on  the  rod  below.  Number  the  rods  1  *  -  --  ■  — — 1 

from  the  bottom  and  find,  for  rod  n,  ■ 

the  distance  from  its  left  end  to  the  nr--  1  !■  i 

thread  so  that  all  rods  of  the  mobile 

will  be  horizontal.  Hint:  Can  you  see 

the  relation  between  this  problem  and 

Problem  1? 

3.  Show  that  ^^2  Vn3^2  is  convergent.  What  is  wrong  with  the  following  “proof” 
that  it  diverges? 


VT25  + 

•  •  > 

+  \/36 

1 

1 

/  1  1 

l2+"' 

“  3 

(1+2  +  3 

which  is 


Since  the  harmonic  series  diverges,  the  original  series  diverges.  Hint:  Compare  3n 
and  n^fn. 


Test  for  convergence: 


00  r)7l 

4. 

n=l 

00  /  1  \2 

*  V-  ("  -  !) 

^  1  +  n2 

n= 2 

y-  '/n-  1 

•  ^(n  +  U’-l 

00 

7.  y  1 

J  n  ln(n3) 

00  n  ^ 

0  v  2n 

^  n4  -  2 

71=2 
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Find  the  interval  of  convergence,  including  end-point  tests: 
°°  x™  _  ^(nl)V1 


»■  E 


— (  ln(n  +  1) 


10.  E 


rl  (2n)l 


12.  E 


5 "(n2  +  1) 


13.  e 


ii-  E 

n= 1 

(x  +  2)n 


(— l)na:2n" 
2n  -  1 


r!  (-3)"v^ 


Find  the  Maclaurin  series  for  the  following  functions. 
14.  cos[ln(l  +  x)]  15.  In  fs^nx 


V 


17.  e1 


16. 


18.  arc  tan  a:  = 


Vl  +  sin* 
:  du 


Jo  1  +  u 

Find  the  first  few  terms  of  the  Taylor  series  for  the  following  functions  about  the  given 
points. 

19.  sin  a:,  a  =  n  20.  \/x,  a  =  8  21.  ex ,  a  =  1 

Use  series  you  know  to  show  that: 

22.  1  —  i  +  i  —  i  +  =  Hint:  See  Problem  18. 


„2  „4  6 

„  _  7T  7V  7V 

'  3T~  5T+  tT  = 


„„  ,  n  (In  3) 2  (In  3)3 

24.  In3+^— J-  +  ^— J-+- 


2! 


3! 


=  2 


25.  Evaluate  the  limit  lim^^o  x2 /  In  cos  x  by  series  (in  your  head),  by  L’Hopital’s  rule, 
and  by  computer. 


Use  Maclaurin  series  to  do  Problems  26  to  29  and  check  your  results  by  computer. 


26. 

lim 

x — >-0 

28. 

lim 

x — >-0 

30. 

(a) 

1 


X2  1  —  COS2  X 
1  +  X  1 


X 


sin  a: 


27. 


29. 


lim 

x — >-0 


1  2 

-7T  —  cot  X 


db  ,  4  a:2 

d^{X€ 


(b) 


It  is  clear  that  you  (or  your  computer)  can’t  find  the  sum  of  an  infinite  se¬ 
ries  just  by  adding  up  the  terms  one  by  one.  For  example,  to  get  £(1.1)  = 
53 1  /n1  1  (see  Problem  15.22)  with  error  <  0.005  takes  about  1033  terms. 
To  see  a  simple  alternative  (for  a  series  of  positive  decreasing  terms)  look  at 
Figures  6.1  and  6.2.  Show  that  when  you  have  summed  N  terms,  the  sum  Rn 
of  the  rest  of  the  series  is  between  In  =  dn  and  Jjv+i  =  f™+1  an  dn. 

Find  the  integrals  in  (a)  for  the  £(1.1)  series  and  verify  the  claimed  number  of 
terms  needed  for  error  <  0.005.  Hint :  Find  N  such  that  Jjv  =  0.005.  Also  find 
upper  and  lower  bounds  for  £(1.1)  by  computing  Y2n=i  l/^1  1  +  Jn  n~ 


and  Etr  1/n1'1  +  f, 


N+ 1 


dn 

dn  where  N  is  far  less  than  1033 .  Hint:  You 
want  the  difference  between  the  upper  and  lower  limits  to  be  about  0.005;  find 
N  so  that  term  ajv  =  0.005. 

31.  As  in  Problem  30,  for  each  of  the  following  series,  find  the  number  of  terms  required 
to  find  the  sum  with  error  <  0.005,  and  find  upper  and  lower  bounds  for  the  sum 
using  a  much  smaller  number  of  terms. 


Ett 


1 


(b)  E 


1 


n(  1  +  In  n)2 


(<=)  E 


1 


nlnn(lnln  n) 
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Complex  Numbers 


1.  INTRODUCTION 

You  will  probably  recall  using  imaginary  and  complex  numbers  in  algebra.  The 
general  solution  of  the  quadratic  equation 

(1.1)  az2  +  bz  +  c  =  0 
for  the  unknown  z,  is  given  by  the  quadratic  formula 

(1.2)  z=-b±W-4ac. 

If  the  discriminant  d  =  (b2  —  4ac)  is  negative,  we  must  take  the  square  root  of 
a  negative  number  in  order  to  find  z.  Since  only  non-negative  numbers  have  real 
square  roots,  it  is  impossible  to  use  (1.2)  when  d  <  0  unless  we  introduce  a  new 
kind  of  number,  called  an  imaginary  number.  We  use  the  symbol  i  =  \J  —  1  with  the 
understanding  that  i2  =  — 1.  Then 

\/— 16  =  4  i,  \J — 3  =  iV3,  i 3  =  —  i 

are  imaginary  numbers,  but 

i2  =  — 1,  =  iV2  ■  iV8  =  - 4,  i4n  =  1 

are  real.  In  (1.2)  we  also  need  combinations  of  real  and  imaginary  numbers. 


Example.  The  solution  of 

z2  -  2z  +  2  =  0 
is 

2±>/4^8  2±y^4  „  , 

z  = - - - =  - - - =  1  ±  i. 

2  2 

We  use  the  term  complex  number  to  mean  any  one  of  the  whole  set  of  numbers, 
real,  imaginary,  or  combinations  of  the  two  like  1  ±  i.  Thus,  *  +  5,  17*,  4,  3  +  i\J 5 
are  all  examples  of  complex  numbers. 
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Once  the  new  kind  of  number  is  admitted  into  our  number  system,  fascinating 
possibilities  open  up.  Can  we  attach  any  meaning  to  marks  like  sin*,  eI7r,  ln(l  +  *)? 
We’ll  see  later  that  we  can  and  that,  in  fact,  such  expressions  may  turn  up  in 
problems  in  physics,  chemistry,  and  engineering,  as  well  as  mathematics. 

When  people  first  considered  taking  square  roots  of  negative  numbers,  they  felt 
very  uneasy  about  the  problem.  They  thought  that  such  numbers  could  not  have 
any  meaning  or  any  connection  with  reality  (hence  the  term  “imaginary”).  They 
certainly  would  not  have  believed  that  the  new  numbers  could  be  of  any  practical 
use.  Yet  complex  numbers  are  of  great  importance  in  a  variety  of  applied  fields;  for 
example,  the  electrical  engineer  would,  to  say  the  least,  be  severely  handicapped 
without  them.  The  complex  notation  often  simplifies  setting  up  and  solving  vibra¬ 
tion  problems  in  either  dynamical  or  electrical  systems,  and  is  useful  in  solving  many 
differential  equations  which  arise  from  problems  in  various  branches  of  physics.  (See 
Chapters  7  and  8.)  In  addition,  there  is  a  highly  developed  field  of  mathematics 
dealing  with  functions  of  a  complex  variable  (see  Chapter  14)  which  yields  many 
useful  methods  for  solving  problems  about  fluid  flow,  elasticity,  quantum  mechanics, 
and  other  applied  problems.  Almost  every  field  of  either  pure  or  applied  mathe¬ 
matics  makes  some  use  of  complex  numbers. 

2.  REAL  AND  IMAGINARY  PARTS  OFA  COMPLEX  NUMBER 

A  complex  number  such  as  5  +  3*  is  the  sum  of  two  terms.  The  real  term  (not 
containing  i)  is  called  the  real  part  of  the  complex  number.  The  coefficient  of  i  in 
the  other  term  is  called  the  imaginary  part  of  the  complex  number.  In  5  +  3*,  5  is 
the  real  part  and  3  is  the  imaginary  part.  Notice  carefully  that  the  imaginary  part 
of  a  complex  number  is  not  imaginaryl 

Either  the  real  part  or  the  imaginary  part  of  a  complex  number  may  be  zero. 
If  the  real  part  is  zero,  the  complex  number  is  called  imaginary  (or,  for  emphasis, 
pure  imaginary) .  The  zero  real  part  is  usually  omitted;  thus  0  +  5*  is  written  just 
5*.  If  the  imaginary  part  of  the  complex  number  is  zero,  the  number  is  real.  We 
write  7  +  0*  as  just  7.  Complex  numbers  then  include  both  real  numbers  and  pure 
imaginary  numbers  as  special  cases. 

In  algebra  a  complex  number  is  ordinarily  written  (as  we  have  been  doing)  as  a 
sum  like  5  +  3*.  There  is  another  very  useful  way  of  thinking  of  a  complex  number. 
As  we  have  said,  every  complex  number  has  a  real  part  and  an  imaginary  part 
(either  of  which  may  be  zero).  These  are  two  real  numbers,  and  we  could,  if  we 
liked,  agree  to  write  5  +  3*  as  (5,3).  Any  complex  number  could  be  written  this  way 
as  a  pair  of  real  numbers,  the  real  part  first  and  then  the  imaginary  part  (which,  you 
must  remember,  is  real).  This  would  not  be  a  very  convenient  form  for  computation, 
but  it  suggests  a  very  useful  geometrical  representation  of  a  complex  number  which 
we  shall  now  consider. 


3.  THE  COMPLEX  PLANE 

In  analytic  geometry  we  plot  the  point  (5,3)  as  shown  in  Figure  3.1.  As  we  have 
seen,  the  symbol  (5,  3)  could  also  mean  the  complex  number  5  +  3*.  The  point  (5,  3) 
may  then  be  labeled  either  (5,3)  or  5  +  3*.  Similarly,  any  complex  number  x  +  iy  ( x 
and  y  real)  can  be  represented  by  a  point  (x,  y)  in  the  (x,  y)  plane.  Also  any  point 
(x,y)  in  the  (x,y)  plane  can  be  labeled  x  +  iy  as  well  as  ( x,y ).  When  the  (x,y) 
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plane  is  used  in  this  way  to  plot  complex  numbers,  it  is  called  the  complex  plane.  It 
is  also  sometimes  called  an  Argand  diagram.  The  x  axis  is  called  the  real  axis,  and 
the  y  axis  is  called  the  imaginary  axis  (note,  however,  that  you  plot  y  and  not  iy). 


Figure  3.1 

When  a  complex  number  is  written  in  the  form  x  +  iy,  we  say  that  it  is  in 
rectangular  form  because  x  and  y  are  the  rectangular  coordinates  of  the  point 
representing  the  number  in  the  complex  plane.  In  analytic  geometry,  we  can  locate 
a  point  by  giving  its  polar  coordinates  (r,  9)  instead  of  its  rectangular  coordinates 
(x,y).  There  is  a  corresponding  way  to  write  any  complex  number.  In  Figure  3.2, 

x  =  rcosO, 

(3-1) 

y  =  rsind. 

Then  we  have 

x  +  iy  =  r  cos  9  +  ir  sin  9 
=  r  (cos 


(3.2) 


i  sin0). 

This  last  expression  is  called  the  polar  form  of  the  complex  number.  As  we  shall 
see  (Sections  9  to  16),  the  expression  (cos 9  +  i  sin0)  can  be  written  as  elS,  so  a 
convenient  way  to  write  the  polar  form  of  a  complex  number  is 


(3.3) 


x  +  iy  =  r( cos  9  +  i  sin  9)  =  relS . 


The  polar  form  re 10  of  a  complex  number 
is  often  simpler  to  use  than  the  rectangu¬ 
lar  form. 

Example.  In  Figure  3.3  the  point  A  could  be 
labeled  as  (l,\/3)  or  as  1  +  iV 3.  Simi¬ 
larly,  using  polar  coordinates,  the  point 
A  could  be  labeled  with  its  (r,6)  values 
as  (2,7t/3).  Notice  that  r  is  always  taken 
positive.  Using  (3.3)  we  have 

1  +  iV 3  =  2  (cos  —  +  i  sin  —  ^  =  2el7r^3. 

This  gives  two  more  ways  to  label  point 
A  in  Figure  3.3. 


Figure  3.3 
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Radians  and  Degrees  In  Figure  3.3,  the  angle  7r/3  is  in  radians.  Ever  since 
you  studied  calculus,  you  have  been  expected  to  measure  angles  in  radians  and  not 
degrees.  Do  you  know  why?  You  have  learned  that  (d/dx)  sin  x  =  cosx.  This 
formula  is  not  correct—  unless  x  is  in  radians.  (Look  up  the  derivation  in  your 
calculus  book!)  Many  of  the  formulas  you  now  know  and  use  are  correct  only  if 
you  use  radian  measure;  consequently  that  is  what  you  are  usually  advised  to  do. 
However,  it  is  sometimes  convenient  to  do  computations  with  complex  numbers 
using  degrees,  so  it  is  important  to  know  when  you  can  and  when  you  cannot  use 
degrees.  You  can  use  degrees  to  measure  an  angle  and  to  add  and  subtract  angles 
as  long  as  the  final  step  is  to  find  the  sine,  cosine,  or  tangent  of  the  resulting  angle 
(with  your  calculator  in  degree  mode).  For  example,  in  Figure  3.3,  we  can,  if  we  like, 
say  that  0  =  60°  instead  of  6  =  7t/3.  If  we  want  to  find  sin(7r/3— 7t/4)  =  sin(7r/12)  = 
0.2588  (calculator  in  radian  mode),  we  can  instead  find  sin(60°  —  45°)  =  sin  15°  = 
0.2588  (calculator  in  degree  mode).  Note  carefully  that  an  angle  is  in  radians  unless 
the  degree  symbol  is  used;  for  example,  in  sin  2,  the  2  is  2  radians  or  about  115°. 

In  formulas,  however,  use  radians.  For  example,  in  using  infinite  series,  we  say 
that  sin0  =  6  for  very  small  0.  Try  this  on  your  calculator;  you  will  find  that 
it  is  true  in  radian  mode  but  not  in  degree  mode.  As  another  example,  consider 
fo  dx/{1  +  x2)  =  arc  tan  1  =  7t/4  =  0.785.  Here  arc  tan  1  is  not  an  angle;  it  is  the 
numerical  value  of  the  integral,  so  the  answer  45  (obtained  from  a  calculator  in 
degree  mode)  is  wrong!  Do  not  use  degree  mode  in  reading  an  arc  tan  (or  arc  sin  or 
arc  cos)  unless  you  are  finding  an  angle  [for  example,  in  Figure  3.2,  6  =  arc  tan(y/a;) , 
and  in  Figure  3.3,  9  =  arc  tan  V3  =  tt/3  or  60°]. 

4.  TERMINOLOGY  AND  NOTATION 

Both  i  and  j  are  used  to  represent  \f— T,  j  usually  in  any  problem  dealing  with 
electricity  since  i  is  needed  there  for  current.  A  physicist  should  be  able  to  work 
with  ease  using  either  symbol.  We  shall  for  consistency  use  i  throughout  this  book. 

We  often  label  a  point  with  a  single  letter  (for  example,  P  in  Figure  3.2  and  A 
in  Figure  3.3)  even  though  it  requires  two  coordinates  to  locate  the  point.  If  you 
have  studied  vectors,  you  will  recall  that  a  vector  is  represented  by  a  single  letter, 
say  v,  although  it  has  (in  two  dimensions)  two  components.  It  is  customary  to  use 
a  single  letter  for  a  complex  number  even  though  we  realize  that  it  is  actually  a  pair 
of  real  numbers.  Thus  we  write 


(4.1) 


z  =  x  +  iy  =  r(cos  9  +  i  sin  9)  =  rel6 . 


Here  z  is  a  complex  number;  x  is  the  real  part  of  the  complex  number  z,  and  y  is 
the  imaginary  part  of  z.  The  quantity  r  is  called  the  modulus  or  absolute  value  of 
z,  and  9  is  called  the  angle  of  z  (or  the  phase ,  or  the  argument ,  or  the  amplitude  of 
z).  In  symbols: 


(4.2) 


R ez  =  x,  \z\  =  mod  z  =  r  =  \J  x2  +  y2, 

Im  z  =  y  (not  iy),  angle  of  z  =  9. 
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The  values  of  9  should  be  found  from  a  diagram  rather  than  a  formula,  although 
we  do  sometimes  write  9  =  arctan(y/rr).  An  example  shows  this  clearly. 


Example.  Write  z  =  —1  —  *  in  polar  form.  Here  we  have  x  =  —1,  y  =  — 1,  r  =  \/2  (Figure 
4.1).  There  are  an  infinite  number  of  values  of  6 , 


(4.3) 


9  = 


57T 


2t*7t, 


where  n  is  any  integer,  positive  or  negative.  The 
value  9  =  57t/4  is  sometimes  called  the  principal 
angle  of  the  complex  number  z  =  —  1  —  i.  Notice 
carefully,  however,  that  this  is  not  the  same  as 
the  principal  value  7r/4  of  arc  tan  1  as  defined  in 
calculus.  The  angle  of  a  complex  number  must  be 

in  the  same  quadrant  as  the  point  representing  the  number.  For  our  present  work, 
any  one  of  the  values  in  (4.3)  will  do;  here  we  would  probably  use  either  57r/4  or 
— 37t/4.  Then  we  have  in  our  example 


z  = 


=  -l-i=y/2 


COS 


=  \f2  (  cos  — 
V  4 


57T 

T 

5-7T 


2mr 


i  sin 


57T 

T 


2mr 


*sin^  =  v/2e5i7r/4. 


[We  could  also  write  2  =  y/2  (cos  225°  +  i  sin  225°).] 

The  complex  number  x  —  iy ,  obtained  by  changing  the  sign  of  i  in  z  =  x  +  iy, 
is  called  the  complex  conjugate  or  simply  the  conjugate  of  z.  We  usually  write  the 
conjugate  of  z  =  x  +  iy  as  z  =  x  —  iy.  Sometimes  we  use  z*  instead  of  z  (in  fields 
such  as  statistics  or  quantum  mechanics  where  the  bar  may  be  used  to  mean  an 
average  value).  Notice  carefully  that  the  conjugate  of  7i  —  5  is  —7 i  —  5;  that  is,  it 
is  the  i  term  whose  sign  is  changed. 

Complex  numbers  come  in  conjugate  pairs;  for 
example,  the  conjugate  of  2  +  3*  is  2  —  3*  and  the 
conjugate  of  2  —  3*  is  2+3*.  Such  a  pair  of  points  in 
the  complex  plane  are  mirror  images  of  each  other 
with  the  x  axis  as  the  mirror  (Figure  4.2).  Then 
in  polar  form,  z  and  z  have  the  same  r  value,  but 
their  9  values  are  negatives  of  each  other.  If  we 
write  z  =  r( cos  9  +  *  sin  9),  then 


Figure  4.2 


(4.4)  z  =  r[cos(— 9)  +  i  sin(— 0)]  =  r(cos  9  —  i  sin  9)  =  re  l6 . 


PROBLEMS,  SECTION  4 

For  each  of  the  following  numbers,  first  visualize  where  it  is  in  the  complex  plane.  With  a 
little  practice  you  can  quickly  find  x,  y,  r,  9  in  your  head  for  these  simple  problems.  Then 
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plot  the  number  and  label  it  in  five  ways  as  in  Figure  3.3.  Also  plot  the  complex  conjugate 
of  the  number. 


1. 

1+2 

2. 

i  —  1 

3.  l-iV3 

4. 

— V3  +  i 

5. 

2  i 

6.  -4  i 

7. 

-1 

8. 

3 

9.  2i  -  2 

10. 

2  —  2? 

11. 

_  /  7T  .  .  7T\ 

2  ^cos  -  +  2  sm  —  J 

12. 

A  (  2tt  .  .  2tt\ 

yOs  ~3  1  sin  ”3”  J 

13. 

3tt  .  .  37t 
cos  —  +  2  sm  — 

14. 

„  /  n  .  .  7r\ 

2  cos  —  +  *  sm  — 

V  4  4/ 

15. 

cos  7T  —  2  sin  7 r 

16. 

5(cos0  +  isinO) 

17. 

V2e-i,r/4 

18. 

3  eivr/2 

19. 

5  (cos  20°  +fsin20°) 

20. 

7  (cos  110°  —  i  sin  1 10° ) 

5.  COMPLEX  ALGEBRA 

A.  Simplifying  to  x -\-iy  form 

Any  complex  number  can  be  written  in  the  rectangular  form  x  +  iy.  To  add, 
subtract,  and  multiply  complex  numbers,  remember  that  they  follow  the  ordinary 
rules  of  algebra  and  that  i2  =  —1. 


Example  1. 

(1  +  i)2  =  1  +  2i  +  i2  =  1  +  2i  -  1  =  2i 

To  divide  one  complex  number  by  another,  first  write  the  quotient  as  a  fraction. 
Then  reduce  the  fraction  to  rectangular  form  by  multiplying  numerator  and  denom¬ 
inator  by  the  conjugate  of  the  denominator;  this  makes  the  denominator  real. 


Example  2. 

2  +  i  2  +  i  3  +  i  6  +  5  i  +  i2  5  +  5  i  11. 

3  —  *  =  3 ~i  '  3+7  =  9  -i2  =  10  =  2  +  2*' 

It  is  sometimes  easier  to  multiply  or  divide  complex  numbers  in  polar  form. 


Example  3.  To  find  (1  +  i)2  in  polar  form,  we  first  sketch  (or  picture  mentally)  the  point 
(1,1).  From  Figure  5.1,  we  see  that  r  =  v/2,  and  9  =  7r/4,  so  (1  +  i)  =  v/2ei7r/4. 
Then  from  Figure  5.2  we  find  the  same  result  as  in  Example  1. 

(1  +  i)2  =  (v/2ei7r/4)2  =  2ei7r/2  =  2  i. 
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Example  4.  Write  l/[2(cos20°  +  *sin20°]  in  x  +  iy  form.  Since  20°  =  7r/9  radians, 


2(cos20°  +  *  sin  20°)  2(cos7r/9  +  *sin7r/9)  2ei7r/9 

=  0.5(cos7t/9  —  i  sin  7r/9)  =  0.47  —  0 .177. 

by  calculator  in  radian  mode.  We  obtain  the  same  result  leaving  the  angle  in  degrees 
and  using  a  calculator  in  degree  mode:  0.5 (cos  20°  —  i  sin  20°)  =0.47  —  0.17*. 
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First  simplify  each  of  the  following  numbers  to  the  x  +  iy  form  or  to  the  rel°  form.  Then 
plot  the  number  in  the  complex  plane. 


1. 


4. 

7. 

9. 

10. 


1 

1  +i 

2. 

1 

2—1 

i2  +  2i  +  1  5. 

(*  +  V3)2 

3  +  i 

2  +i 

8. 

1.6  -  2.7* 

25  e2i 

Careful!  The  angle 

is  2  radians 

3i  —  7 
i  +  4 

Careful!  Not  3  —  7 * 

3.  i 


6. 


1  +  i 
1  —  i 


11. 

17-  12* 

12. 

3(cos28°  +  *  sin  28°) 

(  2t r  2tt\ 

2.8e“i(11) 

13. 

5  cos  —  +  i  sm  — 

V  5  5  ) 

14. 

15. 

5-2  i 

16. 

1 

5  +  2  i 

0.5(cos40°  +  i  sin  40°) 

17. 

(1.7 -3.2 if 

18. 

(0.64  +  0.77*)4 

Find  each  of  the  following  in  rectangular  (a  +  bi)  form  if  2  =  2  —  3i;  if  z  =  x  +  iy. 


19. 


22. 


20. 

1 

21. 

1 

z+1 

23. 

1  +  z 

1  -z 

24. 

zjz 

B.  Complex  Conjugate  of  a  Complex  Expression 

It  is  easy  to  see  that  the  conjugate  of  the  sum  of  two  complex  numbers  is  the  sum 
of  the  conjugates  of  the  numbers.  If 


z\  =  xi  +  iyi  and  z2  =  x2  +  iyi. 


then 


zi  +  z2  =  X!  -  iyi  +x2-  iy2  =  Xi  +  x2  -  i(y1  +  y2). 


The  conjugate  of  (z\  +  z2)  is 


(xi  +  x2)  +  i(yi  +  y2)  =  (x\  +  x2)  -  i(y i  +  y2). 
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Similarly,  you  can  show  that  the  conjugate  of  the  difference  (or  product  or  quotient) 
of  two  complex  numbers  is  equal  to  the  difference  (or  product  or  quotient)  of  the 
conjugates  of  the  numbers  (Problem  25).  In  other  words,  you  can  get  the  conjugate 
of  an  expression  containing  i' s  by  just  changing  the  signs  of  all  the  i  terms.  We 
must  watch  out  for  hidden  i’s,  however. 

Example.  If 


But  if  z 
of  2  is  z 

PROBLEMS,  SECTION  5 

25.  Prove  that  the  conjugate  of  the  quotient  of  two  complex  numbers  is  the  quotient  of 
the  conjugates.  Also  prove  the  corresponding  statements  for  difference  and  product. 
Hint:  It  is  easier  to  prove  the  statements  about  product  and  quotient  using  the 
polar  coordinate  re*8  form;  for  the  difference,  it  is  easier  to  use  the  rectangular  form 
x  +  iy. 

C.  Finding  the  Absolute  Value  of  z 

Recall  that  the  definition  of  \z\  is  \z\  =  r  =  y/ x2  +  y2  (positive  square  root!). 
Since  =  (x  +  iy) (x  —  iy)  =  x2  +  y2,  or,  in  polar  coordinates,  2:2  =  ( reie)(re~l e)  = 
r2,  we  see  that  \z\2  =  zz,  or  |,s|  =  \/zM.  Note  that  zz  is  always  real  and  >  0,  since 
x,  y ,  and  r  are  real.  We  have 


2  —  3  i  _  2  +  3* 

z  = - ,  then  z  =  - . 

i  +  4  — *  +  4 

=  f+ig ,  where  /  and  g  are  themselves  complex,  then  the  complex  conjugate 
=  f~ig  ( not  /  -  ig ). 


(5.1) 


\z\  =  r  =  \J  x2  +  y2  =  yfzz. 


By  Problem  25  and  (5.1),  the  absolute  value  of  a  quotient  of  two  complex  numbers 
is  the  quotient  of  the  absolute  values  (and  a  similar  statement  for  product). 


Example. 

VS  +3*  _  |-s/5  +  3z|  _  Vu  _  r- 
1  -i  |l-i|  V2 


PROBLEMS,  SECTION  5 

Find  the  absolute  value  of  each  of  the  following  using  the  discussion  above.  Try  to  do 
simple  problems  like  these  in  your  head — it  saves  time. 


2i  -  1 

27. 

2  +  3i 

28. 

Z 

i  -  2 

1  —  i 

Z 

(1  +  2i)3 

30. 

3  i 

31. 

5-2  i 

i-V 3 

5  +  2  i 

"to 

1 

CO 

iS>. 

(P* 

33. 

25 

34. 

(1  +  iV 

3  +  4i 

U-iJ 

5 
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D.  Complex  Equations 

In  working  with  equations  involving  complex  quantities,  we  must  always  remember 
that  a  complex  number  is  actually  a  pair  of  real  numbers.  Two  complex  numbers 
are  equal  if  and  only  if  their  real  parts  are  equal  and  their  imaginary  parts  are  equal. 
For  example,  x  +  iy  =  2  +  3 i  means  x  =  2  and  y  =  3.  In  other  words,  any  equation 
involving  complex  numbers  is  really  two  equations  involving  real  numbers. 


Example.  Find  x  and  y  if 

(5.2)  (x  +  iy)2  =  2i. 

Since  (x  +  iy)2  =  x2  +  2 ixy  —  y2,  (5.2)  is  equivalent  to  the  two  real  equations 

x2-y2  =  0, 

2  xy  =  2. 

From  the  first  equation  y2  =  or,  we  find  y  —  x  or  y  =  —  x.  Substituting  these  into 
the  second  equation  gives 

2x2  =  2  or  —  2x2  =  2. 

Since  x  is  real,  x2  cannot  be  negative.  Thus  we  find  only 

x2  =  1  and  y  =  x, 

that  is, 

x  =  y  =  1  and  x  =  y  =  —  1. 
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Solve  for  all  possible  values  of  the  real  numbers  x  and  y  in  the  following  equations. 


35. 

x  +  iy  =  3i  —  4 

36. 

2  ix  +  3  =  y  —  i 

37. 

x  +  iy  =  0 

38. 

x  +  iy  =  2i  —  7 

39. 

x  +  iy  =  y  +  ix 

40. 

x  +  iy  =  3i  —  ix 

41. 

(2x  —  3y  —  5)  +  i(x  +  2y  +  1)  =0 

42. 

{x  +  2y  +  3)  +  i(3x  —  y 

43. 

( x  +  iy)2  =  2  ix 

44. 

x  +  iy  =  (1  —  i)2 

45. 

{x  +  iy)2  =  (x  -  iy)2 

46. 

x  +  iy 
x  —  iy 

47. 

{x  +  iy)3  =  -1 

48. 

x  +  iy  +  2  +  3i 

—  2  2j 

2x  +  2  iy  —  3 

49. 

|1  —  (x  +  iy)\  =  x  +  iy 

50. 

\x  +  iy  =  y  -  ix 

E. 

Graphs 

Using  the  graphical  representation  of  the  complex  number  z  as  the  point  (x,  y)  in 
a  plane,  we  can  give  geometrical  meaning  to  equations  and  inequalities  involving  z. 


Section  5 


Complex  Algebra  55 


Example  1.  What  is  the  curve  made  up  of  the  points  in  the  (x,y)  plane  satisfying  the 
equation  |z|  =  3? 

Since  _ 

M  =  \A2  +  2/2, 

the  given  equation  is 


\j  x1  +  y2  =  3  or  x2  +  y2  =  9. 

Thus  |*|  =  3  is  the  equation  of  a  circle  of  radius  3  with  center  at  the  origin.  Such 
an  equation  might  describe,  for  example,  the  path  of  an  electron  or  of  a  satellite. 
(See  Section  F  below.) 

Example  2. 


(a)  \z  —  1|  =  2.  This  is  the  circle  ( x  —  l)2  +  y2  =  4. 

(b)  |^r  —  1 1  <2.  This  is  the  disk  whose  boundary  is  the  circle  in  (a). 

Note  that  we  use  “circle”  to  mean  a  curve  and  “disk”  to  mean  an  area.  The  interior 
of  the  disk  is  given  by  \z  —  1|  <2. 

Example  3.  (Angle  of  z)  =  7r/4.  This  is  the  half-line  y  =  x  with  x  >  0;  this  might  be  the 
path  of  a  light  ray  starting  at  the  origin. 

Example  4.  Re  2  >  |.  This  is  the  half-plane  x  >  |. 

PROBLEMS,  SECTION  5 

Describe  geometrically  the  set  of  points  in  the  complex  plane  satisfying  the  following 


equations. 

51. 

M  =  2 

52. 

Re  2  =  0 

53. 

l*-l|  =  1 

54. 

|2-1|  <1 

55. 

z  —  z  =  5i 

56. 

angle  of  2  =  ^ 

57. 

II 

IN 

o> 

58. 

Re  2  >  2 

59. 

\z  +  3i|  =4 

60. 

2  —  1  +  i|  =  2 

61. 

Imz  <  0 

62. 

|z  +  l|  +  |z-l|  =8 

63. 

to 

II 

t'il 

to 

64. 

2  -2 

2  =  — 2 

65. 

Show  that  |  z\  —  Z2 

is  the  distance  between  the  points  21  and  22  in  the  complex 

plane.  Use  this  result  to  identify  the  graphs  in  Problems  53,  54,  59,  and  60  without 
computation. 


F.  Physical  Applications 

Problems  in  physics  as  well  as  geometry  may  often  be  simplified  by  using  one  com¬ 
plex  equation  instead  of  two  real  equations.  See  the  following  example  and  also 
Section  16. 
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Example.  A  particle  moves  in  the  (x,  y)  plane  so  that  its  position  (x,  y)  as  a  function  of 
time  t  is  given  by 

i  +  2t 

z  =  x  +  iy  =  - . 

t  —  % 

Find  the  magnitudes  of  its  velocity  and  its  acceleration  as  functions  of  t. 


We  could  write  z  in  x  +  iy  form  and  so  find  x  and  y  as  functions  of  t.  It  is 
easier  to  do  the  problem  as  follows.  We  define  the  complex  velocity  and  complex 
acceleration  by 


dz 

dt 


dx  dy 

- h  l  — — 

dt  dt 


and 


d2z 

dt2 


d2 x  .  d2y 
dt2  dt2 


Then  the  magnitude  v  of  the  velocity  is  v  =  y/( dx/dt )2  +  ( dy/dt )2  =  \dz/dt\,  and 
similarly  the  magnitude  a  of  the  acceleration  is  a  =  \d2z/di2\.  Thus  we  have 


dz  2  (t  —  *)  —  (*  +  2 1)  —3  i 

dt  (t  —  i )2  (t  —  i)2 


v  = 

d2z 

dt2 

a  = 


dz  I  —3  i  '  +3i 

dt  y  (t  —  i)2  (t  +  i)2 

(— 3*)(— 2)  _  6 i 


(t~i)3  ~(t-i)3’ 

d2z  6 

-  (t2  +  1)3/2‘ 


3 

¥+1’ 


Note  carefully  that  all  physical  quantities  ( x ,  y,  v ,  and  a)  are  real;  the  complex 
expressions  are  used  just  for  convenience  in  calculation. 
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66.  Find  x  and  y  as  functions  of  t  for  the  example  above,  and  verify  for  this  case  that 
v  and  a  are  correctly  given  by  the  method  of  the  example. 

67.  Find  v  and  a  if  z  =  (1  —  it)/{2t  +  i). 

68.  Find  v  and  a  if  z  =  cos  2 t  +  i  sin  2 1.  Can  you  describe  the  motion? 

6.  COMPLEX  INFINITE  SERIES 

In  Chapter  1  we  considered  infinite  series  whose  terms  were  real.  We  shall  be  very 
much  interested  in  series  with  complex  terms;  let  us  reconsider  our  definitions  and 
theorems  for  this  case.  The  partial  sums  of  a  series  of  complex  numbers  will  be 
complex  numbers,  say  Sn  =  Xn  +  iYn,  where  Xn  and  Yn  are  real.  Convergence  is 
defined  just  as  for  real  series:  If  Sn  approaches  a  limit  S  =  X  +  iY  as  n  — >  oo, 
we  call  the  series  convergent  and  call  S  its  sum.  This  means  that  Xn  — >  X  and 
Yn  — >  Y;  in  other  words,  the  real  and  the  imaginary  parts  of  the  series  are  each 
convergent  series. 

It  is  useful,  just  as  for  real  series,  to  discuss  absolute  convergence  first.  It  can 
be  proved  (Problem  1)  that  an  absolutely  convergent  series  converges.  Absolute 
convergence  means  here,  just  as  for  real  series,  that  the  series  of  absolute  values  of 
the  terms  is  a  convergent  series.  Remember  that  \z\  =  r  =  \J x2  +  y2  is  a  positive 
number.  Thus  any  of  the  tests  given  in  Chapter  1  for  convergence  of  series  of 
positive  terms  may  be  used  here  to  test  a  complex  series  for  absolute  convergence. 
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Example  1.  Test  for  convergence 

1  1  +  i  (1  +  i)2  (1  +  i)3 

2  4  8 

Using  the  ratio  test,  we  find 

p  =  lim 


(i  +  iy 


(1  +  i)n+1 

(1  +  i)n 

=  lim 

1  +  i 

1  +  i 

2n+1 

2" 

n— kx) 

2 

2 

.  Since  p  <  1,  the  series  is  absolutely  convergent  and  therefore  convergent. 


Example  2.  Test  for  convergence  in/^/n.  Here  the  ratio  test  gives  1  so  we  must  try 
a  different  test.  Let’s  write  out  a  few  terms  of  the  series: 


i  _  i  1  i  _  1 

*“  71 "  7!  +  Vt +  VE  ~  VE  " ' ' 

We  see  that  the  real  part  of  the  series  is 

j_  i _ L,  y  (-i)ra 

V2  V4  V6  V  ^  ’ 


and  the  imaginary  part  of  the  series  is 

1  1 

73  +  7I 


,  1  1  + 
1  .r +  ,/k'"  E 


_tl 2E 

x/2n  +  1 


Verify  that  both  these  series  satisfy  the  alternating  series  test  for  convergence.  Thus, 
the  original  series  converges. 


Example  3.  Test  for  convergence  Yo^  zn  =  ^^°(rel£,)ra  =  YT  rnem8 .  This  is  a  geometric 
series  with  ratio  =  2  =  re10;  it  converges  if  and  only  if  \z\  <  1.  Recall  that  \z\  =  r. 
Thus,  rnemS  converges  if  and  only  if  r  <  1. 


PROBLEMS,  SECTION  6 

1.  Prove  that  an  absolutely  convergent  series  of  complex  numbers  converges.  This 
means  to  prove  that  Y(an  +  i&n)  converges  (an  and  bn  real)  if  YVan  +  b'n  con¬ 
verges.  Hint:  Convergence  of  Y{a^Hibn)  means  that  ^2  an  and  X)  bn  both  converge. 
Compare  \an\  and  [6„|  with  \/a'2  +  b and  use  Problem  7.9  of  Chapter  1. 


Test  each  of  the  following  series  for  convergence. 


2. 

E(1+*)n 

3. 

E  (1  _|_  j)n 

5. 

El 

\nz  n  J 

6. 

y-  1  +  i 

2—/  n2 

8. 

\  ^  inir/6 

Ee 

9. 

•n 

E- 

^  n 

11. 

El 

(  2  +  i\2n 
l  3  -  4 i 

12. 

(3  +  2 1)‘ 
n\ 

4. 

7. 

10. 

13. 


E 

E 

E 

E 


n 


(»-i) 

n 


1  +  2 
1  -iy/3 


n 


n 


14.  Prove  that  a  series  of  complex  terms  diverges  if  p  >  1  (p  =  ratio  test  limit).  Hint: 
The  nth  term  of  a  convergent  series  tends  to  zero. 
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7.  COMPLEX  POWER  SERIES;  DISK  OF  CONVERGENCE 

In  Chapter  1  we  considered  series  of  powers  of  x,  'Yfanxn.  We  are  now  interested 
in  series  of  powers  of  z, 

(7.1) 


where  z  =  x  +  iy,  and  the  an  are  complex  numbers.  [Notice  that  (7.1)  includes  real 
series  as  a  special  case  since  z  =  x  if  y  =  0.]  Here  are  some  examples. 


(7.2a) 

(7.2b) 

(7.2c) 


1-2 
1  +  iz  + 

OO 

E 


2  3  +  4 

( iz )2  ( iz )3 


22  iz3 


2! 

(. z+l-i)r 


3! 


n— 0 


3n  n 2 


Let  us  use  the  ratio  test  to  find  for  what  z  these  series  are  absolutely  convergent. 
For  (7.2a),  we  have 

z  ■  n 


p  =  lim 


n+l 


=  z  . 


The  series  converges  if  p  <  1,  that  is,  if  |zj  <  1,  or 


\J x1  +  y2  <  1.  This  is  the  interior  of  a  disk  of  radius  1  with 
center  at  the  origin  in  the  complex  plane.  This  disk  is  called 
the  disk  of  convergence  of  the  infinite  series  and  the  radius  of 
the  disk  is  called  the  radius  of  convergence.  The  disk  of  con¬ 
vergence  replaces  the  interval  of  convergence  which  we  had 
for  real  series.  In  fact  (see  Figure  7.1),  the  interval  of  con¬ 
vergence  for  the  series  ]C(— x)n/n  is  just  the  interval  (—1,1) 
on  the  x  axis  contained  within  the  disk  of  convergence  of 
^2{—z)n/n,  as  it  must  be  since  x  is  the  value  of  z  when  y  =  0.  For  this  reason  we 
sometimes  speak  of  the  radius  of  convergence  of  a  power  series  even  though  we  are 
considering  only  real  values  of  2.  (Also  see  Chapter  14,  Equations  (2.5)  and  (2.6) 
and  Figure  2.4.) 

Next  consider  series  (7.2b);  here  we  have 


p  =  lim 


(iz) 


n+l 


(izy 


(n+l)!  ’  n! 


lim 

n— >oo 


IZ 


n+l 


=  0. 


This  is  an  example  of  a  series  which  converges  for  all  values 
of  z.  For  series  (7.2c),  we  have 


p  =  lim 

n— ►  oo 


(z  + 1  —  i) 

n 2 

2  +  1  —  7 

3 

(n  +  l)2 

3 

Thus,  this  series  converges  for 

\z  +  1  —  i\  <  3,  or  \z  -  (—1  +  i)|  <3. 

This  is  the  interior  of  a  disk  (Figure  7.2)  of  radius  3  and 
center  at  z  —  —  1  +  *  (see  Problem  5.65). 
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Just  as  for  real  series,  if  p  >  1,  the  series  diverges  (Problem  6.14).  For  p  =  1 
(that  is,  on  the  boundary  of  the  disk  of  convergence)  the  series  may  either  converge 
or  diverge.  It  may  be  difficult  to  find  out  which  and  we  shall  not  in  general  need  to 
consider  the  question. 

The  four  theorems  about  power  series  (Chapter  1,  Section  11)  are  true  also  for 
complex  series  (replace  interval  by  disk  of  convergence).  Also  we  can  now  state  for 
Theorem  2  what  the  disk  of  convergence  is  for  the  quotient  of  two  series  of  powers 
of  z.  Assume  to  start  with  that  any  common  factor  z  has  been  cancelled.  Let  rq 
and  7*2  be  the  radii  of  convergence  of  the  numerator  and  denominator  series.  Find 
the  closest  point  to  the  origin  in  the  complex  plane  where  the  denominator  is  zero; 
call  the  distance  from  the  origin  to  this  point  s.  Then  the  quotient  series  converges 
at  least  inside  the  smallest  of  the  three  disks  of  radii  rq,r2,  and  s,  with  center  at 
the  origin.  (See  Chapter  14,  Section  2.) 

Example.  Find  the  disk  of  convergence  of  the  Maclaurin  series  for  (sinz)/[z(l  +  z2)]. 

We  shall  soon  see  that  the  series  for  sin  2  has  the  same  form  as  the  real  series 
for  sin  a;  in  Chapter  1.  Using  this  fact  we  find  (Problem  17) 

sin  x  7z2  47 z4  5923z6 

z(l  +  z2)  ~  6  '  40  5040  +  " '  ' 

From  (7.3)  we  can’t  find  the  radius  of  convergence,  but  let’s  use  the  theorem  above. 
Let  the  numerator  series  be  (sin  z)/z.  By  ratio  test,  the  series  for  (sin.z)/z  converges 
for  all  z  (if  you  like,  rq  =  oo) .  There  is  no  r2  since  the  denominator  is  not  an  infinite 
series.  The  denominator  1  +  z2  is  zero  when  z  =  ±i,  so  s  =  1.  Then  the  series  (7.3) 
converges  inside  a  disk  of  radius  1  with  center  at  the  origin. 


PROBLEMS,  SECTION  7 


Find  the  disk  of  convergence  for  each  of  the  following  complex  power  series. 


1. 

Z 

e  = 

=  1  +  2  + 

22  23 

2!  +  3!  ' 

[equation  (8.1)] 

2. 

^  — 

22  23 

2  +  3 

1 

+ 

3. 

3!  +  5! 

4. 

OO 

E*n 

71=0 

5. 

oo 

E 

71=0 

(!)’ 

6. 

OO 

E  n2(3iz)n 

71  =  1 

7. 

-  (_1)nz2n 

h  (2" )! 

8. 

OO 

E 

71  =  1 

z2n 

9. 

zn 

71  =  1 

10. 

(iz)n 

[2n  +  1)! 

'  n2 

71  =  1 

11. 

OO 

E 

71=0 

(n\)3zn 

(3n)! 

12. 

(n\)2zn 
^  (2  n)\ 

71=0  V  ' 

13. 

f'  (z  ~  i)n 

n 

71=1 

^^n(n  +  l)(z  —  2i)n  15. 

71=0  71=0 


(z  —  2  +  i)r' 


^2n(«  +  i-3)2 


14. 


2n 


16. 
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17.  Verify  the  series  in  (7.3)  by  computer.  Also  show  that  it  can  be  written  in  the  form 


Ec-1  )“j2"E 

n=0  k= 0 


l 

(2fc  +  1)! 


Use  this  form  to  show  by  ratio  test  that  the  series  converges  in  the  disk  \z\  <  1. 


8.  ELEMENTARY  FUNCTIONS  OF  COMPLEX  NUMBERS 

The  so-called  elementary  functions  are  powers  and  roots,  trigonometric  and  inverse 
trigonometric  functions,  logarithmic  and  exponential  functions,  and  combinations 
of  these.  All  these  you  can  compute  or  find  in  tables,  as  long  as  you  want  them  as 
functions  of  real  numbers.  Now  we  want  to  find  things  like  i1,  sin(l+i) ,  or  In  i.  These 
are  not  just  curiosities  for  the  amusement  of  the  mathematically  inclined,  but  may 
turn  up  to  be  evaluated  in  applied  problems.  To  be  sure,  the  values  of  experimental 
measurements  are  not  imaginary.  But  the  values  of  Rez,  Im z,  |z|,  angle  of  z,  are 
real,  and  these  are  the  quantities  which  have  experimental  meaning.  Meanwhile, 
mathematical  solutions  of  problems  may  involve  manipulations  of  complex  numbers 
before  we  arrive  finally  at  a  real  answer  to  compare  with  experiment. 

Polynomials  and  rational  functions  (quotients  of  polynomials)  of  z  are  easily 
evaluated. 


Example.  If  /(z)  =  (z2  +  l)/(z  —  3),  we  find  f(i  —  2)  by  substituting  z  =  i  —  2  : 

tt.  (*-  2)2  +  l  —4*  +  4  -f-  5  8a  —  12 

i-  2  —  3  ~  i-  5  '  -i-  5~  13  ' 

Next  we  want  to  investigate  the  possible  meaning  of  other  functions  of  complex 
numbers.  We  should  like  to  define  expressions  like  ez  or  sinz  so  that  they  will 
obey  the  familiar  laws  we  know  for  the  corresponding  real  expressions  [for  example, 
sin  2a;  =  2  sin  a;  cos  a;,  or  ( d/dx)ex  =  ex\.  We  must,  for  consistency,  define  functions 
of  complex  numbers  so  that  any  equations  involving  them  reduce  to  correct  real 
equations  when  z  =  x  +  iy  becomes  z  =  x,  that  is,  when  y  =  0.  These  requirements 
will  be  met  if  we  define  ez  by  the  power  series 


(8.1) 


°°  2 

e*  =  V^  =  l  +  z+  — 
^  n\  2! 

o 


This  series  converges  for  all  values  of  the  complex  number  z  (Problem  7.1)  and 
therefore  gives  us  the  value  of  ez  for  any  z.  If  we  put  z  =  x  (x  real),  we  get  the 
familiar  series  for  ex . 

It  is  easy  to  show,  by  multiplying  the  series  (Problem  1),  that 


(8.2) 


e21  •  eZ2  =  eZl+Z2 


In  Chapter  14  we  shall  consider  in  detail  the  meaning  of  derivatives  with  respect  to 
a  complex  z.  However,  it  is  worth  while  for  you  to  know  that  ( d/dz)zn  =  nz 71-1 , 
and  that,  in  fact,  the  other  differentiation  and  integration  formulas  which  you  know 
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from  elementary  calculus  hold  also  with  x  replaced  by  z.  You  can  verify  that 
( d/dz)ez  =  ez  when  ez  is  defined  by  (8.1)  by  differentiating  (8.1)  term  by  term 
(Problem  2).  It  can  be  shown  that  (8.1)  is  the  only  definition  of  ez  which  pre¬ 
serves  these  familiar  formulas.  We  now  want  to  consider  the  consequences  of  this 
definition. 


PROBLEMS,  SECTION  8 

Show  from  the  power  series  (8.1)  that 


2. 


dz 


3.  Find  the  power  series  for  ex  cos  x  and  for  ex  sin  x  from  the  series  for  ez  in  the  following 
way:  Write  the  series  for  ez;  put  2  =  x  +  iy.  Show  that  ez  =  ex(cos  y  +  i  sin  y);  take 
real  and  imaginary  parts  of  the  equation,  and  put  y  =  x. 


9.  EULER'S  FORMULA 

For  real  0,  we  know  from  Chapter  1  the  power  series  for  sin0  and  cos  0: 

.  n  n  o3  o5 

sm  0  =  0  — 

(9.1) 


3! 

02 


5! 

04 


COS0  =  1--  +  -----  . 

From  our  definition  (8.1),  we  can  write  the  series  for  e  to  any  power,  real  or  imagi¬ 
nary.  We  write  the  series  for  e  ,  where  0  is  real: 


(9.2) 


e"  =  l  +  id  +  Ml 


(id)3  (id)4  (id)' 


9 2  04 

2!  +  4!  +"' 


3! 

03  04 

_  4! 


4! 
,05 
:5!  ‘ 
03 

3! 


5! 


i  y  -  —  +  — 


03 

5! 


(The  rearrangement  of  terms  is  justified  because  the  series  is  absolutely  convergent.) 
Now  compare  (9.1)  and  (9.2);  the  last  line  in  (9.2)  is  just  cos0  +  isin0.  We  then 
have  the  very  useful  result  we  introduced  in  Section  3,  known  as  Euler’s  formula: 


(9.3) 


el6  =  cos  0  +  i  sin  0. 


Thus  we  have  justified  writing  any  complex  number  as  we  did  in  (4.1),  namely 


(9.4) 


z  =  x  +  iy  =  r( cos  0  +  i  sin  0)  =  re™ . 
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Here  are  some  examples  of  the  use  of  (9.3)  and  (9.4).  These  problems  can  be  done 
very  quickly  graphically  or  just  by  picturing  them  in  your  mind. 


Examples.  Find  the  values  of  2  el7r/6,  el7r,  3  e  "r/2,  e2mr\ 

2ei7r/6  jg  re ie  wj^j1  r  =  2,  9  =  7t/6.  From  Figure  9.1, 
x  =  -\/3,  y  =  1,  x  +  iy  =  -\/3  +  i,  so  2  el7r/6  =  x/3  +  *. 


el7r  is  re1 ®  with  r  =  1,  8  =  7r.  From  Figure  9.2,  x  =  —1, 
y  =  0,  x  +  iy  =  —1  +  0*,  soe"  =  —1.  Note  that  r  =  1  and 
0  =  — 7r,  ±37t,  ±57t,  •  •  • ,  give  the  same  point,  so  e_I7r  =  —1, 
e37"  _  _i  ancj  g0  on_ 


Figure  9.1 


r  =  1. 


(-1,  0) 


Figure  9.2 


3e  is  re1®  with  r  =  3,  9  =  — 7r/2.  From  Figure 
9.3,  x  =  0,  y  =  —3,  so  3e-1’1'/2  =  x  +  iy  =  0  —  3i  =  —3i. 


e2mr*  jg  rezd  r  =  \  and  Q  =  2n7r  =  n(27r);  that  is, 
6  is  an  integral  multiple  of  2-7T.  From  Figure  9.4,  x  =  1, 
y  =  0,  so  e2mri  =  1  +  0*  =  1. 


Figure  9.3 


Figure  9.4 


It  is  often  convenient  to  use  Euler’s  formula  when  we  want  to  multiply  or  divide 
complex  numbers.  From  (8.2)  we  obtain  two  familiar  looking  laws  of  exponents 
which  are  now  valid  for  imaginary  exponents: 


(9.5) 


gi+  _  giS2  _  gi(#l+02) 
gifli  g iB2  _  gi(0i— $2) 


Remembering  that  any  complex  number  can  be  written  in  the  form  re1®  by  (9.4), 
we  get 
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(9.6) 


Zi-Z2  =  ri  eif)l  ■  r2  eif>2  =  n  r2  ei{Sl+e2) , 
r2 


In  words,  to  multiply  two  complex  numbers,  we  multiply  their  absolute  values 
and  add  their  angles.  To  divide  two  complex  numbers,  we  divide  the  absolute 
values  and  subtract  the  angles. 


Example.  Evaluate  (1  +  i)2 /{ 1  —  i).  From  Figure  5.1  we  have 
1  +  i  =  •\/2er7r/4.  We  plot  1  —  i  in  Figure  9.5  and  find 
r  =  y/2,  9  =  — 7t/4  (or  +77r/4),  so  1  —  i  =  \/2  Then 

(l  +  »)2  =  (V2e^/4)2  =  2eW2  = 

1  -i  v/2e-i’r/4  v^e""/4 


Figure  9.5 


From  Figure  9.6,  we  find  x  =  — 1,  y  =  1,  so 
(i  +  i)2 

— - =  x  +  iv  =  —  1  +  i. 

l—i 

We  could  use  degrees  in  this  problem.  By  (9.6),  we  find 
that  the  angle  of  (1  +  i)2 / (1  —  i)  is  2(45°)  —  (—45°)  =  135° 
as  in  Figure  9.6. 


PROBLEMS,  SECTION  9 


Express  the  following  complex  numbers  in  the  x  +  iy  form.  Try  to  visualize  each  complex 
number,  using  sketches  as  in  the  examples  if  necessary.  The  first  twelve  problems  you 
should  be  able  to  do  in  your  head  (and  maybe  some  of  the  others — try  it!)  Doing  a 
problem  quickly  in  your  head  saves  time  over  using  a  computer.  Remember  that  the  point 
in  doing  problems  like  this  is  to  gain  skill  in  manipulating  complex  expressions,  so  a  good 
study  method  is  to  do  the  problems  by  hand  and  use  a  computer  to  check  your  answers. 


1. 

—  i7r/4 

e  7 

2. 

i7r/2 

e  7 

3. 

4. 

e(l/3)(3+47Ti) 

5. 

5771 

e 

6. 

7. 

3  e2(1+i,d 

8. 

2  g57ri/6 

9. 

10. 

ITT  |  —i'K 

e  +  e 

11. 

V2e5i7r/4 

12. 

13. 

(f-V3)3 
l  —  i 

14. 

(l  +  n/3j 

15. 

16. 

(i-V 3)  (l  +  i\J 3) 

17. 

1 

(1  +  i)3 

18. 

19. 

(1  -if 

20. 

(ZY 

21. 

9  e3"/2 

—  27 ri  —A'ki  .  —  67ri 

e  —  e  +  e 

2e-W2 

4e-8W3 


(l  +  i)2  +  (l+f)4 


1  +  i 
l  —  i 


4 


l  —  i 

~7T 


40 
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22. 

25. 


23. 


26. 


(1  +  048 
(V3  -if 


2i 


i  +  y/3 


24. 


(l-iV3)21 

(i  -  l)38 


27.  Show  that  for  any  real  y,  \ely\  =  1.  Hence  show  that  \ez\  =  ex  for  every  complex  z. 

28.  Show  that  the  absolute  value  of  a  product  of  two  complex  numbers  is  equal  to  the 
product  of  the  absolute  values.  Also  show  that  the  absolute  value  of  the  quotient 
of  two  complex  numbers  is  the  quotient  of  the  absolute  values.  Hint:  Write  the 
numbers  in  the  relB  form. 


Use  Problems  27  and  28  to  find  the  following  absolute  values.  If  you  understand  Problems 
27  and  28  and  equation  (5.1),  you  should  be  able  to  do  these  in  your  head. 


29.  \ei7r/2\ 

30. 

le^l 

31. 

|5e2"/3| 

32.  |3e2+4i 

33.  \2e3+i7r\ 

34. 

|4e2i-1| 

35. 

|3  e5i  •  7e~2' 

ij  36_  |2eW6|2 

1  +  i 

38. 

ein 

1  —  i 

1  +  i 

10.  POWERS  AND  ROOTS  OF  COMPLEX  NUMBERS 

Using  the  rules  (9.6)  for  multiplication  and  division  of  complex  numbers,  we  have 
(10.1)  zn  =  (rei9)n  =  rneinB 

for  any  integral  n.  In  words,  to  obtain  the  nth  power  of  a  complex  number,  we  take 
the  nth  power  of  the  modulus  and  multiply  the  angle  by  n.  The  case  r  =  1  is  of 
particular  interest.  Then  (10.1)  becomes  DeMoivre’s  theorem: 


(10.2)  ( elB)n  =  (cos  0  +  i  sin  9)n  =  cos  n9  +  i  sin  n9. 


You  can  use  this  equation  to  find  the  formulas  for  sin  2 9,  cos  29,  sin  3 9,  etc.  (Prob¬ 
lems  27  and  28). 

The  nth  root  of  z ,  zl^n ,  means  a  complex  number  whose  nth  power  is  z.  From 
(10.1)  you  can  see  that  this  is 


(10.3) 


21  /"  =  (re*®)1/"  —  r1/neie/n  — 


cos  ■ 


i  sin  ■ 


This  formula  must  be  used  with  care  (see  Examples  2  to  4  below). 
Some  examples  will  show  how  useful  these  formulas  are. 


Example  1. 


[cos(tt/10)  +  isin(7r/10)]25  =  (ei7r/10)25  =  e2"Vw/2  =  1  •  i  =  i. 
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Example  2.  Find  the  cube  roots  of  8.  We  know  that  2  is  a  cube  root  of  8,  but  there  are 
also  two  complex  cube  roots  of  8;  let  us  see  why.  Plot  the  complex  number  8  (that 
is,  x  =  8,  y  =  0)  in  the  complex  plane;  the  polar  coordinates  of  the  point  are  r  =  8, 
and  9  =  0,  or  360°,  720°,  1080°,  etc.  (We  can  use  either  degrees  or  radians  here; 
read  the  end  of  Section  3.)  Now  by  equation  (10.3),  z1/3  =  r1/3e*e/3;  that  is,  to 
find  the  polar  coordinates  of  the  cube  root  of  a  number  relS ,  we  find  the  cube  root 
of  r  and  divide  the  angle  by  3.  Then  the  polar  coordinates  of  are 

(10.4)  r  =  2,  9  =  0°,  360°/3,  720°/3,  1080°/3  ■  ■  • 

=  0°,  120°,  240°,  360°  •  •  •  . 

We  plot  these  points  in  Figure  10.1.  Observe  that  the  point  (2,  0°)  and  the 
point  (2,  360°)  are  the  same.  The  points  in  (10.4)  are  all  on  a  circle  of  radius  2 
and  are  equally  spaced  360°/3  =  120°  apart.  Starting  with  0  =  0,  if  we  add  120° 
repeatedly,  we  just  repeat  the  three  angles  shown. 

Thus,  there  are  exactly  three  cube  roots  for  any 
number  z,  always  on  a  circle  of  radius  y/\z\  and 
spaced  120°  apart. 

Now  to  find  the  values  of  s/8  in  rectangular 
form,  we  can  read  them  from  Figure  10.1,  or  we 
can  calculate  them  from  z  =  r( cos  9  +  i  sin  9)  with 
r  =  2  and  0  =  0,  120°  =  2tt/3,  240°  =  4tt/3. 

We  can  also  use  a  computer  to  solve  the  equation 
z3  =  8.  By  any  of  these  methods  we  find 

v/8  =  {2,  -l  +  iy/3,  -1  -iy/3}.  Figure  10.1 

Example  3.  Find  and  plot  all  values  of  \/— 64.  From  Figure  10.2  (or  by  visualizing  a 
plot  of  —64),  we  see  that  the  polar  coordinates  of  —64  are  r  =  64,  9  =  tt  +  2kir 


Figure  10.2  Figure  10.3 

(where  k  =  0, 1, 2,  •  •  • ).  Then  since  z1/4  =  r1/4e*61/4,  the  polar  coordinates  of  \J—  64 
are 


r  =  VM  =  2V2, 

7T  7T  +  27T  7T  +  47T  7T  +  67T  TT  3n  bn  7lT 

~  4  ’  4  ’  4  ’  4  ’  ~  4  ’  4  ’  4  ’  4  ' 


We  plot  these  points  in  Figure  10.3.  Observe  that  they  are  all  on  a  circle  of 
radius  2^/2,  equally  spaced  27r/4  =  tt/2  apart.  Starting  with  9  =  7r/4,  we  add  n/2 
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repeatedly,  and  find  exactly  4  fourth  roots.  We  can  read  the  values  of  ^—64  in 
rectangular  form  from  Figure  10.3: 

\/ — 64  =  ±2  ±  2 i  (all  four  combinations  of  ±  signs) 

or  we  can  calculate  them  as  in  Example  2,  or  we  can  solve  the  equation  z4  =  —  64 
by  computer. 


Example  4.  Find  and  plot  all  values  of  v7— 8 i.  The  polar  coordinates  of  —8*  are  r  =  8, 
9  =  270°  +  360°fc  =  37t/2  +  2nk.  Then  the  polar  coordinates  of  v7— 8*  are 


(10.5) 


=  \/2,  9  = 


270°  +  360°fc 

6 


=  45° 


60° 


In  Figure  10.4,  we  sketch  a  circle  of  radius  \/2.  On 
it  we  plot  the  point  at  45°  and  then  plot  the  rest  of 
the  6  equally  spaced  points  60°  apart.  To  find  the 
roots  in  rectangular  coordinates,  we  need  to  find  all 
the  values  of  r(cos  9  +  i  sin  9)  with  r  and  9  given  by 
(10.5).  We  can  do  this  one  root  at  a  time  or  more 
simply  by  using  a  computer  to  solve  the  equation 
z6  =  —8 i.  We  find  (see  Problem  33) 

J,  ,  .  V3  +  1  V3-1.  V3-1  V3+1.1 

±\1+!’  — - — *■  — - 

±{1  +  *,  1.366  -0.366*,  0.366  -  1.366*}. 

Summary  In  each  of  the  preceding  examples,  our  steps  in  finding  v7 re1 8  were: 


Figure  10.4 


(a)  Find  the  polar  coordinates  of  the  roots:  Take  the  nth  root  of  r  and  divide 
9  +  2kir  by  n. 

(b)  Make  a  sketch:  Draw  a  circle  of  radius  yfr,  plot  the  root  with  angle  0/n,  and 
then  plot  the  rest  of  the  n  roots  around  the  circle  equally  spaced  2n/n  apart. 
Note  that  we  have  now  essentially  solved  the  problem.  From  the  sketch  you 
can  see  the  approximate  rectangular  coordinates  of  the  roots  and  check  your 
answers  in  (c).  Since  this  sketch  is  quick  and  easy  to  do,  it  is  worthwhile  even 
if  you  use  a  computer  to  do  part  (c) . 

(c)  Find  the  x  +  iy  coordinates  of  the  roots  by  one  of  the  methods  in  the  examples. 
If  you  are  using  a  computer,  you  may  want  to  make  a  computer  plot  of  the 
roots  which  should  be  a  perfected  copy  of  your  sketch  in  (b) . 
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Follow  steps  (a),  (b),  (c)  above  to  find  all  the  values  of  the  indicated  roots. 


1. 

v'T 

2. 

^27 

3. 

v7! 

4. 

\Z~16 

5. 

Vl 

6. 

vTi4 

7. 

V 16 

8. 

9. 

Vi 

10. 

^32 

11. 

v^— 8 

12. 

^1 
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13. 

\/—4 

14. 

W 

15. 

^64 

16. 

W 

17. 

W 

18. 

Vi 

19. 

Vi 

20. 

V~8i 

21. 

\J  2  +  2iV 

22. 

\/2i  -  2 

23. 

00 

<s>. 

Si 

1 

00 

24. 

8  /  —  1  —  %yj 3 

V  2 

25. 

26. 

V 

27.  Using  the  fact  that  a  complex  equation  is  really  two  real  equations,  find  the 
double  angle  formulas  (for  sin  20,  cos  20)  by  using  equation  (10.2). 

28.  As  in  Problem  27,  find  the  formulas  for  sin  30  and  cos  30. 

29.  Show  that  the  center  of  mass  of  three  identical  particles  situated  at  the  points 
z\,  z2,  z3  is  (zi  +  z2  +  z3)/3. 

30.  Show  that  the  sum  of  the  three  cube  roots  of  8  is  zero. 

31.  Show  that  the  sum  of  the  n  nth  roots  of  any  complex  number  is  zero. 

32.  The  three  cube  roots  of  +1  are  often  called  1,  w,  and  w2.  Show  that  this 
is  reasonable,  that  is,  show  that  the  cube  roots  of  +1  are  +1  and  two  other 
numbers,  each  of  which  is  the  square  of  the  other. 

33.  Verify  the  results  given  for  the  roots  in  Example  4.  You  can  find  the  exact 
values  in  terms  of  \/3  by  using  trigonometric  addition  formulas  or  more  easily 
by  using  a  computer  to  solve  z6  =  —8 i.  (You  still  may  have  to  do  a  little 
work  by  hand  to  put  the  computer’s  solution  into  the  given  form.) 

11.  THE  EXPONENTIAL  AND  TRIGONOMETRIC  FUNCTIONS 

Although  we  have  already  defined  ez  by  a  power  series  (8.1),  it  is  worth  while  to 

write  it  in  another  form.  By  (8.2)  we  can  write 


(11.1)  e2  =  ex+ly  =  exeiy  =  ex(cos  y  +  isiny). 


This  is  more  convenient  to  use  than  the  infinite  series  if  we  want  values  of  ez  for 
given  z.  For  example, 


=  e2  •  (-1)  =  -€ 


from  Figure  9.2. 

We  have  already  seen  that  there  is  a  close  relationship  [Euler’s  formula  (9.3)] 
between  complex  exponentials  and  trigonometric  functions  of  real  angles.  It  is 
useful  to  write  this  relation  in  another  form.  We  write  Euler’s  formula  (9.3)  as  it 
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is,  and  also  write  it  with  9  replaced  by  — 9 .  Remember  that  cos(— 9)  =  cos  9  and 
sin(— 9)  =  —  sin  61  Then  we  have 


(11.2) 


e* 6  =  cosd  +  isind, 
e~l 6  =  cos  9  —  i  sin  9. 


These  two  equations  can  be  solved  for  sin  9  and  cos  9.  We  get  (Problem  2) 


sin  9  = 

e  —  e 

(11.3) 

cos  9  = 

2  i 

eid  +  e-i9 

2 

These  formulas  are  useful  in  evaluating  integrals  since  products  of  exponentials  are 
easier  to  integrate  than  products  of  sines  and  cosines.  (See  Problems  11  to  16,  and 
Chapter  7,  Section  5.) 

So  far  we  have  discussed  only  trigonometric  functions  of  real  angles.  We  could 
define  sin  z  and  cos  z  for  complex  z  by  their  power  series  as  we  did  for  ez .  We  could 
then  compare  these  series  with  the  series  for  elz  and  derive  Euler’s  formula  and 
(11.3)  with  9  replaced  by  z.  However,  it  is  simpler  to  use  the  complex  equations 
corresponding  to  (11.3)  as  our  definitions  for  sinz  and  cosz.  We  define 


The  rest  of  the  trigonometric  functions  of  z  are  defined  in  the  usual  way  in  terms 
of  these;  for  example,  tanz  =  sin  zj  cos  z. 


► 


€l  1  -b  6  1  1  6  ^  H-  6 

Example  1.  cos  i  =  - - -  =  — - —  =  1.543  •••.  (We 

this  expression  is  called  the  hyperbolic  cosine  of  1.) 


will  see  in  Section  15  that 


Example  2. 


+  i  In  2^j 


ei(7r/2-M  In  2)  _  e~i(n/2+i  In  2) 

2 i 

ei7r/2g-ln2  _  e-i7r/2gln2 

2 i 


by  (8.2). 


From  Figures  5.2  and  9.3,  el7r/2  =  i,  and  e  I7r/2  =  —i.  By  the  definition  of  In  a: 
[or  see  equations  (13.1)  and  (13.2)],  eln2  =  2,  so  e-ln2  =  l/eln2  =  1/2.  Then 


+  i  In  2^ 


(0(1/2)  -  H)(2)  5 

2  i  4' 
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Notice  from  both  these  examples  that  sines  and  cosines  of  complex  numbers  may 
be  greater  than  1.  As  we  shall  see  (Section  15),  although  |  sinx|  <  1  and  |  cosa;|  <  1 
for  real  x ,  when  z  is  a  complex  number,  sin  z  and  cos  z  can  have  any  value  we  like. 

Using  the  definitions  (11.4)  of  sinz  and  cosz,  you  can  show  that  the  familiar 
trigonometric  identities  and  calculus  formulas  hold  when  8  is  replaced  by  z. 


Example  3. 


Prove  that  sin2  z  +  cos2  z  =  1. 


sin  z  = 


2i 


cos  z  = 


e2iz  -  2  +  e~2iz 
^4 

e2iz  +  2  +  e~2iz 
4 


•2  2  2  2  1 
sin  z  +  cos  z  =  -  +  -  =  1. 

4  4 


Example 4.  Using  the  definitions  (11.4),  verify  that  (d/dz)sinz  =  cosz. 


sin  z  = 


a  i.  /  .  Ay 

—  sin  z=—(ie  +  ze 
dz  2 1 


iz ^ 


2 


cosz. 


PROBLEMS,  SECTION  11 


1.  Define  sinz  and  cosz  by  their  power  series.  Write  the  power  series  for  eiz .  By 
comparing  these  series  obtain  the  definition  (11.4)  of  sinz  and  cosz. 


2. 

Solve  the  equations  etB  =  cos  6  +  i  sin  6,  e  10 
so  obtain  equations  (11.3). 

=  cos  9  —  i  sin  9,  for  cos  9  and  sin  9  and 

Find  each  of  the  following  in  rectangular  form  x  +  iy  and  check  your  results  by  computer. 
Remember  to  save  time  by  doing  as  much  as  you  can  in  your  head. 

3. 

g— (*’ r/4)  +  ln3  4 

3  In  2  — in r 

e 

g  e(iir/4)  +  (ln2)/2 

6. 

cos(iln5)  7. 

tan(i  In  2) 

8.  cos(7r  —  2i  In  3) 

9. 

sin(7r  —  iln3)  10. 

sin(i  Ini) 

In  the  following  integrals  express 
integrate  to  show  that: 

the  sines  and 

cosines  in  exponential  form  and  then 

11. 

/  cos  2x  cos  3a;  dx  =  0 

—  7r 

12. 

J  cos2  3xdx  =  iv 

13. 

n'K 

/  sin  2a;  sin  3a;  da;  =  0 

J  —  7T 

14. 

r2n 

/  sin2  4x  dx  =  n 

Jo 

15. 

P7T 

/  sin  2a;  cos  3a;  da;  =  0 

J  —  7T 

16. 

P  7T 

/  sin  3a;  cos  4a;  da;  =  0 

J  —  7T 

Evaluate  f  e^a+tb^xdx  and  take  real  and  imaginary  parts  to  show  that: 

f  nT  ,  eax(a  cos 6®  +  6  shift*) 

17.  /  e  cos  bx  dx  = - - - x - — - 

J  a1  +  bz 

/ax  ■  ,  ,  eax(a  sin  bx  —  b  cos  6a;) 

e  sm  bx  dx  =  - - - ^ - -x - 1 

a1  +  b 1 


18. 
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12.  HYPERBOLIC  FUNCTIONS 


Let  us  look  at  sinz  and  cosz  for  pure  imaginary  z,  that  is,  z  =  iy : 


(12.1) 


sin  iy  = 
cos iy  = 


e~v  -  ey 
2 i 

e~y  +  ey 
2 


ey  -  e~v 


ey  +  e~y 

2 


The  real  functions  on  the  right  have  special  names  because  these  particular  combi¬ 
nations  of  exponentials  arise  frequently  in  problems.  They  are  called  the  hyperbolic 
sine  (abbreviated  sinh)  and  the  hyperbolic  cosine  (abbreviated  cosh).  Their  defini¬ 
tions  for  all  z  are 


(12.2) 


sinh  z  = 
cosh  z  = 


The  other  hyperbolic  functions 
the  trigonometric  functions: 

tanh  z  = 

(12.3) 

sech  z  = 


are  named  and  defined  in  a  similar  way  to  parallel 


sinh  z 

cosh  z  ’ 
1 

cosh  z  ’ 


coth  z  = ^ — , 

tanh  z 

cschz  =  — - — . 
smh  z 


(See  Problem  38  for  the  reason  behind  the  term  “hyperbolic”  functions.) 
We  can  write  (12.1)  as 


(12.4) 


siniy  =  i  sinhy, 
cos  iy  =  coshy. 


Then  we  see  that  the  hyperbolic  functions  of  y  are  (except  for  one  i  factor)  the 
trigonometric  functions  of  iy.  From  (12.2)  we  can  show  that  (12.4)  holds  with  y 
replaced  by  z.  Because  of  this  relation  between  hyperbolic  and  trigonometric  func¬ 
tions,  the  formulas  for  hyperbolic  functions  look  very  much  like  the  corresponding 
trigonometric  identities  and  calculus  formulas.  They  are  not  identical,  however. 


Example.  You  can  prove  the  following  formulas  (see  Problems  9,  10,  11  and  38). 


cosh2  z  —  sinh2  z  =  1  (compare  sin2  z  +  cos2  z  =  1), 

—  cosh  z  =  sinh  z  (compare  —  cos  z  =  —  sin  z) . 

dz  dz 

PROBLEMS,  SECTION  12 

Verify  each  of  the  following  by  using  equations  (11.4),  (12.2),  and  (12.3). 

1.  sinz  =  sin(a:  +  iy)  =  sins  coshy  +  i  cosx  sinhy 
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2. 

cos  z  =  cos  x  cosh  y  —  i  sin  x  sinh  y 

3. 

sinh  z  =  sinh  x  cos  y  +  i  cosh  x  sin  y 

4. 

cosh  £  =  cosh  x  cos  y  +  i  sinh  x  sin  y 

5. 

sin  2z  =  2  sin  z  cos  z 

6. 

cos  2 z  =  cos2  2  —  sin2  z 

7. 

sinh  2z  =  2  sinh  z  cosh  z 

8. 

cosh  2z  =  cosh2  z  +  sinh2  z 

9. 

d 

—  cos  z  =  —  sm  z 
dz 

10. 

-7-  cosh  z  =  sinh  z 
dz 

11. 

cosh2  z  —  sinh2  z  =  1 

12. 

cos4  z  +  sin4  z  =  1  —  i  sin2  2z 

13. 

cos  3z  =  4  cos3  z  —  3  cos  z 

14. 

siniz  =  i  sinhz 

15. 

sinh  iz  =  i  sin  z 

16. 

tan  iz  =  i  tanh  z 

17. 

tanh  iz  =  i  tan  z 

18. 

.  .  .  tan  x  +  1  tanh  y 

tan  z  =  tan(®  +  iy)  =  - 

1  —  i  tan  x  tanh  y 

19. 

,  tanh  x  +  i  tan  y 

tanh  z  =  - - - — 

1  +  2  tanh  x  tan  y 

20. 

Show  that  enz  =  (coshz  +  sinhz)71 

=  cosh  nz  +  sinh  nz.  Use  this  and  a  similar 

equation  for  e  nz  to  find  formulas  for  cosh3z  and  sinh3z  in  terms  of  sinhz  and 

cosh  z. 

21.  Use  a  computer  to  plot  graphs  of  sinh®,  cosh  2,  and  tanh®. 

22.  Using  (12.2)  and  (8.1),  find,  in  summation  form,  the  power  series  for  sinha;  and 
cosh®.  Check  the  first  few  terms  of  your  series  by  computer. 

Find  the  real  part,  the  imaginary  part,  and  the  absolute  value  of 


23.  cosh(i®) 

24.  cos(j®) 

25. 

sin(®  —  iy) 

26.  cosh(2  —  3i) 

27.  sin(4  +  3  i) 

28. 

tanh(l  —  i7r) 

Find  each  of  the  following  in  the  x  +  iy  form  and  check  your  answers  by  computer. 

37Ti 


29.  cosh27ri 


30.  tanh  - 


31.  sinh  (  In  2  +  — 


32. 


cosh  (  ^  —  In  3 


33.  tani 


„  .  .  m 

34.  sm  — 


35.  cosh(i7r  +  2) 


36.  sinh  (  1  + 


37.  cos(wr) 


38.  The  functions  sin  t,  cos  t,  ■  ■  • ,  are  called  “circular  functions”  and  the  functions  sinh  t, 
cosh  t,  •  •  •  ,  are  called  “hyperbolic  functions” .  To  see  a  reason  for  this,  show  that 
x  =  cost,  y  =  sint,  satisfy  the  equation  of  a  circle  x2  +  y2  =  1,  while  x  =  coshf, 
y  =  sinhf,  satisfy  the  equation  of  a  hyperbola  x2  —  y2  =  1. 
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13.  LOGARITHMS 

In  elementary  mathematics  you  learned  to  find  logarithms  of  positive  numbers  only; 
in  fact,  you  may  have  been  told  that  there  were  no  logarithms  of  negative  numbers. 
This  is  true  if  you  use  only  real  numbers,  but  it  is  not  true  when  we  allow  complex 
numbers  as  answers.  We  shall  now  see  how  to  find  the  logarithm  of  any  complex 
number  z  ^  0  (including  negative  real  numbers  as  a  special  case).  If 

(13.1)  z  =  ew, 

then  by  definition 


(13.2) 


w  =  In  2. 


(We  use  In  for  natural  logarithms  to  avoid  the  cumbersome  loge  and  to  avoid  con¬ 
fusion  with  logarithms  to  the  base  10.) 

We  can  write  the  law  of  exponents  (8.2),  using  the  letters  of  (13.1),  as 


(13.3) 


Zlz2  =eWl  ■ ew 2  =  eWl+W2. 


Taking  logarithms  of  this  equation,  that  is,  using  (13.1)  and  (13.2),  we  get 
(13.4)  In  Z\Z2  =  W\  +  W2  =  In  z  1  +  In  z2  ■ 


This  is  the  familiar  law  for  the  logarithm  of  a  product,  justified  now  for  complex 
numbers.  We  can  then  find  the  real  and  imaginary  parts  of  the  logarithm  of  a 
complex  number  from  the  equation 


(13.5)  w  =  In  2  =  In  (relS)  =  Ln  r  +  In  el6  =  Ln  r  +  id , 

where  Ln  r  means  the  ordinary  real  logarithm  to  the  base  e  of  the  real  positive 
number  r. 


Since  9  has  an  infinite  number  of  values  (all  differing  by  multiples  of  27t),  a  complex 
number  has  infinitely  many  logarithms,  differing  from  each  other  by  multiples  of 
27tz.  The  principal  value  of  ln  z  (often  written  as  Ln*)  is  the  one  using  the  principal 
value  of  9 ,  that  is  0  <  9  <  2tt.  (Some  references  use  —n  <  9  <  tt.) 

Example  1.  Find  ln(—  1).  From  Figure  9.2,  we  see  that  the  polar  coordinates  of  the  point 
z  =  —  1  are  r  =  1  and  9  =  tt,  —i r,  37r,  •  •  • .  Then, 

ln(— 1)  =  Ln(l)  +  z(7r  ±  2mr)  =  iit,  — Z7r,  Zm,  ■  ■  ■  . 

Example  2.  Find  ln(l  +  i).  From  Figure  5.1,  for  z  =  1  +  i,  we  find  r  =  y/2,  and  9  = 
7r/4±2n7r.  Then 

ln(l  +  i)  =  Ln  \[2  +  i  ±  2mr^  =  0.347  •  •  •  +  i  ±  2mr^J  . 

Even  a  positive  real  number  now  has  infinitely  many  logarithms,  since  its  angle 
can  be  taken  as  0,  2-7t,  — 27t,  etc.  Only  one  of  these  logarithms  is  real,  namely  the 
principal  value  Lnr  using  the  angle  9  =  0. 
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14.  COMPLEX  ROOTS  AND  POWERS 

For  real  positive  numbers,  the  equation  In  a6  =  b  In  a  is  equivalent  to  ab  =  eblna.  We 
define  complex  powers  by  the  same  formula  with  complex  a  and  b.  By  definition, 
for  complex  a  and  b  (o  /  e), 


(14.1) 


_  gblna 


[The  case  a  =  e  is  excluded  because  we  have  already  defined  powers  of  e  by  (8.1).] 
Since  In  a  is  multiple  valued  (because  of  the  infinite  number  of  values  of  9) ,  powers 
ab  are  usually  multiple  valued,  and  unless  you  want  just  the  principal  value  of  In  z 
or  of  ab  you  must  use  all  values  of  9.  In  the  following  examples  we  find  all  values  of 
each  complex  power  and  write  the  answers  in  the  x  +  iy  form. 

Example  1.  Find  all  values  of  i~2z.  From  Figure  5.2,  and  equation  (13.5)  we  find  Ini  = 
Ln  1  +  i( 7t/2  ±  2mr)  =  i(tt/2  ±  2mr)  since  Lnl  =  0.  Then,  by  equation  (14.1), 

•— 2i  —  2i\ni  —  /2±2mv)  7r±4n7r  n  5n  —  377 

L  —  fc-  —  fc-  —  fc-  —  fc-  ,  fc-  ?  fc-  ,  *  *  *  , 

where  e7r  =  23.14  •  •  • .  Note  the  infinite  set  of  values  of  i~2\  all  real!  Also  read  the 
end  of  Section  3,  and  note  that  here  the  final  step  is  not  to  find  sine  or  cosine  of 
7r  ±  4n7r;  thus,  in  finding  ln  i  =  i9 ,  we  must  not  write  9  in  degrees. 

Example 2.  Find  all  values  of  i1/2.  Using  Ini  from  Example  1  we  have  i1/2  =  el1/2)!"*  = 
eiO/4+n7r)  =  ei7r/4ein-ny  Now  =  -pi  when  7i  is  even  (Fig.  9.4),  and  em7r  =  — 1 
when  n  is  odd  (Fig.  9.2).  Thus, 

i1/2  =  ±ei7r/4  = 

V2 

using  Figure  5.1.  Notice  that  although  Ini  has  an  infinite  set  of  values,  we  find  just 
two  values  for  i1'2  as  we  should  for  a  square  root.  (Compare  the  method  of  Section 
10  which  is  easier  for  this  problem.) 

Example  3.  Find  all  values  of  (1  +  i)1_b  Using  (14.1)  and  the  value  of  ln(l  +  i)  from 
Example  2,  Section  13,  we  have 

n  _|_  j)1-*  =  gU-i)  ln(i+i)  _  e(l-i)[Ln  v/2+i(7r/4±2n7r)] 

_  Ln  \/2  —i  Ln  y/2  iix/^L  ±2nni  „7r/4  ±2n7r 

C  C  C  C  D  C 

=  V2  g7r/4  g±2n7r  (since  g±2 = 

=  x/2  e’1’/4  e±2rl7r[cos(7r/4  —  Ln  \/2 )  +  i  sin(-7r/4  —  Ln  \/2 )] 

£*  g±2n7r (2.808  +  1.318*). 

Now  you  may  be  wondering  why  not  just  do  these  problems  by  computer.  The 
most  important  point  is  that  it  is  useful  for  advanced  work  to  have  skill  in  manip¬ 
ulating  complex  expressions.  A  second  point  is  that  there  may  be  several  forms  for 
an  answer  (see  Section  15,  Example  2)  or  there  may  be  many  answers  (see  examples 
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above),  and  your  computer  may  not  give  you  the  one  you  want  (see  Problem  25). 
So  to  obtain  needed  skills,  a  good  study  method  is  to  do  problems  by  hand  and 
compare  with  computer  solutions. 


PROBLEMS,  SECTION  14 

Evaluate  each  of  the  following  in  x  +  iy  form,  and  compare  with  a  computer  solution. 


1. 

4. 

7. 

10. 

13. 

16. 

19. 

22. 


ln(-e) 
ln(i  —  1) 


■  2i/ir 
l  ' 

(1+^y 


COs(-7T  +  i  In  2) 


sin 


i  In 


a/3  +  iV 

2  J. 


2. 

M-*) 

3. 

ln(i  +  V3) 

5. 

ln(— V2  -  iV2) 

6. 

ln(^) 

8. 

■  2/3 
l  ' 

9. 

(-l)i 

11. 

12. 

i3+i 

14. 

(2  i)1+i 

15. 

17. 

(i  -  l)i+1 

18. 

cos(2*  Ini) 

20. 

sin  (i  In  1  ~ 

V  1  +  V 

21. 

cos[i  ln(— 1)] 

23 


.  ^1  —  x/2i^  •  Hint:  Find  \p2i  first. 


24.  Show  that  ( ab)c  can  have  more  values  than  abc.  As  examples  compare 

(a)  [(-j)2+i]2"i  and  (-i)(2+0(2-0  =  (-j)s; 

(b)  {Cy  and  r1. 

25.  Use  a  computer  to  find  the  three  solutions  of  the  equation  x3  —  3x  —  1  =  0.  Find  a  way 
to  show  that  the  solutions  can  be  written  as  2  cos(7r/9),  —2  cos(27r/9),  —2  cos(47t/9). 


15.  INVERSE  TRIGONOMETRIC  AND  HYPERBOLIC  FUNCTIONS 


We  have  already  defined  the  trigonometric  and  hyperbolic  functions  of  a  complex 
number  z.  For  example, 

eiz  _|_  g-iz 

(15.1)  w  =  cos  z  =  - - — - 

defines  w  =  cos  z;  that  is,  for  each  complex  number  z,  (15.1)  gives  us  the  complex 
number  w.  We  now  define  the  inverse  cosine  or  arc  cos  w  by 


(15.2) 


2  =  arc  cos  w  if  w  =  cos  z. 


The  other  inverse  trigonometric  and  hyperbolic  functions  are  defined  similarly. 

In  dealing  with  real  numbers,  you  know  that  sin  a;  and  cos  a;  are  never  greater 
than  1.  This  is  no  longer  true  for  sinz  and  cosz  with  z  complex.  To  illustrate  the 
method  of  finding  inverse  trigonometric  (or  inverse  hyperbolic)  functions,  let’s  find 
arc  cos  2. 
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Example  1.  We  want  z,  where 


Then  we  have 


z  =  arc  cos  2  or  cos  z  =  2. 


=  2. 


To  simplify  the  algebra,  let  u  =  elz .  Then  e  lz  =  u  and  the  equation  becomes 


u  +  u 


=  2. 


Multiply  by  2  and  by  u  to  get  u2  +  1  =  4u  or  u2  —  Au  +  1  =  0.  Solve  this  equation 
by  the  quadratic  formula  to  find 

u  =  4  ±  v^16  ~  4  =  2  ±  V3,  or  eiz  =u  =  2±  V3. 

Take  logarithms  of  both  sides  of  this  equation,  and  solve  for  z: 

iz  =  ln(2  ±  \/3)  =  Ln(2  ±  \/3)  +  2mri, 
arc  cos  2  =  z  =  2nir  —  i  Ln(2  ±  \/3 )  =  2mr  ±  i  Ln(2  +  \/3 ) 

since  Ln(2  —  \/3)  =  —  Ln(2  +  -\/3). 

It  is  instructive  now  to  find  cosz  and  see  that  it  is  2.  For  iz  =  In  (2  ±  \/3),  we 


have 


eiz  =  eln(2±v/I)  =  2  ±  -s/3, 

1  1  2=f  V3 

~  2±  V3  _  4-3 


=  2t  V3- 


Then 


2±v/3  +  2tv/3  4  „ 

cos  z  = - - - = - - - =  -  =  2, 

2  2  2 


as  claimed. 

By  the  same  method,  we  can  find  all  the  inverse  trigonometric  and  hyperbolic 
functions  in  terms  of  logarithms.  (See  Problems,  Section  17.)  Here  is  one  more 
example. 

Example  2.  In  integral  tables  or  from  your  computer  you  may  find  for  the  indefinite 
integral 

/'  dx 


Vx2  +  a2 


(15.3) 
either 

(15.4)  sinh-1  —  or  ln(x  +  \/x2  +  a2). 

a 

How  are  these  related?  Put 

(15.5) 


z  =  sinh  1  —  or  —  =  sinh  z  = 
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We  solve  for  z  as  in  the  previous  example.  Let  ez  =  u,  e  2  =  1/u.  Then 


1  _  2x 
u  a  ’ 


(15.6) 


e  =  u  = 


au 2  —  2  xu  —  a  =  0, 

2x  ±  \J\x 2  +  4a2  a:  ±  y/x2  +  a2 

2a  a 


For  real  integrals,  that  is,  for  real  z,  e2  >  0,  so  we  must  use  the  positive  sign.  Then, 
taking  the  logarithm  of  (15.6)  we  have 


(15.7) 


z  —  In  (a;  +  y/  a ;2  +  a2  )  —  In  a. 


Comparing  (15.5)  and  (15.7)  we  see  that  the  two  answers  in  (15.4)  differ  only  by 
the  constant  In  a,  which  is  a  constant  of  integration. 

PROBLEMS,  SECTION  15 

Find  each  of  the  following  in  the  x  +  iy  form  and  compare  a  computer  solution. 


1.  arc  sin  2 
3.  cosh-1  (1/2) 
5.  arc  cos  (  iy 


7.  arc  tan  (iv  2 


9.  tanh  1  (W  3 


2.  arc  tan  2 i 

4.  sinh-1(i/2) 


6.  tanh  x(— i) 


8.  arcsin(5/3) 
10.  arc cos(5/4) 


11.  sinh  ( i/v  2 


12.  cosh 


3/2 


13.  cosh  (—1) 

15.  arctan(2  +  i) 


14.  arc  sin(3i/4) 
16.  tanh-1(l  —  2i) 


17.  Show  that  tan  z  never  takes  the  values  ±i.  Hint:  Try  to  solve  the  equation  tan  z  =  i 
and  find  that  it  leads  to  a  contradiction. 


18.  Show  that  tanh  2  never  takes  the  values  ±1. 


16.  SOME  APPLICATIONS 

Motion  of  a  Particle  We  have  already  seen  (end  of  Section  5)  that  the  path 
of  a  particle  in  the  (x,y)  plane  is  given  by  z  =  z(t).  As  another  example  of  this, 
suppose  z  =  1  +  3e2lt.  We  see  that 

(16.1)  |z-l|  =  |3e2it|  =  3. 

Recall  that  \z  —  1|  is  the  distance  between  the  points  z  and  1;  (16.1)  says  that  this 
distance  is  3.  Thus  the  particle  traverses  a  circle  of  radius  3,  with  center  at  (1,0). 
The  magnitude  of  its  velocity  is  \dz/dt\  =  |6i  e2lt|  =  6,  so  it  moves  around  the  circle 
at  constant  speed.  (Also  see  Problem  2). 
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PROBLEMS,  SECTION  16 

1.  Show  that  if  the  line  through  the  origin  and  the  point  z  is  rotated  90°  about  the 
origin,  it  becomes  the  line  through  the  origin  and  the  point  iz.  This  fact  is  sometimes 
expressed  by  saying  that  multiplying  a  complex  number  by  i  rotates  it  through  90° . 
Use  this  idea  in  the  following  problem.  Let  z  =  ae“f  be  the  displacement  of  a 
particle  from  the  origin  at  time  t.  Show  that  the  particle  travels  in  a  circle  of  radius 
a  at  velocity  v  =  aui  and  with  acceleration  of  magnitude  v2  /a  directed  toward  the 
center  of  the  circle. 

In  each  of  the  following  problems,  z  represents  the  displacement  of  a  particle  from  the 
origin.  Find  (as  functions  of  t )  its  speed  and  the  magnitude  of  its  acceleration,  and 
describe  the  motion. 

2.  z  =  belult ,  uj  =  const.  Hint:  See  Problem  1. 

3.  2  =  ( 1  +  i)e%t. 

4.  2  =  (1  +  i)t  —  (2  +  i)(l  —  t).  Hint:  Show  that  the  particle  moves  along  a  straight 
line  through  the  points  (1  +  i)  and  (—2  —  i). 

5.  2  =  zit  +  22(1  —  t).  Hint:  See  Problem  4;  the  straight  line  here  is  through  the 
points  21  and  22. 


Electricity  In  the  theory  of  electric  circuits,  it  is  shown  that  if  Vr  is  the  voltage 
across  a  resistance  R ,  and  I  is  the  current  flowing  through  the  resistor,  then 

(16.2)  Vr  =  IR  (Ohm’s  law). 


It  is  also  known  that  the  current  and  voltage  across  an  inductance  L  are  related  by 
(16.3)  VL  =  L ^ 


and  the  current  and  voltage  across  a  capacitor  are  related  by 


(16.4) 


dVc  =  I 
dt  C ’ 


where  C  is  the  capacitance.  Suppose  the  cur¬ 
rent  I  and  voltage  V  in  the  circuit  of  Figure 
16.1  vary  with  time  so  that  I  is  given  by 

(16.5)  /  =  Iq  sin  Lot. 

You  can  verify  that  the  following  voltages  across 
R,  L,  and  C  are  consistent  with  (16.2),  (16.3), 
and  (16.4): 

(16.6)  Vr  =  RIq  sin  Lot, 

(16.7)  Vl  =  ojLIq  cos  cot, 

(16.8)  Vc  =  — locos  wf. 


R 


Vr 


The  total  voltage 
(16.9) 


V  =  Vr  +  Vl  +  Vc 
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is  then  a  complicated  function.  A  simpler  method  of  discussing  a-c  circuits  uses 
complex  quantities  as  follows.  Instead  of  (16.5)  we  write 

(16.10)  I  =  I0eiut, 


where  it  is  understood  that  the  actual  physical  current  is  given  by  the  imaginary 
part  of  /  in  (16.10),  that  is,  by  (16.5).  Note,  by  comparing  (16.5)  and  (16.10),  that 
the  maximum  value  of  J,  namely  io,  is  given  in  (16.10)  by  |/|.  Now  equations  (16.6) 
to  (16.9)  become 


(16.11) 

(16.12) 

(16.13) 

(16.14) 


VR  =  RI0  eiut  =  RI, 

Vl  =  iojLIo  e lu,i  =  iujLI, 


Vc  = 


1 


ILOC 


T  „iut  _ 

2  o  e  — 


1 


V  =  Vr  +  Vl  +  Vc  = 


R 


iioC 

i  I  loL  — 


I. 


The  complex  quantity  Z  defined  by 


(16.15)  Z  =  R  +  i\u)L- 

is  called  the  (complex)  impedance.  Using  it  we 


can  write  (16.14)  as 


(16.16) 


V  =  ZI 


which  looks  much  like  Ohm’s  law.  In  fact,  Z  for  an  a-c  circuit  corresponds  to  R  for 
a  d-c  circuit.  The  more  complicated  a-c  circuit  equations  now  take  the  same  simple 
form  as  the  d-c  equations  except  that  all  quantities  are  complex.  For  example,  the 
rules  for  combining  resistances  in  series  and  in  parallel  hold  for  combining  complex 
impedances  (see  Problems  below). 


PROBLEMS,  SECTION  16 


In  electricity  we  learn  that  the  resistance  of  two  resistors  in  series  is  Ai  +  R2  and  the 
resistance  of  two  resistors  in  parallel  is  (-Rj-1  +  A^1)^1.  Corresponding  formulas  hold  for 
complex  impedances.  Find  the  impedance  of  Z\  and  Z2  in  series,  and  in  parallel,  given: 

6.  (a)  Zi  =  2  +  3i,  Z2  =  1  -  hi  (b)  Zx  =  2V3ei'R/6,  Z2  =  2e2i7r/3 

7.  (a)Zi  =  l— i,  Z2=3i  (b)  |Zi|  =3.16,  0i  =  18.4°;  \Z2\  =  4.47,  02  =  63.4° 


8.  Find  the  impedance  of  the  circuit  in  Figure  16.2  (A 
and  L  in  series,  and  then  C  in  parallel  with  them).  A 
circuit  is  said  to  be  in  resonance  if  Z  is  real;  find  u>  in 
terms  of  A,  L,  and  C  at  resonance. 


9.  For  the  circuit  in  Figure  16.1:  Figure  16.2 

(a)  Find  u  in  terms  of  A,  L,  and  C  if  the  angle  of  Z  is  45°. 

(b)  Find  the  resonant  frequency  u>  (see  Problem  8). 

10.  Repeat  Problem  9  for  a  circuit  consisting  of  A,  L,  and  C,  all  in  parallel. 
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Optics  In  optics  we  frequently  need  to  combine  a  number  of  light  waves  (which 
can  be  represented  by  sine  functions) .  Often  each  wave  is  “out  of  phase”  with  the 
preceding  one  by  a  fixed  amount;  this  means  that  the  waves  can  be  written  as  sinf, 
sin(t  +  <5),  sin(f  +  28),  and  so  on.  Suppose  we  want  to  add  all  these  sine  functions 
together.  An  easy  way  to  do  it  is  to  see  that  each  sine  is  the  imaginary  part  of  a 
complex  number,  so  what  we  want  is  the  imaginary  part  of  the  series 


(16.17) 


eit  _|_  e i(t+S)  _|_  ei(t+2<5) 


This  is  a  geometric  progression  with  first  term  ezt  and  ratio  elS.  If  there  are  n  waves 
to  be  combined,  we  want  the  sum  of  n  terms  of  this  progression,  which  is 


(16.18) 


eii(l  _  em<5) 
1  -  eiS 


We  can  simplify  this  expression  by  writing 

(16.19)  l-eiS  =  eiS/2(e-i5/2  -  ei5/2)  =  - eiS/ 2  ■  2 i  sin  ^ 

by  (11.3).  Substituting  (16.19)  and  a  similar  formula  for  (1  —  emS )  into  (16.18),  we 
get 


e^e™6/2  sm{n8/2)  =  i(t+\ln-i)/2]si  sm(n5/2) 
ei&/i  sin(5/2)  sin(<5/2) 


The  imaginary  part  of  the  series  (16.17)  which  we  wanted  is  then  the  imaginary 
part  of  (16.20),  namely 


sin 


PROBLEMS,  SECTION  16 


11.  Prove  that 


cos  8  +  cos  3  6  +  cos  59  +  ■  ■  ■  +  cos(2n  —  1)8 
sin  9  +  sin  3  8  +  sin  59  +  ■  •  •  +  sin(2n  —  1)9 


sin  2nd 
2  sin  (9 
sin2  nd 
sin  9 


Hint:  Use  Euler’s  formula  and  the  geometric  progression  formula. 

12.  In  optics,  the  following  expression  needs  to  be  evaluated  in  calculating  the  intensity 
of  light  transmitted  through  a  film  after  multiple  reflections  at  the  surfaces  of  the 
film: 

(  V"'  r2n  cos  n8  ]  +  (  V"'  r2n  sin  n8  ]  . 

Show  that  this  is  equal  to  |  r2nelTie\2  and  so  evaluate  it  assuming  |r|  <  1  (r  is 

the  fraction  of  light  reflected  each  time). 
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Simple  Harmonic  Motion  It  is  very  convenient  to  use 
complex  notation  even  for  motion  along  a  straight  line. 
Think  of  a  mass  m  attached  to  a  spring  and  oscillating 
up  and  down  (see  Figure  16.3).  Let  y  be  the  vertical  dis¬ 
placement  of  the  mass  from  its  equilibrium  position  (the 
point  at  which  it  would  hang  at  rest).  Recall  that  the 
force  on  m  due  to  the  stretched  or  compressed  spring  is 
then  — ky ,  where  k  is  the  spring  constant,  and  the  mi¬ 
nus  sign  indicates  that  the  force  and  displacement  are  in 
opposite  directions.  Then  Newton’s  second  law  (force  = 
mass  times  acceleration)  gives 


Figure  16.3 


(16.21) 


-ky 


or 


d2y 
dt 2 


— y  =  -u  y 

m 


if 


w2  = 


k 

m 


Now  we  want  a  function  y(t)  with  the  property  that  differentiating  it  twice  just 
multiplies  it  by  a  constant.  You  can  easily  verify  that  this  is  true  for  exponentials, 
sines,  and  cosines  (see  problem  13).  Just  as  in  discussing  electric  circuits  (see 
(16.10)),  we  may  write  a  solution  of  (16.21)  as 


(16.22) 


y  =  y  oe 


iujt 


with  the  understanding  that  the  actual  physical  displacement  is  either  the  real  or  the 
imaginary  part  of  (16.22).  The  constant  u>  =  \J k/m  is  called  the  angular  frequency 
(see  Chapter  7,  Section  2).  We  will  use  this  notation  in  Chapter  3,  Section  12. 


PROBLEMS,  SECTION  16 


13.  Verify  that  e 


iut  ^  —  iut 


cos u>t,  and  sin  uit  satisfy  equation  (16.21). 


17.  MISCELLANEOUS  PROBLEMS 

Find  one  or  more  values  of  each  of  the  following  complex  expressions  and  compare  with  a 
computer  solution. 


1. 


1  +  i 
1  —  i 


2. 


f  l  +  iy/3 

\V2  +  iV2 


3.  ty-4  -  4 i 


4.  sinh(l  +  jvr/2) 

5. 

tanh(i7r/4) 

6. 

/  \  in 

(-e) 

7.  (-i)* 

8. 

COS 

_ .  1  —  i 

2 1  In  - : 

l  +  i 

9. 

arc  sin 

'/V3  +  A12' 
VV3- i) 

10.  e 


2 i  arc  tan(i\/3) 


12.  e 


i  arc  sin  i 


nz.  La.ii  xi  i 

e 

13.  Find  real  x  and  y  for  which  \z  +  3|  =  1  —  iz,  where  z  =  x  +  iy. 

14.  Find  the  disk  of  convergence  of  the  series  XX2  —  2 i)n /n. 

15.  For  what  2  is  the  series  ^2zllln  absolutely  convergent?  Hints:  Use  equation  (14.1). 
Also  see  Chapter  1,  Problem  6.15. 

16.  Describe  the  set  of  points  2  for  which  R e(ex'n^2z)  >  2. 
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Verify  the  formulas  in  Problems  17  to  24. 

17.  arc  sin  z  =  —  i  ln(i2  ±  y/l  —  z2) 

18. 

19. 


arc  cos  z  =  i  In (2  ±  \J  z2  —  1) 

arc  tan  z  =  —  In  -  + 

2i  1  —  iz 


20. 

21. 

22. 

23. 

24. 

25. 


sinh  1  z  =  ln(«  ±  y/  z2  +  1) 

cosh-1  z  =  In (z  ±  \J  z2  —  1)  =  ±  In (z  +  \J z2  —  1) 

.  ,  -i  11  +  2 

tanh  2  =  —  In - 

2  1-2 

COS  22  —  cosh  2 
cosh  iz  =  cos  2 

(a)  Show  that  cos  2  =  cos  2. 

(b) 

(c) 


26. 

27. 

28. 


(d) 

(e) 

Find 


Is  sin  2  =  sin  2? 

If  /0)  =  l  +  iz,  is  f(z)  =  /(f)? 

If  f(z)  is  expanded  in  a  power  series  with  real  coefficients,  show  that  f(z)  = 
/(*)■ 

Using  part  (d),  verify,  without  computing  its  value,  that  i[sinh(l  +  i)  —  sinh(l  —  i)] 
is  real. 


2elt>  -  i 


Hint:  See  equation  (5.1). 


ieie  +  2 

(a)  Show  that  Re2  =  \(z  +  z)  and  that  Im2  =  (1/2 i)(z  —  z). 

(b)  Show  that  \ez\2  =  e2Rez. 

(c)  Use  (b)  to  evaluate  \e('1+lx'>  2  which  occurs  in  quantum  mechanics. 

Evaluate  the  following  absolute  square  of  a  complex  number  (which  arises  in  a 
problem  in  quantum  mechanics).  Assume  a  and  b  are  real.  Express  your  answer  in 
terms  of  a  hyperbolic  function. 

(a  +  bi)2eb  —  (a  —  bi)2e~ 


Aabie~‘la 


29. 

30. 

31. 


If  2  =  —  and  — - 
b  a  +  b 


=  —  +  i  find  2. 
a  b 


Write  the  series  for  ex^1+t'> .  Write  1  +  i  in  the  re*e  form  and  so  obtain  (easily)  the 
powers  of  (1  +i).  Thus  show,  for  example,  that  the  e^cosa:  series  has  no  x2  term, 
no  x6  term,  etc.,  and  a  similar  result  for  the  e^sina;  series.  Find  (easily)  a  formula 
for  the  general  term  for  each  series. 

Show  that  if  a  sequence  of  complex  numbers  tends  to  zero,  then  the  sequence  of 
absolute  values  tends  to  zero  too,  and  vice  versa.  Hint:  an  +  ibn  — >  0  means  an  — >  0 
and  bn  —>  0. 


Use  a  series  you  know  to  show  that 


(1  +  in)r‘ 


32. 


=  — e. 


CHAPTER 


3 

Linear  Algebra 


1.  INTRODUCTION 

In  this  chapter,  we  are  going  to  discuss  a  combination  of  algebra  and  geometry 
which  is  important  in  many  applications.  As  you  know,  problems  in  various  fields 
of  science  and  mathematics  involve  the  solution  of  sets  of  linear  equations.  This 
sounds  like  algebra,  but  it  has  a  useful  geometric  interpretation.  Suppose  you  have 
solved  two  simultaneous  linear  equations  and  have  found  x  =  2  and  y  =  —  3.  We 
can  think  of  x  =  2,  y  =  —3  as  the  point  (2,  —3)  in  the  (x,  y)  plane.  Since  two  linear 
equations  represent  two  straight  lines,  the  solution  is  then  the  point  of  intersection  of 
the  lines.  The  geometry  helps  us  to  understand  that  sometimes  there  is  no  solution 
(parallel  lines)  and  sometimes  there  are  infinitely  many  solutions  (both  equations 
represent  the  same  line). 

The  language  of  vectors  is  very  useful  in  studying  sets  of  simultaneous  equations. 
You  are  familiar  with  quantities  such  as  the  velocity  of  an  object,  the  force  acting 
on  it,  or  the  magnetic  field  at  a  point,  which  have  both  magnitude  and  direction. 
Such  quantities  are  called  vectors ;  contrast  them  with  such  quantities  as  mass,  time, 
or  temperature,  which  have  magnitude  only  and  are  called  scalars.  A  vector  can 
be  represented  by  an  arrow  and  labeled  by  a  boldface  letter  (A  in  Figure  1.1;  also 
see  Section  4).  The  length  of  the  arrow  tells  us  the  magnitude  of  the  vector  and 
the  direction  of  the  arrow  tells  us  the  direction  of  the  vector.  It  is  not  necessary 
to  use  coordinate  axes  as  in  Figure  1.1;  we  can,  for  example,  point  a  finger  to  tell 
someone  which  way  it  is  to  town  without  knowing  the  direction  of  north.  This 
is  the  geometric  method  of  discussing  vectors  (see  Section  4).  However,  if  we  do 
use  a  coordinate  system  as  in  Figure  1.1,  we  can  specify 
the  vector  by  giving  its  components  Ax  and  Ay  which  are 
the  projections  of  the  vector  on  the  x  axis  and  the  y  axis. 

Thus  we  have  two  distinct  methods  of  defining  and  working 
with  vectors.  A  vector  may  be  a  geometric  entity  (arrow), 
or  it  may  be  a  set  of  numbers  (components  relative  to  a 
coordinate  system)  which  we  use  algebraically.  As  we  shall 
see,  this  double  interpretation  of  everything  we  do  makes 
the  use  of  vectors  a  very  powerful  tool  in  applications. 

One  of  the  great  advantages  of  vector  formulas  is  that  they  are  independent  of 
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the  choice  of  coordinate  system.  For  example,  suppose  we  are  discussing  the  motion 
of  a  mass  m  sliding  down  an  inclined  plane.  Newton’s  second  law  F  =  ma.  is  then  a 
correct  equation  no  matter  how  we  choose  our  axes.  We  might,  say,  take  the  x  axis 
horizontal  and  the  y  axis  vertical,  or  alternatively  we  might  take  the  x  axis  along 
the  inclined  plane  and  the  y  axis  perpendicular  to  the  plane.  Fx  would,  of  course, 
be  different  in  the  two  cases,  but  for  either  case  it  would  be  true  that  Fx  =  max 
and  Fy  =  may,  that  is,  the  vector  equation  F  =  ma  would  be  true. 

As  we  have  just  seen,  a  vector  equation  in  two  dimensions  is  equivalent  to  two 
component  equations.  In  three  dimensions,  a  vector  equation  is  equivalent  to  three 
component  equations.  We  will  find  it  useful  to  generalize  this  to  n  dimensions  and 
think  of  a  set  of  n  equations  in  n  unknowns  as  the  component  equations  for  a  vector 
equation  in  an  n  dimensional  space  (Section  10). 

We  shall  also  be  interested  in  sets  of  linear  equations  which  you  can  think  of  as 
changes  of  variable,  say 


n  n  /  x'  =  ax  +  by , 

1  1  y'  =  cx  +  dy, 

where  a,  b,  c,  d,  are  constants.  Alternatively,  we  can  think  of  (1.1)  geometrically  as 
telling  us  to  move  each  point  (x ,  y)  to  another  point  ( x',y' ),  an  operation  we  will 
refer  to  as  a  transformation  of  the  plane.  Or  if  we  think  of  ( x,y )  and  (x',y')  as 
being  components  of  vectors  from  the  origin  to  the  given  points,  then  (1.1)  tells  us 
how  to  change  each  vector  in  the  plane  to  another  vector.  Equations  (1.1)  could 
also  correspond  to  a  change  of  axes  (say  a  rotation  of  axes  around  the  origin)  where 
(x,  y)  and  (x' ,  y')  are  the  coordinates  of  the  same  point  relative  to  different  axes.  We 
will  learn  (Sections  11  and  12)  how  to  choose  the  best  coordinate  system  or  set  of 
variables  to  use  in  solving  various  problems.  The  same  methods  and  tools  (such  as 
matrices  and  determinants)  which  can  be  used  to  solve  sets  of  numerical  equations 
are  what  we  need  to  work  with  transformations  and  changes  of  coordinate  system. 
After  we  have  considered  2-  and  3-dimensional  space,  we  will  extend  these  ideas 
to  n-dimensional  space  and  finally  to  a  space  in  which  the  “vectors”  are  functions. 
This  generalization  is  of  great  importance  in  applications. 


2.  MATRICES;  ROW  REDUCTION 

A  matrix  (plural:  matrices)  is  just  a  rectangular  array  of  quantities,  usually  inclosed 
in  large  parentheses,  such  as 

M  a-(4  »  1)- 

We  will  ordinarily  indicate  a  matrix  by  a  roman  letter  such  as  A  (or  B,  C,  M,  r, 
etc.),  but  the  letter  does  not  have  a  numerical  value;  it  simply  stands  for  the  array. 
To  indicate  a  number  in  the  array,  we  will  write  Al3  where  i  is  the  row  number 
and  j  is  the  column  number.  For  example,  in  (2.1),  An  =  1,  A 12  =  5,  A13  =  —2, 
A21  =  —3,  A22  =  0,  A23  =  6.  We  will  call  a  matrix  with  m  rows  and  n  columns 
an  m  by  n  matrix.  Thus  the  matrix  in  (2.1)  is  a  2  by  3  matrix,  and  the  matrix  in 
(2.2)  below  is  a  3  by  2  matrix. 
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Transpose  of  a  Matrix  We  write 

(2.2)  aT  =  (  5  0 j  , 

and  call  AT  the  transpose  of  the  matrix  A  in  (2.1).  To  transpose  a  matrix,  we 
simply  write  the  rows  as  columns,  that  is,  we  interchange  rows  and  columns.  Note 
that,  using  index  notation,  we  have  (AT)y  =  Aji.  You  will  find  a  summary  of 
matrix  notation  in  Section  9. 

Sets  of  Linear  Equations  Historically  linear  algebra  grew  out  of  efforts  to  find 
efficient  methods  for  solving  sets  of  linear  equations.  As  we  have  said,  the  subject 
has  developed  far  beyond  the  solution  of  sets  of  numerical  equations  (which  are 
easily  solved  by  computer),  but  the  ideas  and  methods  developed  for  that  purpose 
are  needed  in  later  work.  A  simple  way  to  learn  these  techniques  is  to  use  them 
to  solve  some  numerical  problems  by  hand.  In  this  section  and  the  next  we  will 
develop  methods  of  working  with  sets  of  linear  equations,  and  introduce  definitions 
and  notation  which  will  be  useful  later.  Also,  as  you  will  see,  we  will  discover  how 
to  tell  whether  a  given  set  of  equations  has  a  solution  or  not. 

Example  1.  Consider  the  set  of  equations 

(  2x  —  z  =  2, 

(2.3)  <  6x  +  by  +  3z  =  7, 

[  2x  —  y  = 4 . 

Let’s  agree  always  to  write  sets  of  equations  in  this  standard  form  with  the  x  terms 
lined  up  in  a  column  (and  similarly  for  the  other  variables),  and  with  the  constants 
on  the  right  hand  sides  of  the  equations.  Then  there  are  several  matrices  of  interest 
connected  with  these  equations.  First  is  the  matrix  of  the  coefficients  which  we  will 
call  M: 

(2  0  -!\ 

(2.4)  M  =  6  5  3  . 

\2  -i  v 

Then  there  are  two  3  by  1  matrices  which  we  will  call  r  and  k: 


If  we  use  index  notation  and  replace  x,  y,  z,  by  x\,  X2,  £3,  and  call  the  constants 
fci,  /c2,  A3 ,  then  we  could  write  the  equations  (2.3)  in  the  form  (Problem  1) 

3 

(2.6)  Mjj  Xj  =  kj,  i=  1,2,3. 

j= 1 

It  is  interesting  to  note  that,  as  we  will  see  in  Section  6,  this  is  exactly  how  matrices 
are  multiplied,  so  we  will  learn  to  write  sets  of  equations  like  (2.3)  as  Mr  =  k. 
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For  right  now  we  are  interested  in  the  fact  that  we  can  display  all  the  essential 
numbers  in  equations  (2.3)  as  a  matrix  known  as  the  augmented  matrix  which  we 
call  A.  Note  that  the  first  three  columns  of  A  are  just  the  columns  of  M,  and  the 
fourth  column  is  the  column  of  constants  on  the  right  hand  sides  of  the  equations. 

/  2  0  -1  2\ 

(2.7)  A  =  6  5  3  7. 

\2  -1  0  4/ 

Instead  of  working  with  a  set  of  equations  and  writing  all  the  variables,  we  can  just 
work  with  the  matrix  (2.7).  The  process  which  we  are  going  to  show  is  called  row 
reduction  and  is  essentially  the  way  your  computer  solves  a  set  of  linear  equations. 
Row  reduction  is  just  a  systematic  way  of  taking  linear  combinations  of  the  given 
equations  to  produce  a  simpler  but  equivalent  set  of  equations.  We  will  show  the 
process,  writing  side-by-side  the  equations  and  the  matrix  corresponding  to  them. 


(a)  The  first  step  is  to  use  the  first  equation  in  (2.3)  to  eliminate  the  x  terms  in 
the  other  two  equations.  The  corresponding  matrix  operation  on  (2.7)  is  to  subtract 
3  times  the  first  row  from  the  second  row  and  subtract  the  first  row  from  the  third 
row.  This  gives: 


(  2x  —  2  = 

2, 

/2 

0 

-1 

2\ 

<  by  +  62  = 

1, 

° 

5 

6 

1 

l  -  y  +  2  = 

2. 

V« 

-1 

1 

V 

(b) 


Now  it  is  convenient  to  interchange  the  second 


and  third  equations  to  get: 


(  2x  —  2  = 

2, 

/2 

0 

-1 

2\ 

-  y  +  Z  = 

2, 

° 

-1 

1 

2 

[  by  +  62  = 

1. 

V° 

5 

6 

V 

(c)  Next  we  use  the  second  equation  to  eliminate  the 
equations: 


y  terms  from  the  other 


(  2x  —  2  = 

2, 

/  2 

0 

-1 

2  \ 

1  -  y  +  z  = 

2, 

0 

-1 

1 

2 

[  11 2  = 

11. 

v° 

0 

11 

w 

(d)  Finally,  we  divide  the  third  equation  by  11  and  then  use  it  to  eliminate  the 
2:  terms  from  the  other  equations: 


( 2x  =3, 

/  2 

0 

0 

3\ 

-y  =1, 

0 

-1 

0 

1 

1  2  =  1. 

v° 

0 

1 

V 

It  is  customary  to  divide  each  equation  by  the  leading  coefficient  so  that  the  equa¬ 
tions  read  x  =  3/2,  y  =  —1,  2  =  1.  The  row  reduced  matrix  is  then: 


/I  0  0  3/2\ 

0  1  0  — 1  J  . 

\0  0  1  1  / 

The  important  thing  to  understand  here  is  that  in  finding  a  row  reduced  matrix 

we  have  just  taken  linear  combinations  of  the  original  equations.  This  process  is 
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reversible,  so  the  final  simple  equations  are  equivalent  to  the  original  ones.  Let’s 
summarize  the  allowed  operations  in  row  reducing  a  matrix  (called  elementary  row 
operations). 


(2.8)  i.  Interchange  two  rows  [see  step  (b)]; 

ii.  Multiply  (or  divide)  a  row  by  a  (nonzero)  constant  [see  step  (d)]; 

iii.  Add  a  multiple  of  one  row  to  another;  this  includes  subtracting, 
that  is,  using  a  negative  multiple  [see  steps  (a)  and  (c)]. 


Example  2.  Write  and  row  reduce  the  augmented  matrix  for  the  equations: 

{x  —  y  +  Az  =  5, 

2x  —  3y  +  82  =  4, 
x  —  2y  +  Az  =  9. 

This  time  we  won’t  write  the  equations,  just  the  augmented  matrix.  Remember  the 
routine:  Use  the  first  row  to  clear  the  rest  of  the  first  column;  use  the  new  second 
row  to  clear  the  rest  of  the  second  column;  etc.  Also,  since  matrices  are  equal  only 
if  they  are  identical,  we  will  not  use  equal  signs  between  them.  Let’s  use  arrows. 

/I  -1  4  5\  /I  -14  5\  /I  0  4  11  \ 

2  -3  8  4  —4  0  -1  0  -6  ->  0  -1  0  -6 

\1  -2  4  9/  \0  -10  4/  \0  0  0  -20/ 

We  don’t  need  to  go  any  farther!  The  last  row  says  0  •  z  =  —20  which  isn’t  true  for 
any  finite  value  of  z.  Now  you  see  why  your  computer  doesn’t  give  an  answer — there 
isn’t  any.  We  say  that  the  equations  are  inconsistent.  If  this  happens  for  a  set  of 
equations  you  have  written  for  a  physics  problem,  you  know  to  look  for  a  mistake. 


Rank  of  a  Matrix  There  is  another  way  to  discuss  Example  2  using  the  following 
definition:  The  number  of  nonzero  rows  remaining  when  a  matrix  has  been  row 
reduced  is  called  the  rank  of  the  matrix.  (It  is  a  theorem  that  the  rank  of  AT  is  the 
same  as  the  rank  of  A.)  Now  look  at  the  reduced  augmented  matrix  for  Example  2; 
it  has  3  nonzero  rows  so  its  rank  is  3.  But  the  matrix  M  (matrix  of  the  coefficients 
=  first  three  columns  of  A)  has  only  2  nonzero  rows  so  its  rank  is  2.  Note  that 
(rank  of  M)  <  (rank  of  A)  and  the  equations  are  inconsistent. 

Example  3.  Consider  the  equations 

{x  +  2y  —  z  =  4, 

2x  -  z  =  1, 

x  —  2  y  =  —3. 

Either  by  hand  or  by  computer  we  row  reduce  the  augmented  matrix  to  get: 


1 

2  -1 

4\ 

A 

0 

-1/2 

l/2\ 

2 

0  -1 

1 

4  0 

1 

-1/4 

7/4 

-2  0 

-3/ 

\o 

0 

0 

0  / 
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The  last  row  of  zeros  tells  us  that  there  are  infinitely  many  solutions.  For  any  z  we 
find  from  the  first  two  rows  that  x  =  (z  +  l)/2  and  y  =  (2  +  7)/4.  Here  we  see  that 
the  rank  of  M  and  the  rank  of  A  are  both  2  but  the  number  of  unknowns  is  3,  and 
we  are  able  to  find  two  unknowns  in  terms  of  the  third. 

To  make  this  all  very  clear,  let’s  look  at  some  simple  examples  where  the  re¬ 
sults  are  obvious.  We  write  three  sets  of  equations  together  with  the  row  reduced 
matrices: 


(2.11) 


(  x  +  y  =  2, 
\  x  +  y  =  5. 


1  1 
0  0 


(2.12) 


f  x  +  y  =  2, 
\  2x  +  2y  =  4. 


112 
0  0  0 


(2.13) 


(x  +  y  =  2, 
\  x  -  y  =  4. 


In  (2.11),  since  x  +  y  can’t  be  equal  to  both  2  and  5,  it  is  clear  that  there  is  no 
solution;  the  equations  are  inconsistent.  Note  that  the  last  row  of  the  reduced 
matrix  is  all  zeros  except  for  the  last  entry  and  so  (rank  M)  <  (rank  A).  In  (2.12), 
the  second  equation  is  just  twice  the  first  so  they  are  really  the  same  equation;  we 
say  that  the  equations  are  dependent.  There  is  an  infinite  set  of  solutions,  namely 
all  points  on  the  line  y  =  2  —  x.  Note  that  the  last  line  of  the  matrix  is  all  zeros;  this 
indicates  linear  dependence.  We  have  (rank  A)  =  (rank  M)  =  1,  and  we  can  solve 
for  one  unknown  in  terms  of  the  other.  Finally  in  (2.13)  we  have  a  set  of  equations 
with  one  solution,  x  =  3,  y  =  —1,  and  we  see  that  the  row  reduced  matrix  gives 
this  result.  Note  that  (rank  A)  =  (rank  M)  =  number  of  unknowns  =  2. 

Now  let’s  consider  the  general  problem  of  solving  m  equations  in  n  unknowns. 
Then  M  has  m  rows  (corresponding  to  in  equations)  and  n  columns  (corresponding 
to  n  unknowns)  and  A  has  one  more  column  (the  constants) .  The  following  summary 
outlines  the  possible  cases. 


a.  If  (rank  M)  <  (rank  A) ,  the  equations  are  inconsistent  and  there 
is  no  solution. 

b.  If  (rankM)  =  (rank  A)  =  n  (number  of  unknowns),  there  is  one 
solution. 

c.  If  (rank  M)  =  (rank  A)  =  R  <  n,  then  R  unknowns  can  be  found 
in  terms  of  the  remaining  n  —  R  unknowns. 


Example  4.  Here  is  a  set  of  equations  and  the  row  reduced  matrix: 


(  X  + 

y  - 

z  = 

=  7, 

(1 

0 

-2 

3\ 

\  2x  - 

y  - 

52  = 

=  2, 

0 

l 

1 

4 

1  -5x  + 

Ay  + 

142  = 

=  1, 

0 

0 

0 

0 

{  3x  - 

y  - 

72  = 

=  5. 

V> 

0 

0 

0/ 

(2.15) 


88  Linear  Algebra 


Chapter  3 


From  the  reduced  matrix,  the  solution  is  x  =  3  +  2 z,  y  =  4  —  z.  We  see  that  this 
is  an  example  of  (2.14c)  with  m  =  4  (number  of  equations),  n  =  3  (number  of 
unknowns),  (rank  M)  =  (rank  A)  =  R =  2  <  n  =  3.  Then  by  (2.14c),  we  solve  for 
R  =  2  unknowns  ( x  and  y)  in  terms  of  the  n  —  R  =  1  unknown  (z). 


PROBLEMS,  SECTION  2 

1.  The  first  equation  in  (2.6)  written  out  in  detail  is 

Miian  +  M12X2  +  M13X3  =  ki. 

Write  out  the  other  two  equations  in  the  same  way  and  then  substitute  an, an, X3  = 
x,  y,  z  and  the  values  of  Mij  and  ki  from  (2.4)  and  (2.5)  to  verify  that  (2.6)  is 
really  (2.3). 

2.  As  in  Problem  1,  write  out  in  detail  in  terms  of  Mij,  Xj,  and  ki,  equations  like  (2.6) 
for  two  equations  in  four  unknowns;  for  four  equations  in  two  unknowns. 

For  each  of  the  following  problems  write  and  row  reduce  the  augmented  matrix  to  find  out 
whether  the  given  set  of  equations  has  exactly  one  solution,  no  solutions,  or  an  infinite  set 
of  solutions.  Check  your  results  by  computer.  Warning  hint:  Be  sure  your  equations  are 
written  in  standard  form.  Comment:  Remember  that  the  point  of  doing  these  problems 
is  not  just  to  get  an  answer  (which  your  computer  will  give  you),  but  to  become  familiar 
with  the  terminology,  ideas,  and  notation  we  are  using. 


3. 


5. 


7. 


9. 


11. 


13. 


x  —  2y  +  13  =  0 
y  —  Ax  =  17 

2x  +  y  —  2  =  2 

4x  +  2y  —  2z  =  3 

’  2x  +  3y  =  1 
x  +  2y  =  2 
x  +  3y  =  5 

x  —  y  +  2z  =  5 
2x  +  3y  —  2  =  4 
2x  —  2y  +  4z  =  6 

x  —  2y  =4 
5*  +2  =  7 

x  +  2y  —  2  =  3 

4x  +  6y  —  12  2  =  7 

5a:  —  2  y  +  4  2  =  —15 
3x  +  4y  —  82  =  4 


4. 


6. 


8. 


10. 


12. 


14. 


Find  the  rank  of  each  of  the  following  matrices. 

2\ 


15. 


2x  +  y  —  2  =  2 
4x  +  y  —  2z  =  3 

x  +  y  —  2  =  1 

3x  +  2y  —  2z  =  3 

—x  +  y  —  2  =  4 

x  —  y  +  2z  =  3 
2x  —  2y  +  42  =  6 

*  +  2y  —  z  =  1 

2a;  +  3y  —  2z  =  —  1 

_  3a;  +  4j/  —  32  =  — 4 

"  2a;  +  5  y  +  2  =  2 

x  +  y  +  2z  =  1 

x  +  52  =  3 

'  2x  +  3y  —  2  =  — 2 

x  +  2y  —  2  =  4 

4x  +  7y  —  3z  =  11 


16. 


17. 


A 

1 

4 

(  1 

0 

1 

°\ 

3 

1 

10 

7 

18. 

-1 

-2 

-1 

0 

4 

0 

2 

n 

14 

a 

10 

A 

2 

O 

2 

A 

5 

O 

3 

a  1 
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3.  DETERMINANTS;  CRAMER'S  RULE 

We  have  said  that  a  matrix  is  simply  a  display  of  a  set  of  numbers;  it  does  not 
have  a  numerical  value.  For  a  square  matrix,  however,  there  is  a  useful  number 
called  the  determinant  of  the  matrix.  Although  a  computer  will  quickly  give  the 
value  of  a  determinant,  we  need  to  know  what  this  value  means  in  order  to  use  it 
in  applications.  [See,  for  example,  equations  (4.19),  (6.24)  and  (8.5).]  We  also  need 
to  know  how  to  work  with  determinants.  An  easy  way  to  learn  these  things  is  to 
solve  some  numerical  problems  by  hand.  We  shall  outline  some  of  the  facts  about 
determinants  without  proofs  (for  more  details,  see  linear  algebra  texts). 

Evaluating  Determinants  To  indicate  that  we  mean  the  determinant  of  a  square 
matrix  A  (written  det  A),  we  replace  the  large  parentheses  inclosing  A  by  single  bars. 
The  value  of  det  A  if  A  is  a  1  by  1  matrix  is  just  the  value  of  the  single  element. 
For  a  2  by  2  matrix, 

(3.1)  A=(»  ‘),  det  A  = 

Equation  (3.1)  gives  the  value  of  a  second  order  determinant.  We  shall  describe 
how  to  evaluate  determinants  of  higher  order. 

First  we  need  some  notation  and  definitions.  It  is  convenient  to  write  an  nth  or¬ 
der  determinant  like  this: 


an 

012 

Ol3  ’ 

n 

021 

022 

023  ’ 

&2n 

031 

a  32 

033  ’ 

<23  n 

Onl 

On2 

Ori3 

a  b 
c  d 


=  og?  —  be. 


Notice  that  023  is  the  element  in  the  second  row  and  the  third  column;  that  is,  the 
first  subscript  is  the  number  of  the  row  and  the  second  subscript  is  the  number  of 
the  column  in  which  the  element  is.  Thus  the  element  aij  is  in  row  i  and  column  j. 
As  an  abbreviation  for  the  determinant  in  (3.2),  we  sometimes  write  simply  |a,j|, 
that  is,  the  determinant  whose  elements  are  a ij.  In  this  form  it  looks  exactly  like 
the  absolute  value  of  the  element  a^  and  you  have  to  tell  from  the  context  which 
of  these  meanings  is  intended. 

If  we  remove  one  row  and  one  column  from  a  determinant  of  order  n,  we  have 
a  determinant  of  order  n  —  1.  Let  us  remove  the  row  and  column  containing  the 
element  Ojj  and  call  the  remaining  determinant  M.tj .  The  determinant  My  is  called 
the  minor  of  a%j.  For  example,  in  the  determinant 


(3.3) 


1-5  2 
7  3  4 

2  15 


the  minor  of  the  element  a.23  =  4  is 


A/23 


! 

2 


-5 
1  ’ 
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obtained  by  crossing  off  the  row  and  column  containing  4.  The  signed  minor 
(— l)!+3My  is  called  the  cofactor  of  a,j.  In  (3.3),  the  element  4  is  in  the  sec¬ 
ond  row  ( i  =  2)  and  third  column  (j  =  3),  so  i  +  j  =  5,  and  the  cofactor  of  4  is 
(— 1)5M23  =  —11.  It  is  very  convenient  to  get  the  proper  sign  (plus  or  minus)  for 
the  factor  (— 1)*+-'  by  thinking  of  a  checkerboard  of  plus  and  minus  signs  like  this: 


(3.4) 


+  -  +  - 

-  +  -  + 

+  —  +  —  etc. 


etc. 


+ 


+ 


Then  the  sign  (— 1  )l+-?  to  be  attached  to  My  is  just  the  checkerboard  sign  in  the 
same  position  as  at] .  For  the  element  023,  you  can  see  that  the  checkerboard  sign 
is  minus. 


Now  we  can  easily  say  how  to  find  the  value  of  a  determinant:  Multiply  each 
element  of  one  row  (or  one  column)  by  its  cofactor  and  add  the  results.  It  can  be 
shown  that  we  get  the  same  answer  whichever  row  or  column  we  use. 


► 


Example  1.  Let  us  evaluate  the 
We  get 


determinant  in  (3.3)  using  elements  of  the  third  column. 


1  -5 
7  3 

2  1 


-5 

1 


-5 

3 


=  2-1  -4-11  +  5-38  =  148. 


As  a  check,  using  elements  of  the  first  row,  we  get 


1 


3 

1 


3 

1 


11  +  135  +  2  =  148. 


The  method  of  evaluating  a  determinant  which  we  have  described  here  is  one 
form  of  Laplace’s  development  of  a  determinant.  If  the  determinant  is  of  fourth  order 
(or  higher),  using  the  Laplace  development  once  gives  us  a  set  of  determinants  of 
order  one  less  than  we  started  with;  then  we  use  the  Laplace  development  all  over 
again  to  evaluate  each  of  these,  and  so  on  until  we  get  determinants  of  second  order 
which  we  know  how  to  evaluate.  This  is  obviously  a  lot  of  work!  We  will  see  below 
how  to  simplify  the  calculation.  A  word  of  warning  to  anyone  who  has  learned  a 
special  method  of  evaluating  a  third-order  determinant  by  recopying  columns  to  the 
right  and  multiplying  along  diagonals:  this  method  does  not  work  for  fourth  order 
(and  higher). 
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Useful  Facts  About  Determinants  We  state  these  facts  without  proof.  (See 
algebra  books  for  proofs.) 


1.  If  each  element  of  one  row  (or  one  column)  of  a  determinant  is  multiplied 
by  a  number  k,  the  value  of  the  determinant  is  multiplied  by  k. 

2.  The  value  of  a  determinant  is  zero  if 

(a)  all  elements  of  one  row  (or  column)  are  zero;  or  if 

(b)  two  rows  (or  two  columns)  are  identical;  or  if 

(c)  two  rows  (or  two  columns)  are  proportional. 

3.  If  two  rows  (or  two  columns)  of  a  determinant  are  interchanged,  the  value 
of  the  determinant  changes  sign. 

4.  The  value  of  a  determinant  is  unchanged  if 

(a)  rows  are  written  as  columns  and  columns  as  rows;  or  if 

(b)  we  add  to  each  element  of  one  row,  k  times  the  corresponding  element 
of  another  row,  where  k  is  any  number  (and  a  similar  statement  for 
columns) . 


Let  us  look  at  a  few  examples  of  the  use  of  these  facts. 


Example  2.  Find  the  equation  of  a  plane  through  the  three  given  points  (0, 0,  0),  (1,  2, 5), 
and  (2,  —1, 0). 

We  shall  verify  that  the  answer  in  determinant  form  is 


x  y  z 

0  0  0 

1  2  5 

2—10 


1 

1 

1 

1 


=  0. 


By  a  Laplace  development  using  elements  of  the  first  row,  we  would  find  that  this 
is  a  linear  equation  in  x,  y,  z\  thus  it  represents  a  plane.  We  need  now  to  show 
that  the  three  points  are  in  the  plane.  Suppose  ( x,y,z )  =  (0,0,0);  then  the  first 
two  rows  of  the  determinant  are  identical  and  by  Fact  2b  the  determinant  is  zero. 
Similarly  if  the  point  (x,  y,  z)  is  either  of  the  other  given  points,  two  rows  of  the 
determinant  are  identical  and  the  determinant  is  zero.  Thus  all  three  points  lie  in 
the  plane. 


Example  3.  Evaluate  the  determinant 


0  a  —b 
—a  0  c 
b  — c  0 


D  = 


92  Linear  Algebra 


Chapter  3 


If  we  interchange  rows  and  columns  in  D ,  then  by  Facts  4a  and  1  we  have 


0 

—a 

b 

0 

a 

- b 

D  = 

a 

0 

— c 

=  (-1)3 

—a 

0 

c 

- b 

c 

0 

b 

— c 

0 

where  in  the  last  step  we  have  factored  —1  out  of  each  column  by  Fact  1.  Thus  we 
have  D  =  —D,  so  D  =  0. 

We  can  use  Facts  1  to  4  to  simplify  finding  the  value  of  a  determinant.  First 
we  check  Facts  2a,  2b,  2c,  in  case  the  determinant  is  trivially  equal  to  zero.  Then 
we  try  to  get  as  many  zeros  as  possible  in  some  row  or  column  in  order  to  have 
fewer  terms  in  the  Laplace  development.  We  look  for  rows  (or  columns)  which  can 
be  combined  (using  Fact  4b)  to  give  zeros.  Although  this  is  something  like  row 
reduction,  we  can  operate  with  columns  as  well  as  rows.  However,  we  can’t  just 
cancel  a  number  from  a  row  (or  column);  by  Fact  1  we  must  keep  it  as  a  factor  in 
our  answer.  And  we  must  keep  track  of  any  row  (or  column)  interchanges  since  by 
Fact  3  each  interchange  multiplies  the  determinant  by  (— 1). 


Example  4.  Evaluate  the  determinant 


D  = 


4  3 
9  7 
4  0 
3  -1 


0  1 
2  3 
2  1 
4  0 


Subtract  4  times  the  fourth  column  from  the  first  column,  and  subtract  2  times  the 
fourth  column  from  the  third  column  to  get: 


0  3 


3  -1 


-2  1 
-4  3 
0  1 
4  0 


Do  a  Laplace  development  using  the  third  row: 


(3.5) 


D  =  (~  1) 


0 

-3 

3 


3  -2 
7  -4 
-1  4 


Add  the  second  row  to  the  third  row: 


D  =  (~  1) 


0 

-3 

0 


3  -2 
7  -4 
6  0 


Do  a  Laplace  development  using  the  first  column: 


D=(-l)(-l)(-3) 


-2 

0 


(— 3)[0  —  6( — 2)]  =  -36. 


This  is  the  answer  but  you  might  like  to  look  for  some  shorter  solutions.  For  exam¬ 
ple,  consider  the  determinant  (3.5)  above.  If  we  immediately  do  another  Laplace 
development  using  the  first  row,  the  minor  of  3  in  the  first  row,  second  column  is 


-3  -4 
3  4 
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Without  even  evaluating  it,  we  should  recognize  by  Fact  2c  that  it  is  zero.  Then 
proceeding  with  the  Laplace  development  of  (3.5)  using  the  first  row  gives  just 


£>  =  (—!)(— 2) 


-3 

3 


7 

-1 


2(3  —  21)  =  —36  as  above. 


Now  you  may  be  wondering  why  you  should  learn  about  this  when  your  computer 
will  do  it  for  you.  Suppose  you  have  a  determinant  with  elements  which  are  algebraic 
expressions,  and  you  want  to  write  it  in  a  different  form.  Then  you  need  to  know 
what  manipulations  you  can  do  without  changing  its  value.  Also,  if  you  know  the 
rules,  you  may  see  that  a  determinant  is  zero  without  evaluating  it.  An  easy  way 
to  learn  these  things  is  to  evaluate  some  simple  numerical  determinants  by  hand. 


Cramer’s  Rule  This  is  a  formula  in  terms  of  determinants  for  the  solution  of  n 
linear  equations  in  n  unknowns  when  there  is  exactly  one  solution.  As  we  said  for 
row  reduction  and  for  evaluating  determinants,  your  computer  will  quickly  give  you 
the  solution  of  a  set  of  linear  equations  when  there  is  one.  However,  for  theoretical 
purposes,  we  need  the  Cramer’s  rule  formula,  and  a  simple  way  to  learn  about  it  is 
to  use  it  to  solve  sets  of  linear  equations  with  numerical  coefficients. 

Let  us  first  show  the  use  of  Cramer’s  rule  to  solve  two  equations  in  two  unknowns. 
Then  we  will  generalize  it  to  n  equations  in  n  unknowns.  Consider  the  set  of 
equations 


(3.6) 


f  a\x  +  biy  =  ci, 
\  a2x  +  b2y  =  c2. 


If  we  multiply  the  first  equation  by  b2 ,  the  second  by  b\,  and  then  subtract  the 
results  and  solve  for  x,  we  get  (if  a\b2  —  a2b\  ^  0) 


(3.7a) 


ci 62  -  c2bi 
a\b2  —  a2b\ 


Solving  for  y  in  a  similar  way,  we  get 


(3.7b) 


a\c2  —  a2Ci 
a\b2  —  a2bi 


Using  the  definition  (3.1)  o 

(3.7)  of  (3.6)  in  the  form 

(3.8)  x 


a  second  order  determinant, 


Cl 

bi 

ai 

Cl 

c2 

^2 

a2 

c2 

a\ 

bi 

■?  y  — 

ai 

bi 

a2 

b2 

a2 

b2 

we  can  write  the  solutions 


It  is  helpful  in  remembering  (3.8)  to  say  in  words  how  we  find  the  correct  determi¬ 
nants.  First,  the  equations  must  be  written  in  standard  form  as  for  row  reduction 
(Section  2).  Then  if  we  simply  write  the  array  of  coefficients  on  the  left-hand  side 
of  (3.6),  these  form  the  denominator  determinant  in  (3.8).  This  determinant  (which 
we  shall  denote  by  D)  is  called  the  determinant  of  the  coefficients.  To  find  the  nu¬ 
merator  determinant  for  x,  start  with  D,  erase  the  x  coefficients  ai  and  a2,  and 
replace  them  by  the  constants  ci  and  c2  from  the  right-hand  sides  of  the  equa¬ 
tions.  Similarly,  we  replace  the  y  coefficients  in  D  by  the  constant  terms  to  find  the 
numerator  determinant  in  y. 
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Example  5. 


Use  (3.8)  to  solve  the  set  of  equations 

2x  +  3  y 
x  -  2y 


We  find 


D 

x 

y 


2 

1 

1 

D 

1 

D 


3 

5 

2 

1 


3 

2 

3 

-2 

3 

5 


-4-3  =  -7, 

-6-15 
-7  " 

i° -3  _ 

"  -7 


This  method  of  solution  of  a  set  of  linear  equations  is  called  Cramer’s  rule.  It 
may  be  used  to  solve  n  equations  in  n  unknowns  if  D  ^  0;  the  solution  then 
consists  of  one  value  for  each  unknown.  The  denominator  determinant  D  is  the 
n  by  n  determinant  of  the  coefficients  when  the  equations  are  arranged  in  standard 
form.  The  numerator  determinant  for  each  unknown  is  the  determinant  obtained 
by  replacing  the  column  of  coefficients  of  that  unknown  in  D  by  the  constant 
terms  from  the  right-hand  sides  of  the  equations.  Then  to  find  the  unknowns,  we 
must  evaluate  each  of  the  determinants  and  divide. 


Rank  of  a  Matrix  Here  is  another  way  to  find  the  rank  of  a  matrix  (Section  2). 
A  submatrix  means  a  matrix  remaining  if  we  remove  some  rows  and/or  remove 
some  columns  from  the  original  matrix.  To  find  the  rank  of  a  matrix,  we  look  at 
all  the  square  submatrices  and  find  their  determinants.  The  order  of  the  largest 
nonzero  determinant  is  the  rank  of  the  matrix. 


Example  6.  Find  the  rank  of  the  matrix 


We  need  to  look  at  the  four  3  by  3  determinants  containing  columns  1,2,3  or  1,2,4  or 
1,3,4  or  2,3,4.  We  note  that  the  first  two  columns  are  negatives  of  each  other,  so  by 
Fact  2c  the  first  two  of  these  determinants  are  both  zero.  The  last  two  determinants 
differ  only  in  the  sign  of  their  first  column,  so  we  just  have  to  look  at  one  of  them, 
say: 


If  we  now  subtract  twice  the  first  row  from  the  third  row,  we  have 
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and  we  see  by  Fact  2c  that  the  determinant  is  zero.  So  the  rank  of  the  matrix  is 
less  than  3.  To  show  that  it  is  2,  we  just  have  to  find  one  2  by  2  submatrix  with 
nonzero  determinant.  There  are  several  of  them;  find  one.  Thus  the  rank  of  the 
matrix  is  2.  (If  we  had  needed  to  show  that  the  rank  was  1,  we  would  have  had  to 
show  that  all  the  2  by  2  submatrices  had  determinants  equal  to  zero.) 


PROBLEMS,  SECTION  3 


Evaluate  the  determinants  in  Problems  1  to  6  by  the  methods  shown  in  Example  4.  Re¬ 
member  that  the  reason  for  doing  this  is  not  just  to  get  the  answer  (your  computer  can 
give  you  that)  but  to  learn  how  to  manipulate  determinants  correctly.  Check  your  answers 
by  computer. 


1. 


-2  3  4 

3  4-2 

5  6-3 


2. 


5  17  3 
2  4-3 

11  0  2 


1111 
12  3  4 

1  3  6  10 

1  4  10  20 


-2  4  7  3 

8  2-9  5 

-4  6  8  4 

2-9  3  8 


5. 


7  0  1-35 

2-1  0  14 

7-3  2-14 

8  6-2-7  4 

1  3-5  7  5 


6. 


0  1111 
10  111 
110  11 
1110  1 
11110 


7.  Prove  the  following  by  appropriate  manipulations  using  Facts  1  to  4;  do  not  just 
evaluate  the  determinants. 


1 

a 

be 

i 

a 

a2 

i 

a 

a2 

1 

b 

ac 

= 

i 

b 

b2 

CcT 

1 

CJ 

1 

-o 

i 

II 

0 

1 

b  +  a 

1 

c 

ab 

i 

c 

c2 

0 

0 

1 

=  (c  —  a)(b  —  a)(c  —  6). 


8.  Show  that  if,  in  using  the  Laplace  development,  you  accidentally  multiply  the  ele¬ 
ments  of  one  row  by  the  cofactors  of  another  row,  you  get  zero. 

Hint:  Consider  Fact  2b. 

9.  Show  without  computation  that  the  following  determinant  is  equal  to  zero. 

Hint:  Consider  the  effect  of  interchanging  rows  and  columns. 

0  2-3 

-2  0  4 

3-4  0 


10.  A  determinant  or  a  square  matrix  is  called  skew-symmetric  if  aij  =  —a,ji.  (The 
determinant  in  Problem  9  is  an  example  of  a  skew-symmetric  determinant.)  Show 
that  a  skew-symmetric  determinant  of  odd  order  is  zero. 

In  Problems  11  and  12  evaluate  the  determinants. 


0 

5 

-3 

-4 

1 

-5 

0 

2 

6 

-2 

11. 

3 

-2 

0 

-3 

7 

4 

-6 

3 

0 

-3 

-1 

2 

-7 

3 

0 

12. 


0  1  2-1 

-10-3  0 

-2  3  0  1 

10-1  0 
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13.  Show  that 


cos  6  1  0 

1  2  cos  0  1 

0  12  cos  0 


cos  36. 


14.  Show  that  the  n-rowed  determinant 

cos  6  1  0  0 

1  2  cos  6  1  0 

0  1  2  cos  6  1 

0  0  12  cos  6 


0 

0 

0 

0 


=  cos  n6. 


0  0  0  0 


2  cos  6  1 

1  2  cos  6 


Hint:  Expand  using  elements  of  the  last  row  or  column.  Use  mathematical  induction 
and  the  trigonometric  addition  formulas. 

15.  Use  Cramer’s  rule  to  solve  Problems  2.3  and  2.11. 


16.  In  the  following  set  of  equations  (from  a  quantum  mechanics  problem),  A  and  B  are 
the  unknowns,  k  and  K  are  given,  and  i  =  y/—l.  Use  Cramer’s  rule  to  find  A  and 
show  that  |  A  | 2  =  1. 

f  A  —  B  —  —1 
1  ikA  -  KB  =  ik 


17.  Use  Cramer’s  rule  to  solve  for  x  and  f  the  Lorentz  equations  of  special  relativity: 


f  x'  =  j(x  —  vt) 

(  t!  =  7 (t  —  vx/c2) 


where  72(1  —  v2 /(?)  =  1 


Caution:  Arrange  the  equations  in  standard  form. 

18.  Find  2  by  Cramer’s  rule: 


(a  —  b)x  — 
(a  +  2  b)x  — 
bx  + 


( a  —  b)y  +  3  b2z 

(a  +  2b)y  —  (3ab  +  3b2)z 
ay  —  (2b2  +  a2)z 


3  ab 
3b2 
0 


4.  VECTORS 


Notation  We  shall  indicate  a  vector  by  a  boldface  letter  (for 
example,  A)  and  a  component  of  a  vector  by  a  subscript  (for 
example  Ax  is  the  x  component  of  A),  as  in  Figure  4.1.  Since  it 
is  not  easy  to  handwrite  boldface  letters,  you  should  write  a  vec¬ 
tor  with  an  arrow  over  it  (for  example,  A) .  It  is  very  important 
to  indicate  clearly  whether  a  letter  represents  a  vector,  since, 
as  we  shall  see  below,  the  same  letter  in  italics  (not  boldface) 
is  often  used  with  a  different  meaning. 


Magnitude  of  a  Vector  The  length  of  the  arrow  representing  a  vector  A  is 
called  the  length  or  the  magnitude  of  A  (written  |A|  or  A)  or  (see  Section  10)  the 
norm  of  A  (written  ||A||).  Note  the  use  of  A  to  mean  the  magnitude  of  A;  for  this 
reason  it  is  important  to  make  it  clear  whether  you  mean  a  vector  or  its  magnitude 
(which  is  a  scalar).  By  the  Pythagorean  theorem,  we  find 


Section  4 


Vectors  97 


(4.1) 


A  = 


in  two  dimensions,  or 


A  = 


in  three  dimensions. 


Example  1. 

3  lb. 


In  Figure  4.2  the  force  F  has  an  x  component  of  4  lb  and  a  y  component  of 
Then  we  write 


Fx  =  4  lb, 

Fy  =  3  lb, 

|F|  =  5  lb, 

9  =  arc  tan  | . 


Addition  of  Vectors  There  are  two  ways  to  get  the  sum  of  two  vectors.  One  is 
by  the  parallelogram  law:  To  find  A  +  B,  place  the  tail  of  B  at  the  head  of  A  and 
draw  the  vector  from  the  tail  of  A  to  the  head  of  B  as  shown  in  Figures  4.3  and  4.4. 


The  second  way  of  finding  A  +  B  is  to  add  components:  A  +  B  has  components 
Ax  +  Bx  and  Ay  +  By.  You  should  satisfy  yourself  from  Figure  4.3  that  these  two 
methods  of  finding  A  +  B  are  equivalent.  From  Figure  4.4  and  either  definition  of 
vector  addition,  it  follows  that 

A  +  B  =  B  +  A  (commutative  law  for  addition); 

(A  +  B)  +  C  =  A  +  (B  +  C)  (associative  law  for  addition). 

In  other  words,  vectors  may  be  added  together  by  the  usual  laws  of  algebra. 

It  seems  reasonable  to  use  the  symbol  3 A  for  the  vector  A  +  A  +  A.  By  the 
methods  of  vector  addition  above,  we  can  say  that  the  vector  A  +  A  +  A  is  a  vector 
three  times  as  long  as  A  and  in  the  same  direction  as  A  and  that  each  component 
of  3 A  is  three  times  the  corresponding  component  of  A.  As  a  natural  extension  of 
these  facts  we  define  the  vector  cA  (where  c  is  any  real  positive  number)  to  be  a 
vector  c  times  as  long  as  A  and  in  the  same  direction  as  A;  each  component  of  cA 
is  then  c  times  the  corresponding  component  of  A  (Figure  4.5). 

The  negative  of  a  vector  is  defined  as  a  vector  of  the  same  magnitude  but  in 
the  opposite  direction.  Then  (Figure  4.6)  each  component  of  B  is  the  negative  of 
the  corresponding  component  of  B.  We  can  now  define  subtraction  of  vectors  by 
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Figure  4.5 


saying  that  A  —  B  means  the  sum  of  the  vectors  A  and  — B.  Each  component  of 
A  —  B  is  then  obtained  by  subtracting  the  corresponding  components  of  A  and  B, 
that  is,  (A  —  B)x  =  Ax  —  Bx,  etc.  Like  addition,  subtraction  of  vectors  can  be 
done  geometrically  (by  the  parallelogram  law)  or  algebraically  by  subtracting  the 
components  (Figure  4.6). 

The  zero  vector  (which  might  arise  asA  =  B  —  B  =  0,  orasA  =  cB  with  c  =  0) 
is  a  vector  of  zero  magnitude;  its  components  are  all  zero  and  it  does  not  have  a 
direction.  A  vector  of  length  or  magnitude  1  is  called  a  unit  vector.  Then  for  any 
A/0,  the  vector  A/|A|  is  a  unit  vector.  In  Example  1,  F/5  is  a  unit  vector. 

We  have  just  seen  that  there  are  two  ways  to  combine  vectors:  geometric  (head 
to  tail  addition) ,  and  algebraic  (using  components) .  Let  us  look  first  at  an  example 
of  the  geometric  method;  then  we  shall  consider  the  algebraic  method.  Example  2 
below  illustrates  the  geometric  method.  By  similar  proofs,  many  of  the  facts  of 
elementary  geometry  can  be  easily  proved  using  vectors,  with  no  reference  to  com¬ 
ponents  or  a  coordinate  system.  (See  Problems  3  to  8.) 


Example  2.  Prove  that  the  medians  of  a  triangle  intersect  at  a  point  two-thirds  of  the 
way  from  any  vertex  to  the  midpoint  of  the  opposite  side. 

To  prove  this,  we  call  two  of  the  sides  of  the  triangle  A  and  B.  The  third  side 
of  the  triangle  is  then  A  +  B  by  the  parallelogram  law,  with  the  directions  of  A, 
B,  and  A  +  B  as  indicated  in  Figure  4.7.  If  we  add  the  vector  ^B  to  the  vector  A 
(head  to  tail  as  in  Figure  4.7b),  we  have  a  vector  from  point  O  to  the  midpoint  of 
the  opposite  side  of  the  triangle,  that  is,  we  have  the  median  to  side  B.  Next,  take 
two-thirds  of  this  vector;  we  now  have  the  vector  |(A+  ^B)  =  |A+  -|B  extending 
from  O  to  P  in  Figure  4.7b.  We  want  to  show  that  P  is  the  intersection  point  of 
the  three  medians  and  also  the  “  |  point”  for  each.  We  prove  this  by  showing  that 
P  is  the  “  |  point”  on  the  median  to  side  A;  then  since  A  and  B  represent  any  two 
sides  of  the  triangle,  the  proof  holds  for  all  three  medians.  The  vector  from  RtoQ 
(Figure  4.7c)  is  ^A  +  B;  this  is  the  median  to  A.  The  “  |  point”  on  this  median  is 
the  point  P'  (Figure  4.7d);  the  vector  from  R  to  P'  is  equal  to  |(|A  +  B).  Then 
the  vector  from  O  to  P'  is  ^A  +  ^(^A  +  B)  =  §A  +  ^B.  Thus  P  and  P'  are  the 
same  point  and  all  three  medians  have  their  “  |  points”  there.  Note  that  we  have 
made  no  reference  to  a  coordinate  system  or  to  components  in  this  proof. 
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6.  In  any  quadrilateral  (four-sided  figure  with  sides  of  various  lengths  and-  in  general — 
four  different  angles),  the  lines  joining  the  midpoints  of  opposite  sides  bisect  each 
other.  Hint:  Label  three  sides  A,  B,  C;  what  is  the  vector  along  the  fourth  side? 

7.  A  line  through  the  midpoint  of  one  side  of  a  triangle  and  parallel  to  a  second  side 
bisects  the  third  side.  Hint:  Call  parallel  vectors  A  and  cA. 

8.  The  median  of  a  trapezoid  (four-sided  figure  with  just  two  parallel  sides)  means  the 
line  joining  the  midpoints  of  the  two  nonparallel  sides.  Prove  that  the  median  bisects 
both  diagonals;  that  the  median  is  parallel  to  the  two  parallel  bases  and  equal  to 
half  the  sum  of  their  lengths. 

We  have  discussed  in  some  detail  the  geometric  method  of  adding  vectors  (par¬ 
allelogram  law  or  head  to  tail  addition)  and  its  importance  in  stating  and  proving 
geometric  and  physical  facts  without  the  intrusion  of  a  special  coordinate  system. 
There  are,  however,  many  cases  in  which  algebraic  methods  (using  components 
relative  to  a  particular  coordinate  system)  are  better.  We  shall  discuss  this  next. 

Vectors  in  Terms  of  Components  We  consider  a  set  of  rectangular  axes  as  in 
Figure  4.8.  Let  the  vector  i  be  a  unit  vector  in  the  positive  x  direction  (out  of  the 
paper  toward  you),  and  let  j  and  k  be  unit  vectors  in  the  positive  y  and  z  directions. 
If  Ax  and  Ay  are  the  scalar  components  of  a  vector  in  the  ( x ,  y )  plane,  then  i Ax 
and  jAy  are  its  vector  components,  and  their  sum  is  the  vector  A  (Figure  4.9). 

A  =  i  Ax  +  j  Ay. 


Similarly,  in  three  dimensions 


Figure  4.9 


A  —  i  Ax  +  j  Ay  +  k  Az. 

It  is  easy  to  add  (or  subtract)  vectors  in  this  form:  If  A  and  B  are  vectors  in  two 
dimensions,  then 

A  +  B  =  (i  Ax  +  j  Ay)  +  (i  Bx  +  j  By)  =  i  {Ax  +  Bx )  +  j  (Ay  +  By). 

This  is  just  the  familiar  result  of  adding  components;  the  unit  vectors  i  and  j  serve  to 
keep  track  of  the  separate  components  and  allow  us  to  write  A  as  a  single  algebraic 
expression.  The  vectors  i,  j,  k  are  called  unit  basis  vectors. 

Multiplication  of  Vectors  There  are  two  kinds  of  product  of  two  vectors.  One, 
called  the  scalar  product  (or  dot  product  or  inner  product) ,  gives  a  result  which  is  a 
scalar;  the  other,  called  the  vector  product  (or  cross  product ),  gives  a  vector  answer. 
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Scalar  Product  By  definition,  the  scalar  product  of  A  and  B  (written  A  •  B)  is 
a  scalar  equal  to  the  magnitude  of  A  times  the  magnitude  of  B  times  the  cosine  of 
the  angle  6  between  A  and  B: 


You  should  observe  from  (4.2)  that  the  commutative  law  (4.3)  holds  for  scalar  mul¬ 
tiplication: 

(4.3)  A  •  B  =  B  •  A. 

A  useful  interpretation  of  the  dot  product  is  shown  in  Figure  4.10. 

|B|  =  8,  |A|  =  6. 

Projection  of  B  on  A  =  4; 

A  •  B  =  6  •  4  =  24. 

Or,  projection  of  A  on  B  =  3; 

B  •  A  =  3  •  8  =  24. 

Figure  4.10 

Since  |B|  cos  9  is  the  projection  of  B  on  A,  we  can  write 

(4.4)  A  •  B  =  |A|  times  (projection  of  B  on  A), 
or,  alternatively, 

A  •  B  =  |B|  times  (projection  of  A  on  B). 

Also  we  find  from  (4.2)  that 

(4.5)  A  ■  A  =  | A j 2 cosO°  =  | A | 2  =  A2. 

Sometimes  A2  is  written  instead  of  |A|2  or  A2;  you  should  understand  that  the 
square  of  a  vector  always  means  the  square  of  its  magnitude  or  its  dot  product  with 
itself. 


Figure  4.11 
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From  Figure  4.11  we  can  see  that  the  projection  of  B  +  C  on  A  is  equal  to  the 
projection  of  B  on  A  plus  the  projection  of  C  on  A.  Then  by  (4.4) 

(4.6)  A  ■  (B  +  C)  =  | A |  times  (projection  of  (B  +  C)  on  A) 

=  |  A j  times  (projection  of  B  on  A  +  projection  of  C  on  A) 

=  AB  +  AC. 

This  is  the  distributive  law  for  scalar  multiplication.  By  (4.3)  we  get  also 

(4.7)  (B  +  C)  A  =  BA  +  CA  =  AB  +  AC. 

The  component  form  of  A  •  B  is  very  useful.  We  write 

(4.8)  A  •  B  =  (i Ax  +  j Ay  +  k Az)  •  (i Bx  +  j By  +  k Bz). 

By  the  distributive  law  we  can  multiply  this  out  getting  nine  terms  such  as  AXBX i •  i, 
AxByi  •  j,  and  so  on.  Using  the  definition  of  the  scalar  product,  we  find 

i  •  i  =  |i|  •  |i|  cos  0°  =  1  •  1  •  1  =  1,  and  similarly,  j  •  j  =  1,  k  •  k  =  1; 

i  •  j  =  |i|  •  |j|  cos 90°  =  1  •  1  •  0  =  0,  and  similarly,  i  •  k  =  0,  j  •  k  =  0. 

Using  (4.9)  in  (4.8),  we  get 


(4.10) 


A  •  B  —  AXBX  +  Ay  By  +  AZBZ. 


Equation  (4.10)  is  an  important  formula  which  you  should  memorize.  There  are  sev¬ 
eral  immediate  uses  of  this  formula  and  of  the  dot  product. 

Angle  Between  Two  Vectors  Given  the  vectors,  we  can  find  the  angle  between 
them  by  using  both  (4.2)  and  (4.10)  and  solving  for  cos6L 

Example  3.  Find  the  angle  between  the  vectors  A  =  3i  +  6j  +  9k  and  B 
By  (4.2)  and  (4.10)  we  get 

A  •  B  =  |A|  |B|  cos  9  =  3  •  (-2)  +  6-3  +  9-1  =  21, 

(4.11)  |A|  =  \/32  +  62  +  92  =  3  VTi,  |  B  |  =  ^22  +  32  +  l2 

3Vl4:VT4:Cos9  =  21,  cos0  =  ^,  0  =  60°. 

Perpendicular  and  Parallel  Vectors  If  two  vectors  are  perpendicular,  then 
cos  0  =  0;  thus 


—  — 2i  +  3j  +  k. 


=  Vu, 


(4.12)  AXBX  +  AyBy  +  AZBZ  =  0  if  A  and  B  are  perpendicular  vectors. 


If  two  vectors  are  parallel,  their  components  are  proportional;  thus  (when  no  com¬ 
ponents  are  zero) 
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(4.13) 

Bx 

ii 

tal  ^ 

|<«£ 

Ax 

Bz 

if  A  and  B  are  parallel  vectors. 

(Of  course,  if  Bx  =  0,  then  Ax  =  0,  etc.) 

Vector  Product  The  vector  or  cross  product  of  A  and  B  is  written  A  X  B.  By 
definition,  A  X  B  is  a  vector  whose  magnitude  and  direction  are  given  as  follows: 


The  magnitude  of  A  X  B  is 

(4.14)  |A  X  B|  =  |A|  |B|sin0, 

where  6  is  the  positive  angle  (<  180°)  between  A  and  B. 
The  direction  of  A  X  B  is  perpendicular  to  the  plane  of  A 
and  B  and  in  the  sense  C  of  advance  of  a  right-handed  screw 
rotated  from  A  to  B  as  in  Figure  4.12. 


C  =  A*B 


It  is  convenient  to  find  the  direction  of  C  =  A  X  B  by  the  following  right-hand  rule. 
Think  of  grasping  the  line  C  (or  a  screwdriver  driving  a  right-handed  screw  in  the 
direction  C)  with  the  right  hand.  The  fingers  then  curl  in  the  direction  of  rotation 
of  A  into  B  (arrow  in  Figure  4.12)  and  the  thumb  points  along  C  =  A  X  B. 

Perhaps  the  most  startling  result  of  the  vector  product  definition  is  that  A  X  B 
and  B  X  A  are  not  equal;  in  fact,  A  X  B  =  — B  X  A.  In  mathematical  language, 
vector  multiplication  is  not  commutative. 

We  find  from  (4.14)  that  the  cross  product  of  any  two  parallel  (or  antiparallel) 
vectors  has  magnitude  |A  X  B|  =  AB  sin0°  =  0  (or  ABsinl80°  =  0).  Thus 


(4.15) 

A  x  B  =  0 

if  A  and  B  are  parallel  or  antiparallel, 

A  x  A  =  0 

for  any  A. 

Then  we 

have  the  useful  results 

(4.16) 

ixi  =  jxj  =  kxk  =  0. 

Also  from  (4.14)  we  find 


|i  X  j|  =  |i|  |j|  sin 90°  =  1  •  1  •  1  =  1, 


and  similarly  for  the  magnitude  of  the  cross  product  of  any  two  different  unit  vectors 
i,  j,  k.  From  the  right-hand  rule  and  Figure  4.13,  we  see  that  the  direction  of  i  X  j 
is  k,  and  since  its  magnitude  is  1,  we  have  i  X  j  =  k;  however,  j  X  i  =  —  k.  Similarly 
evaluating  the  other  cross  products,  we  find 
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i  X  j  =  k  j  xk  =  i  k  x  i  =  j. 

jxi  =  ~k  k  X  j  =  -i  i  X  k  =  -j. 


A  good  way  to  remember  these  is  to  write  them  cyclically  (around  a  circle  as  in¬ 
dicated  in  Figure  4.14).  Reading  around  the  circle  counterclockwise  (positive  9 
direction) ,  we  get  the  positive  products  (for  example,  i  X  j  =  k) ;  reading  the  other 
way  we  get  the  negative  products  (for  example,  i  X  k  =  —  j). 


It  is  well  to  note  here  that  the  results  (4.17)  depend  upon  the  way  we  have  labeled 
the  axes  in  Figure  4.13.  We  have  arranged  the  (x,  y ,  z)  axes  so  that  a  rotation  of  the 
x  into  the  y  axis  (through  90°)  corresponds  to  the  rotation  of  a  right-handed  screw 
advancing  in  the  positive  2  direction.  Such  a  coordinate  system  is  called  a  right- 
handed  system.  If  we  used  a  left-handed  system  (say  exchanging  x  and  y),  then  all 
the  equations  in  (4.17)  would  have  their  signs  changed.  This  would  be  confusing; 
consequently,  we  practically  always  use  right-handed  coordinate  systems,  and  we 
must  be  careful  about  this  in  drawing  diagrams.  (See  Chapter  10,  Section  6.) 

To  write  A  X  B  in  component  form  we  need  the  distributive  law,  namely 

(4.18)  Ax(B  +  C)  =  AxB  +  AxC. 

(see  Problem  7.18). 

Then  we  find 


The  second  line  in  (4.19)  is  obtained  by  multiplying  out  the  first  line  (getting  nine 
products)  and  using  (4.16)  and  (4.17).  The  determinant  in  (4.19)  is  the  most  con¬ 
venient  way  to  remember  the  component  form  of  the  vector  product.  You  should 
verify  that  multiplying  out  the  determinant  using  the  elements  of  the  first  row  gives 
the  result  in  the  line  above  it. 


Section  4 


Vectors  105 


Since  A  X  B  is  a  vector  perpendicular  to  A  and  to  B,  we  can  use  (4.19)  to  find 
a  vector  perpendicular  to  two  given  vectors. 


Example  4.  Find  a  vector  perpendicular  to  both  A  =  2i  +  j  —  k  and  B  =  i  +  3 j  2k. 


A  X  B 


1  j  k 

2  1-1 
13-2 


i(— 2  +  3)  —  j(— 4  +  1)  +  k(6  —  1) 


=  i  +  3j  +  5k. 


PROBLEMS,  SECTION  4 

9.  Let  A  =  2i  +  3j  and  B  =  4i  —  4j.  Show  graphically,  and  find  algebraically,  the 
vectors  —A,  3B,  A  —  B,  B  +  2A,  1(A  +  B). 

10.  If  A  +  B  =  4j  —  i  and  A  —  B  =  i  +  3j,  find  A  and  B  algebraically.  Show  by  a 
diagram  how  to  find  A  and  B  geometrically. 

11.  Let  3i  —  j  +  4k,  7j  — 2k,  i  — 3j  +  k  be  three  vectors  with  tails  at  the  origin.  Then  their 
heads  determine  three  points  A,  B,  C  in  space  which  form  a  triangle.  Find  vectors 
representing  the  sides  AB,  BC,  CA  in  that  order  and  direction  (for  example,  A  to 
B,  not  B  to  A)  and  show  that  the  sum  of  these  vectors  is  zero. 

12.  Find  the  angle  between  the  vectors  A  =  — 2i  +  j  —  2k  and  B  =  2i  —  2j. 

13.  If  A  =  4i  —  3k  and  B  =  — 2i  +  2 j  —  k,  find  the  scalar  projection  of  A  on  B,  the 
scalar  projection  of  B  on  A,  and  the  cosine  of  the  angle  between  A  and  B. 

14.  Find  the  angles  between  (a)  the  space  diagonals  of  a  cube;  (b)  a  space  diagonal  and 
an  edge;  (c)  a  space  diagonal  and  a  diagonal  of  a  face. 

15.  Let  A  =  2i  —  j  +  2k.  (a)  Find  a  unit  vector  in  the  same  direction  as  A.  Hint: 
Divide  A  by  |Aj.  (b)  Find  a  vector  in  the  same  direction  as  A  but  of  magnitude  12. 
(c)  Find  a  vector  perpendicular  to  A.  Hint:  There  are  many  such  vectors;  you  are 
to  find  one  of  them,  (d)  Find  a  unit  vector  perpendicular  to  A.  See  hint  in  (a). 

16.  Find  a  unit  vector  in  the  same  direction  as  the  vector  A  =  4i  —  2j  +  4k,  and  another 

unit  vector  in  the  same  direction  as  B  =  — 4i  +  3k.  Show  that  the  vector  sum  of 

these  unit  vectors  bisects  the  angle  between  A  and  B.  Hint:  Sketch  the  rhombus 
having  the  two  unit  vectors  as  adjacent  sides. 

17.  Find  three  vectors  (none  of  them  parallel  to  a  coordinate  axis)  which  have  lengths 
and  directions  such  that  they  could  be  made  into  a  right  triangle. 

18.  Show  that  2i  —  j  +  4k  and  5i  +  2j  —  2k  are  orthogonal  (perpendicular).  Find  a  third 
vector  perpendicular  to  both. 

19.  Find  a  vector  perpendicular  to  both  i  —  3j  +  2k  and  5i  —  j  —  4k. 

20.  Find  a  vector  perpendicular  to  both  i  +  j  and  i  —  2k. 

21.  Show  that  B|A|  +  A|Bj  and  A|Bj  —  B|A|  are  orthogonal. 

22.  Square  (A  +  B);  interpret  your  result  geometrically.  Hint:  Your  answer  is  a  law 
which  you  learned  in  trigonometry. 

23.  If  A  =  2i  —  3j  +  k  and  A  ■  B  =  0,  does  it  follow  that  B  =  0?  (Either  prove  that  it 
does  or  give  a  specific  example  to  show  that  it  doesn’t.)  Answer  the  same  question 
if  A  X  B  =  0.  And  again  answer  the  same  question  if  A  ■  B  =  0  and  A  X  B  =  0. 
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24.  What  is  the  value  of  (A  X  B)2  +  (A  ■  B)2?  Comment:  This  is  a  special  case  of 
Lagrange’s  identity.  (See  Chapter  6,  Problem  3.12b,  page  284.) 

Use  vectors  as  in  Problems  3  to  8,  and  also  the  dot  and  cross  product,  to  prove  the  following 
theorems  from  geometry. 

25.  The  sum  of  the  squares  of  the  diagonals  of  a  parallelogram  is  equal  to  twice  the  sum 
of  the  squares  of  two  adjacent  sides  of  the  parallelogram. 

26.  The  median  to  the  base  of  an  isosceles  triangle  is  perpendicular  to  the  base. 

27.  In  a  kite  (four-sided  figure  made  up  of  two  pairs  of  equal  adjacent  sides),  the  diag¬ 
onals  are  perpendicular. 

28.  The  diagonals  of  a  rhombus  (four-sided  figure  with  all  sides  of  equal  length)  are 
perpendicular  and  bisect  each  other. 

5.  LINES  AND  PLANES 

A  great  deal  of  analytic  geometry  can  be  simplified  by  the  use  of  vector  notation. 
Such  things  as  equations  of  lines  and  planes,  and  distances  between  points  or  be¬ 
tween  lines  and  planes  often  occur  in  physics  and  it  is  very  useful  to  be  able  to 
find  them  quickly.  We  shall  talk  about  three-dimensional  space  most  of  the  time 
although  the  ideas  apply  also  to  two  dimensions.  In  analytic  geometry  a  point  is 
a  set  of  three  coordinates  ( x,y,z );  we  shall  think  of  this  point  as  the  head  of  a 
vector  r  =  ia;  +  jy  +  \tz  with  tail  at  the  origin.  Most  of  the  time  the  vector  will 
be  in  the  background  of  our  minds  and  we  shall  not  draw  it;  we  shall  just  plot  the 
point  (. x,y,z )  which  is  the  head  of  the  vector.  In  other  words,  the  point  ( x,y,z ) 
and  the  vector  r  will  be  synonymous.  We  shall  also  use  vectors  joining  two  points. 
In  Figure  5.1  the  vector  A  from  (1, 2,  3)  to  (a;,  y,  z )  is 

A  =  r  —  C  =  (x,  y,  z)  —  (1,  2, 3)  =  (a;  -  1,  y  -  2,  z  -  3)  or 
A  =  ia:  +  jy  -I-  kz  —  (i  +  2j  +  3k)  =  i(a;  —  1)  +  j(y  —  2)  +  k (z  —  3). 


Figure  5.1 


Thus  we  have  two  ways  of  writing  vector  equations;  we  may  choose  the  one  we  prefer. 
Note  the  possible  advantage  of  writing  (1,  0,  —2)  for  i— 2k;  since  the  zero  is  explicitly 
written,  there  is  less  chance  of  accidentally  confusing  i  —  2k  with  i  —  2j  =  (1,  —2,  0). 
On  the  other  hand,  5j  is  simpler  than  (0,5,0). 

In  two  dimensions,  we  write  the  equation  of  a  straight  line  through  (a:o,  yo)  with 
slope  m  as 

y-  yo 

- =  m. 

x  —  Xq 


(5.1) 
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Suppose,  instead  of  the  slope,  we  are  given  a  vector  in  the  direction  of  the  line, 
say  A  =  ia+j&  (Figure  5.2).  Then  the  line  through  (xo,  yo)  and  in  the  direction  A  is 
determined  and  we  should  be  able  to  write  its  equation.  The  directed  line  segment 
from  ( xo,  yo )  to  any  point  (x,y)  on  the  line  is  the  vector  r  —  ro  with  components 
x  -  x0  and  y  -  y0: 

(5.2)  r-  r0  =  i(a:  -  rr0)  +j(y-  yo)- 


This  vector  is  parallel  to  A  =  ia+jb.  Now  if  two  vectors  are  parallel,  their  compo¬ 
nents  are  proportional.  Thus  we  can  write  (for  a,b  ^  0) 


(5.3) 


X  -  Xo  y  -  yo  y~Vo  b 

-  =  — - -  or  - =  — . 

a  b  x  —  xq  a 


This  is  the  equation  of  the  given  straight  line.  As  a  check  we  see  that  the  slope  of 
the  line  is  m  =  b/a ,  so  (5.3)  is  the  same  as  (5.1). 

Another  way  to  write  this  equation  is  to  say  that  if  r  —  ro  and  A  are  parallel 
vectors,  one  is  some  scalar  multiple  of  the  other,  that  is, 


(5.4) 


r  —  ro  =  At,  or  r  =  r0  +  At, 


where  t  is  the  scalar  multiple.  We  can  think  of  t  as  a  parameter;  the  component 
form  of  (5.4)  is  a  set  of  parametric  equations  of  the  line,  namely 


x  —  xo  =  at,  x  =  Xo  T  at, 

(5.5)  or 

y~yo  =  bt,  y  =  y0  +  bt. 

Eliminating  t  yields  the  equation  of  the  line  in  (5.3). 

In  three  dimensions,  the  same  ideas  can  be  used.  We  want  the  equations  of 
a  straight  line  through  a  given  point  (xo,yo,zo)  and  parallel  to  a  given  vector 
A  =  ai  +  bj  +  ck.  If  (x,  y,  z)  is  any  point  on  the  line,  the  vector  joining  (xo,  yo ,  ~o) 
and  ( x,y,z )  is  parallel  to  A.  Then  its  components  x  —  xo,  y  —  yo,  z  —  Zo  are 
proportional  to  the  components  a,  b,  c  of  A  and  we  have 


(5.6) 


X  —  Xo 
a 


y-y  o 

b 


(symmetric  equations  of  a  straight  line, 
a,  b,  0). 


If  c,  for  instance,  happens  to  be  zero,  we  would  have  to  write  (5.6)  in  the  form 
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The  parametric  equations  (5.8)  have  a  partic¬ 
ularly  useful  interpretation  when  the  parameter 
t  means  time.  Consider  a  particle  m  (electron, 
billiard  ball,  or  star)  moving  along  the  straight 
line  L  in  Figure  5.3.  Position  yourself  at  the  ori¬ 
gin  and  watch  m  move  from  Pq  to  P  along  L. 

Your  line  of  sight  is  the  vector  r;  it  swings  from 
r0  at  t  =  0  to  r  =  r0  +  At  at  time  t.  Note  that 
the  velocity  of  m  is  dr/dt  =  A;  A  is  a  vector 
along  the  line  of  motion.  Figure  5.3 

Going  back  to  two  dimensions,  suppose  we  want  the  equation  of  a  straight  line  L 
through  the  point  (xo,  yo)  and  perpendicular  to  a  given  vector  N  =  ai+&j.  As  above, 
the  vector 

r  -  r0  =  (x  -  x0)i  +  (y  -  y0)j 

lies  along  the  line.  This  time  we  want  this  vector  perpendicular  to  N;  recall  that 
two  vectors  are  perpendicular  if  their  dot  product  is  zero.  Setting  the  dot  product 
of  N  and  r  —  ro  equal  to  zero  gives 

(5.9)  a(x  -  x0)  +  b{y  -  y0)  =  0  or  — — —  = 

x  —  Xo  b 

This  is  the  desired  equation  of  the  straight  line  L  perpendicular  to  N.  As  a  check, 
note  from  Figure  5.4  that  the  slope  of  the  line  L  is 

tan#  =  —  cot  0  =  —a/b. 


Figure  5.4 


Figure  5.5 

In  three  dimensions,  we  use  this  method  to  write  the  equation  of  a  plane.  If 
(x0, 2/0)  Zq)  is  a  given  point  in  the  plane  and  (x,  y,  z )  is  any  other  point  in  the  plane, 


Section  5 


Lines  and  Planes  109 


the  vector  (Figure  5.5) 

r  -  r0  =  (x  -  a;0)i  +  (y  -  y0)i  +  (z  -  z0)k 

is  in  the  plane.  If  N  =  ai  +  5j  +  ck  is  normal  (perpendicular)  to  the  plane,  then  N 
and  r  —  r0  are  perpendicular,  so  the  equation  of  the  plane  is  N  •  (r  —  r0)  =  0,  or 


(5.10) 


a(x 


Xo)  +  b(y  -  j/o )  +  c(z  -  z0)  =  0, 
or  ax  +  by  +  cz  =  d, 


(equation  of  a  plane) 


where  d  =  ax o  +  byo  +  czq. 


If  we  are  given  equations  like  the  ones  above,  we  can  read  backwards  to  find  A 
or  N.  Thus  we  can  say  that  the  equations  (5.6),  (5.7),  and  (5.8)  are  the  equations 
of  a  straight  line  which  is  parallel  to  the  vector  A  =  ai  +  5j  +  ck,  and  either  equation 
in  (5.10)  is  the  equation  of  a  plane  perpendicular  to  the  vector  N  =  ai  +  6j  +  ck. 


Example  1.  Find  the  equation  of  the  plane  through  the  three  points  A(— 1, 1,1),  B( 2,  3, 0), 

<7(0,1, -2). 

A  vector  joining  any  pair  of  the  given  points  lies  in  the  plane.  Two  such  vectors 
are  AB  =  (2,  3, 0)  —  (—1, 1, 1)  =  (3, 2,  —1)  and  AC  =  (1, 0,  —3).  The  cross  product 
of  these  two  vectors  is  perpendicular  to  the  plane.  This  is 


N  =  (AB)  x  (AC) 


i  j  k 

3  2-1 
10-3 


— 6i  +  8 j  —  2k. 


Now  we  write  the  equation  of  the  plane  with  normal  direction  N  through  one  of  the 
given  points,  say  B ,  using  (5.10): 

— 6(x  —  2)  +  8(y  —  3)  —  2z  =  0  or  3a;  —  Ay  +  z  +  6  =  0. 


(Note  that  we  could  have  divided  N  by  —2  to  save  arithmetic.) 


Example  2.  Find  the  equations  of  a  line  through  (1, 0,  —2)  and  perpendicular  to  the  plane 
of  Example  1. 

The  vector  3i  —  4j  +  k  is  perpendicular  to  the  plane  of  Example  1  and  so  parallel 
to  the  desired  line.  Thus  by  (5.6)  the  symmetric  equations  of  the  line  are 

(x  -  1)  _  _y_  _  (z  +  2) 

3  “  -4  ““  1 

By  (5.8)  the  parametric  equations  of  the  line  are  r  =  i  —  2k  +  (3i  —  4j  +  k)f  or,  if 
you  like,  r  =  (1,  0,  —2)  +  (3,  —4,  l)t. 

Vectors  give  us  a  very  convenient  way  of  finding  distances  between  points  and 
lines  or  planes.  Suppose  we  want  to  find  the  (perpendicular)  distance  from  a  point  P 
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Figure  5.6 


to  the  plane  (5.10).  (See  Figure  5.6.)  We  pick  any  point  Q  we  like  in  the  plane 
(just  by  looking  at  the  equation  of  the  plane  and  thinking  of  some  simple  numbers 
x ,  y,  z  that  satisfy  it).  The  distance  PR  is  what  we  want.  Since  PR  and  RQ  are 
perpendicular  (because  PR  is  perpendicular  to  the  plane),  we  have  from  Figure  5.6 

(5.11)  PR  =  PQ  cos  9. 

From  the  equation  of  the  plane,  we  can  find  a  vector  N  normal  to  the  plane.  If  we 
divide  N  by  its  magnitude,  we  have  a  unit  vector  normal  to  the  plane;  we  denote 
this  unit  vector  by  n.  Then  \PQ  ■  n|  =  ( PQ )  cos  9,  which  is  what  we  need  in  (5.11) 
to  find  PR.  (We  have  put  in  absolute  value  signs  because  PQ  ■  n  might  be  negative, 
whereas  (PQ)  cos  9,  with  9  acute  as  in  Figure  5.6,  is  positive.) 


Example 3.  Find  the  distance  from  the  point  P(l,  —  2, 3)  to  the  plane  3a:  —  2y  +  z+l  =  0. 

One  point  in  the  plane  is  (1,2,0);  call  this  point  Q.  Then  the  vector  from  P 
to  Q  is 

PQ  =  (1,2,0)  -  (1,  -2, 3)  =  (0, 4,  —3)  =  4j  -  3k. 

From  the  equation  of  the  plane  we  get  the  normal  vector 

N  =  3i  —  2j  +  k. 


We  get  n  by  dividing  N  by  |N|  =  \Zl4.  Then  we  have 

\PR\  =  \PQ  •  n|  =  |(4j  -  3k)  •  (3i  -  2 j  +  k)/v/l4 
=  |(-8  -  3 )/v/Ti  I  =  11/^14. 


We  can  find  the  distance  from  a  point  P  to  a  line  in  a  similar  way.  In  Figure  5.7 
we  want  the  perpendicular  distance  PR.  We  select  any  point  on  the  line  [that  is,  we 
pick  any  (x,  y ,  z)  satisfying  the  equations  of  the  line];  call 


this  point  Q.  Then  (see  Figure  5.7)  PR  =  PQsin9.  Let 
A  be  a  vector  along  the  line  and  u  a  unit  vector  along  the 
line  (obtained  by  dividing  A  by  its  magnitude).  Then 

PQ  X  u  =  |PQ|  sin  9, 

so  we  get 

|PP|  =  \pQxu\. 
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Example4.  Find  the  distance  from  P(  1,  2,  —1)  to  the  line  joining  Pl(0,  0, 0)  and  P2(— 1,  0,  2). 
- » 

Let  A  =  P1P2  =  — i  +  2k;  this  is  a  vector  along  the  line.  Then  a  unit  vector 
along  the  line  is  u  =  (l/v/5)(— i+  2k).  Let  us  take  Q  to  be  Pi(0,0,0).  Then 
PQ  =  —  i  —  2j  +  k,  so  we  get  for  the  distance  |PP|: 

|PP|  =  -L|(-i-2j+k)  x  (-i  +  2k)|  =  -i=|-4i+j-2k|  =  v/2l75. 

It  is  also  straightforward  to  find  the  distance  between  two  skew  lines  (and  if 
you  really  want  to  appreciate  vectors,  just  look  up  this  calculation  in  an  analytic 
geometry  book  that  doesn’t  use  vectors!).  Pick  two  points  P  and  Q ,  one  on  each 
line  (Figure  5.8).  Then  | PQ  ■  n|,  where  n  is  a  unit  vector  perpendicular  to  both 
lines,  is  the  distance  we  want.  Now  if  A  and  B  are  vectors  along  the  two  lines,  then 
A  X  B  is  perpendicular  to  both,  and  n  is  just  A  X  B  divided  by  jA  X  B|. 


Figure  5.8 


Example  5.  Find  the  distance  between  the  lines  r  =  i  —  2j  +  (i  —  k)f  and  r  =  2j  —  k+(j  —  i)f. 

If  we  write  the  first  line  as  r  =  ro  +  At,  then  (the  head  of)  ro  is  a  simple  choice 
for  P,  so  we  have 

P=(l,— 2,0)  and  A  =  i  k. 

Similarly,  from  the  second  line  we  find 

Q  =  (0,2,—  1)  and  B=j  — i. 

Then  AxB  =  i+  j  +  k  and  n  =  (l/ -\/3 )  (i  +  j  +  k) .  Also 

PQ  =  (0, 2,  -1)  -  (1,  -2, 0)  =  (-1, 4,  -1)  =  -i  +  4j  -  k. 


Thus  we  get  for  the  distance  between  the  lines 


PQ  •  n 


(~i  +  4j  -  k)  •  (i  +  j  +  k)/-\/3 


1  —  1  +  4  — 1|/V3  =  2/V3. 


Example  6.  Find  the  direction  of  the  line  of  intersection  of  the  planes 
x  —  2y  +  2>z  =  4  and  2x  +  y  —  z  =  5. 

The  desired  line  lies  in  both  planes,  and  so  is  perpendicular  to  the  two  normal 
vectors  to  the  planes,  namely  i  —  2j  +  3k  and  2i  +  j  —  k.  Then  the  direction  of  the 
line  is  that  of  the  cross  product  of  these  normal  vectors;  this  is  — i  +  7j  +  5k. 
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Example  7.  Find  the  cosine  of  the  angle  between  the  planes  of  Example  6. 

The  angle  between  the  planes  is  the  same  as  the  angle  between  the  normals  to 
the  planes.  Thus  our  problem  is  to  find  the  angle  between  the  vectors  A  =  i  —  2j  +  3k 
and  B  =  2i  +  j  —  k.  Since  A  •  B  =  |A|  |B|  cos  9,  we  have  —3  =  v^Vficos 9,  and  so 
cos  9  =  —  -y/3/28.  This  gives  the  obtuse  angle  between  the  planes;  the  corresponding 
acute  angle  is  tt  —  9,  or  arc  cos  1/3/28. 

PROBLEMS,  SECTION  5 

In  Problems  1  to  5,  all  lines  are  in  the  ( x ,  y)  plane. 

1.  Write  the  equation  of  the  straight  line  through  (2,-3)  with  slope  3/4,  in  the  para¬ 
metric  form  r  =  ro  +  At. 

2.  Find  the  slope  of  the  line  whose  parametric  equation  is  r  =  (i  —  j)  +  (2i  +  3j)f. 

3.  Write,  in  parametric  form  [as  in  Problem  1],  the  equation  of  the  straight  line  that 
joins  (1,-2)  and  (3,0). 

4.  Write,  in  parametric  form,  the  equation  of  the  straight  line  that  is  perpendicular  to 
r  =  (2i  +  4j)  +  (i  —  2j)t  and  goes  through  (1,  0). 

5.  Write,  in  parametric  form,  the  equation  of  the  y  axis. 

Find  the  symmetric  equations  (5.6)  or  (5.7)  and  the  parametric  equations  (5.8)  of  a  line, 
and/or  the  equation  (5.10)  of  the  plane  satisfying  the  following  given  conditions. 

6.  Line  through  (1,  —1,  —5)  and  (2,  —3,  —3). 

7.  Line  through  (2,  3,  4)  and  (5, 1,  —2). 

8.  Line  through  (0,  —2,  4)  and  (3,  —2,  —1). 

9.  Line  through  (—1,3,7)  and  (-1,-2,  7). 

10.  Line  through  (3, 4,  —1)  and  parallel  to  2i  —  3j  +  6k. 

11.  Line  through  (4,  —1,3)  and  parallel  to  i  —  2k. 

12.  Line  through  (5,  —4,  2)  and  parallel  to  the  line  r  =  i  —  j  +  (5i  —  2j  +  k )t. 

13.  Line  through  (3,  0,  —5)  and  parallel  to  the  line  r  =  (2,  1,  —5)  +  (0,  —3, 1  )t. 

14.  Plane  containing  the  triangle  ABC  of  Problem  4.11. 

15.  Plane  through  the  origin  and  the  points  in  Problem  8. 

16.  Plane  through  the  point  and  perpendicular  to  the  line  in  Problem  12. 

17.  Plane  through  the  point  and  perpendicular  to  the  line  in  Problem  13. 

18.  Plane  containing  the  two  parallel  lines  in  Problem  12. 

19.  Plane  containing  the  two  parallel  lines  in  Problem  13. 

20.  Plane  containing  the  three  points  (0,  1, 1),  (2, 1,3),  and  (4,2,  1). 

In  Problems  21  to  23,  find  the  angle  between  the  given  planes. 

21.  2x  +  6y  —  3z  =  10  and  5x  +  2y  —  z  =  12. 

22.  2x  —  y  —  z  =  4  and  3x  —  2y  —  6z  =  7. 

23.  2x  +  y  —  2z  =  3  and  3x  —  Qy  —  2z  =  4. 
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24.  Find  a  point  on  both  the  planes  (that  is,  on  their  line  of  intersection)  in  Problem  21. 
Find  a  vector  parallel  to  the  line  of  intersection.  Write  the  equations  of  the  line  of 
intersection  of  the  planes.  Find  the  distance  from  the  origin  to  the  line. 

25.  As  in  Problem  24,  find  the  equations  of  the  line  of  intersection  of  the  planes  in 
Problem  22.  Find  the  distance  from  the  point  (2, 1,  —1)  to  the  line. 

26.  As  in  Problem  24,  find  the  equations  of  the  line  of  intersection  of  the  planes  in 
Problem  23.  Find  the  distance  from  the  point  (1,0,0)  to  the  line. 

27.  Find  the  equation  of  the  plane  through  (2,  3,  —2)  and  perpendicular  to  both  planes 
in  Problem  21. 

28.  Find  the  equation  of  the  plane  through  (—4,  —1,  2)  and  perpendicular  to  both  planes 
in  Problem  22. 

29.  Find  a  point  on  the  plane  2x  —  y  —  z  =  13.  Find  the  distance  from  (7, 1,  —2)  to  the 
plane. 

30.  Find  the  distance  from  the  origin  to  the  plane  3x  —  2y  —  6z  =  7. 

31.  Find  the  distance  from  (—2,  4,  5)  to  the  plane  2x  +  6y  —  3z  =  10. 

32.  Find  the  distance  from  (3,  —1,  2)  to  the  plane  5x  —  y  —  z  =  4. 

33.  Find  the  perpendicular  distance  between  the  two  parallel  lines  in  Problem  12. 

34.  Find  the  distance  (perpendicular  is  understood)  between  the  two  parallel  lines  in 
Problem  13. 

35.  Find  the  distance  from  (2,  5, 1)  to  the  line  in  Problem  10. 

36.  Find  the  distance  from  (3,  2,  5)  to  the  line  in  Problem  11. 

37.  Determine  whether  the  lines 

x—  1  y  +  3  z  —  4  ,  x  +  3  y  +  4  8  — z 

2  =  I  =  ^3~  and  —  =  —  =  — 

intersect.  Two  suggestions:  (1)  Can  you  find  the  intersection  point,  if  any?  (2)  Con¬ 
sider  the  distance  between  the  lines. 

38.  Find  the  angle  between  the  lines  in  Problem  37. 

In  Problems  39  and  40,  show  that  the  given  lines  intersect  and  find  the  acute  angle  between 
them. 


39.  r  =  2j  +  k  +  (3i  —  k)ti  and  r  =  7i  +  2k  +  (2i  —  j  +  k)t2. 

40.  r  =  (5,  —2,  0)  +  (1,  —1,  — l)ti  and  r  =  (4, -4, -1)  +  (0,  3, 2)t2. 

In  Problems  41  to  44,  find  the  distance  between  the  two  given  lines. 


41. 

42. 

43. 

44. 

45. 


r  «  (4, 3,-1) +  (1,1,1)*  and  r  =  (4, -1, 1)  +  (1, -2, -1  )t. 

The  line  that  joins  (0,  0,  0)  to  (1,  2,-1),  and  the  line  that  joins  (1, 1, 1)  to  (2,  3, 4). 
y  +  2  2z  —  1  x  +  2  2  —  y 


x  —  1 


and 


2  3  4  -1 

The  x  axis  and  r  =  j  —  k  +  (2i  —  3j  +  k)f. 


1 

Z  ~  ~2 


A  particle  is  traveling  along  the  line  (x  —  3)  / 2  =  (y  +  1)  /  ( — 2)  =  z  —  1.  Write  the 
equation  of  its  path  in  the  form  r  =  ro  +  At.  Find  the  distance  of  closest  approach 
of  the  particle  to  the  origin  (that  is,  the  distance  from  the  origin  to  the  line).  If 
t  represents  time,  show  that  the  time  of  closest  approach  is  t  =  —  (ro  •  A)/|A|2. 
Use  this  value  to  check  your  answer  for  the  distance  of  closest  approach.  Hint:  See 
Figure  5.3.  If  P  is  the  point  of  closest  approach,  what  is  A  •  r? 
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6.  MATRIX  OPERATIONS 

In  Section  2  we  used  matrices  simply  as  arrays  of  numbers.  Now  we  want  to  go 
farther  into  the  subject  and  discuss  the  meaning  and  use  of  multiplying  a  matrix 
by  a  number  and  of  combining  matrices  by  addition,  subtraction,  multiplication, 
and  even  (in  a  sense)  division.  We  will  see  that  we  may  be  able  to  find  functions  of 
matrices  such  as  eM.  These  are,  of  course,  all  questions  of  definition,  but  we  shall 
show  some  applications  which  might  suggest  reasonable  definitions;  or  alternatively, 
given  the  definitions,  we  shall  see  what  applications  we  can  make  of  the  matrix 
operations. 

Matrix  Equations  Let  us  first  emphasize  again  that  two  matrices  are  equal  only 
if  they  are  identical.  Thus  the  matrix  equation 

x  r  u\  _  f  2  1  —5  \ 

y  s  vj  \^3  —  7  i  1  —  i) 

is  really  the  set  of  six  equations 

x  =  2,  y  =  3,  r  =  1,  s  =  —7  i,  u  =  —5,  v  =  1  —  i. 

(Recall  similar  situations  we  have  met  before:  The  equation  z  =  x  +  iy  =  2  —  3i 
is  equivalent  to  the  two  real  equations  x  =  2,  y  =  —3;  a  vector  equation  in  three 
dimensions  is  equivalent  to  three  component  equations.)  In  complicated  problems 
involving  many  numbers  or  variables,  it  is  often  possible  to  save  a  great  deal  of  writ¬ 
ing  by  using  a  single  matrix  equation  to  replace  a  whole  set  of  ordinary  equations. 
Any  time  it  is  possible  to  so  abbreviate  the  writing  of  a  mathematical  equation  (like 
using  a  single  letter  for  a  complicated  parenthesis)  it  not  only  saves  time  but  often 
enables  us  to  think  more  clearly. 

Multiplication  of  a  Matrix  by  a  Number  A  convenient  way  to  display  the 
components  of  the  vector  A  =  2i  +  3j  is  to  write  them  as  elements  of  a  matrix, 
either 

called  a  column  matrix  or  column  vector, 
or 

At  =  (2  3)  called  a  row  matrix  or  row  vector. 

The  row  matrix  AT  is  the  transpose  of  the  column  matrix  A.  Observe  the  notation 
we  are  using  here:  We  will  often  use  the  same  letter  for  a  vector  and  its  column 
matrix,  but  we  will  usually  write  the  letter  representing  the  matrix  as  A  (roman, 
not  boldface),  the  vector  as  boldface  A,  and  the  length  of  the  vector  as  italic  A. 

Now  suppose  we  want  a  vector  of  twice  the  length  of  A  and  in  the  same  direction; 
we  would  write  this  as  2 A  =  4i  +  6j.  Then  we  would  like  to  write  its  matrix 
representation  as 

2A  =  2  (3)  =  (e)  ’  2AT  =  2(2  3)  =  (4  6)- 

This  is,  in  fact,  exactly  how  a  matrix  is  multiplied  by  a  number:  every  element  of 
the  matrix  is  multiplied  by  the  number.  Thus 

a  c  e\  _  fka  kc  ke\ 
b  d  f)  =  \kb  kd  kf ) 
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and 


-6 

5 


Note  carefully  a  difference  between  determinants  and  matrices:  multiplying  a 
matrix  by  a  number  k  means  multiplying  every  element  by  k ,  but  multiplying  just 
one  row  of  a  determinant  by  k  multiplies  the  determinant  by  k.  Thus  det(fcA)  = 
k 2  det  A  for  a  2  by  2  matrix,  det(fcA)  =  k3  det  A  for  a  3  by  3  matrix,  and  so  on. 


Addition  of  Matrices  When  we  add  vectors  algebraically,  we  add  them  by  com¬ 
ponents.  Matrices  are  added  in  the  same  way,  by  adding  corresponding  elements. 
For  example, 


(6.1) 


/ 1  +  2  3-1  -2  +  4\ 

^4  +  3  7-7  1-2  J 


Note  that  if  we  add  A  +  A  we  would  get  2A  in  accord  with  our  definition  of  twice 
a  matrix  above.  Suppose  we  have 


A  = 


1  3 
4  7 


In  this  case  we  cannot  add  A  and  B;  we  say  that  the  sum  is  undefined  or  meaningless. 

In  applications,  then,  matrices  are  useful  in  representing  things  which  are  added 
by  components.  Suppose,  for  example,  that,  in  (6.1),  the  columns  represent  dis¬ 
placements  of  three  particles.  The  first  particle  is  displaced  by  i  +  4j  (first  column  of 
the  first  matrix)  and  later  by  2i  +  3j  (first  column  of  the  second  matrix).  The  total 
displacement  is  then  3i  +  7j  (first  column  of  the  sum  of  the  matrices).  Similarly  the 
second  and  third  columns  represent  displacements  of  the  second  and  third  particles. 


Multiplication  of  Matrices  Let  us  start  by  defining  the  product  of  two  matrices 
and  then  see  what  use  we  can  make  of  the  process.  Here  is  a  simple  example  to 
show  what  is  meant  by  the  product  AB  =  C  of  two  matrices  A  and  B: 

<«■*>  ab=(:  s)G  0-(“+* 

Observe  that  in  the  product  matrix  C,  the  element  in  the  first  row  and  first  column  is 
obtained  by  multiplying  each  element  of  the  first  row  in  A  times  the  corresponding 
element  in  the  first  column  of  B  and  adding  the  results.  This  is  referred  to  as 
“row  times  column”  multiplication;  when  we  compute  ae  +  bg ,  we  say  that  we  have 
“multiplied  the  first  row  of  A  times  the  first  column  of  B.”  Next  examine  the 
element  af  +  bh  in  the  first  row  and  second  column  of  C;  it  is  the  “first  row  of  A 
times  the  second  column  of  B.”  Similarly,  ce  +  dg  in  the  second  row  and  first  column 
of  C  is  the  “second  row  of  A  times  the  first  column  of  B,”  and  cf  +  dh  in  the  second 
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row  and  second  column  of  C  is  the  “second  row  of  A  times  the  second  column  of  B.” 
Thus  all  the  elements  of  C  may  be  obtained  by  using  the  following  simple  rule: 


The  element  in  row  i  and  column  j  of  the  product  matrix  AB  is 
equal  to  row  i  of  A  times  column  j  of  B.  In  index  notation 

(AB)y  =  ^  -Aa,  Bkj  - 

k 


Here  is  another  useful  way  of  saying  this:  Think  of  the  elements  in  a  row  (or 
a  column)  of  a  matrix  as  the  components  of  a  vector.  Then  row  times  column 
multiplication  for  the  matrix  product  AB  corresponds  to  finding  the  dot  product  of 
a  row  vector  of  A  and  a  column  vector  of  B. 

It  is  not  necessary  for  matrices  to  be  square  in  order  for  us  to  multiply  them. 
Consider  the  following  example. 


Example  1.  Find  the  product  of  A  and  B  if 


A  = 


Following  the  rule  we  have  stated,  we  get 


AB 


4  2 
-3  1 


15  3 

2  7-4 


4  •  1  +  2  •  2  4  •  5  +  2  •  7 

-3 -1  +  1-  2  -3 -5  +  1-  7 

8  34  4 

-1  -8  -13 


4  •  3  +  2(— 4)  \ 
— 3  •  3  +  1(— 4)  J 


Notice  that  the  third  column  in  B  caused  us  no  difficulty  in  following  our  rule;  we 
simply  multiplied  each  row  of  A  times  the  third  column  of  B  to  obtain  the  elements 
in  the  third  column  of  AB.  But  suppose  we  tried  to  find  the  product  BA.  In  B  a 
row  contains  3  elements,  while  in  A  a  column  contains  only  two;  thus  we  are  not 
able  to  apply  the  “row  times  column”  method.  Whenever  this  happens,  we  say  that 
B  is  not  conformable  with  respect  to  A,  and  the  product  BA  is  not  defined  (that  is, 
it  is  meaningless  and  we  do  not  use  it). 


The  product  AB  (in  that  order)  can  be  found  if  and  only  if  the  number  of  elements 
in  a  row  of  A  equals  the  number  of  elements  in  a  column  of  B;  the  matrices  A, 
B  in  that  order  are  then  called  conformable.  (Observe  that  the  number  of  rows 
in  A  and  of  columns  in  B  have  nothing  to  do  with  the  question  of  whether  we 
can  find  AB  or  not.) 
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Example  2.  Find  AB  and  BA,  given 


Note  that  here  the  matrices  are  conformable  in  both  orders,  so  we  can  find  both 
AB  and  BA. 


AB 


BA 


(  3  •  5  -  l(-7) 
V— 4-5  +  2(— 7) 


(  5  •  3  +  2(— 4) 
V— 7-3  +  3(— 4) 


3  •  2  —  1  •  3  \ 
-4- 2  + 2 -3/ 


5(— 1)  +  2-2  \ 
— 7(— 1)  +  3  •  2y 


Observe  that  AB  is  not  the  same  as  BA.  We  say  that  matrix  multiplication  is 
not  commutative,  or  that,  in  general,  matrices  do  not  commute  under  multiplica¬ 
tion.  (Of  course,  two  particular  matrices  may  happen  to  commute.)  We  define  the 
commutator  of  the  matrices  A  and  B  by 


(6.3)  [A,  B]  =  AB  —  BA  =  commutator  of  A  and  B. 


(Commutators  are  of  interest  in  classical  and  quantum  mechanics.)  Since  matrices 
do  not  in  general  commute,  be  careful  not  to  change  the  order  of  factors  in  a  product 
of  matrices  unless  you  know  they  commute.  For  example 

(A  —  B)(A  +  B)  =  A2  +  AB-BA  -  B2  =  A2  —  B2  +  [A, B]. 

This  is  not  equal  to  A2  —  B2  when  A  and  B  don’t  commute.  Also  see  the  discussion 
just  after  (6.17).  On  the  other  hand,  the  associative  law  is  valid,  that  is,  A(BC)  = 
(AB)C,  so  we  can  write  either  as  simply  ABC.  Also  the  distributive  law  holds: 
A(B  +  C)  =  AB  +  AC  and  (A  +  B)C  =  AC  +  BC  as  we  have  been  assuming  above. 
(See  Section  9.) 


Zero  Matrix  The  zero  or  null  matrix  means  one  with  all  its  elements  equal  to 
zero.  It  is  often  abbreviated  by  0,  but  we  must  be  careful  about  this.  For  example: 

(6.4)  If  M  =  ^  ifj,  then  M2  = 

so  we  have  M2  =  0,  but  M  yf  0.  Also  see  Problems  9  and  10. 
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Identity  Matrix  or  Unit  Matrix  This  is  a  square  matrix  with  every  element 
of  the  main  diagonal  (upper  left  to  lower  right)  equal  to  1  and  all  other  elements 
equal  to  zero.  For  example 

/I  0  0 

(6.5)  010 

\0  0  1 

is  a  unit  or  identity  matrix  of  order  3  (that  is,  three  rows  and  three  columns).  An 
identity  or  unit  matrix  is  called  1  or  I  or  U  or  E  in  various  references.  You  should 
satisfy  yourself  that  in  multiplication,  a  unit  matrix  acts  like  the  number  1,  that 
is,  if  A  is  any  matrix  and  I  is  the  unit  matrix  conformable  with  A  in  the  order  in 
which  we  multiply,  then  IA  =  AI  =  A  (Problem  11). 

Operations  with  Determinants  We  do  not  define  addition  for  determinants. 
However,  multiplication  is  useful;  we  multiply  determinants  the  same  way  we  mul¬ 
tiply  matrices.  It  can  be  shown  that  if  A  and  B  are  square  matrices  of  the  same 
order,  then 


(6.6)  det  AB  =  detBA  =  (det  A)  •  (detB). 


Look  at  Example  2  above  to  see  that  (6.6)  is  true  even  when  matrices  AB  and  BA 
are  not  equal,  that  is,  when  A  and  B  do  not  commute. 


Applications  of  Matrix  Multiplication  We  can  now  write  sets  of  simultaneous 
linear  equations  in  a  very  simple  form  using  matrices.  Consider  the  matrix  equation 


If  we  multiply  the  first  two  matrices,  we  have 


/  x  —  z  \  /  5 

(6.8)  -2x  +  32/  =  1 

\x  —  3y  +  2z  J  \-10 

Now  recall  that  two  matrices  are  equal  only  if  they  are  identical.  Thus  (6.8)  is  the 
set  of  three  equations 

(  x  —  z  =  5 

(6.9)  l  -2x  +  3y  =  1  . 

^  x  —  3y  +  2z  =  —10 

Consequently  (6.7)  is  the  matrix  form  for  the  set  of  equations  (6.9).  In  this  way  we 
can  write  any  set  of  linear  equations  in  matrix  form.  If  we  use  letters  to  represent 
the  matrices  in  (6.7), 


(6.10) 


M  = 
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then  we  can  write  (6.7)  or  (6.9)  as 


(6.11) 


Mr  =  k. 


Or,  in  index  notation,  we  can  write  yV  MijXj  =  fcj.  [Review  Section  2,  equations 
(2.3)  to  (2.6).]  Note  that  (6.11)  could  represent  any  number  of  equations  or  un¬ 
knowns  (say  100  equations  in  100  unknowns!).  Thus  we  have  a  great  simplification 
in  notation  which  may  help  us  to  think  more  clearly  about  a  problem.  For  example, 
if  (6.11)  were  an  ordinary  algebraic  equation,  we  would  solve  it  for  r  to  get 


(6.12) 


r  =  M  1k. 


Since  M  is  a  matrix,  (6.12)  only  makes  sense  if  we  can  give  a  meaning  to  M  1  such 
that  (6.12)  gives  the  solution  of  (6.7)  or  (6.9).  Let’s  try  to  do  this. 


Inverse  of  a  Matrix  The  reciprocal  or  inverse  of  a  number  x  is  x~l  such  that 
the  product  xx~x  =  1.  We  define  the  inverse  of  a  matrix  M  (if  it  has  one)  as  the 
matrix  M^1  such  that  MM^1  and  M-1M  are  both  equal  to  a  unit  matrix  I.  Note 
that  only  square  matrices  can  have  inverses  (otherwise  we  could  not  multiply  both 
MM'1  and  M-1M).  Actually,  some  square  matrices  do  not  have  inverses  either. 
You  can  see  from  (6.6)  that  if  M_1M  =  I,  then  (det  M_1)(det  M)  =  detl  =  1.  If 
two  numbers  have  product  =  1,  then  neither  of  them  is  zero;  thus  detM  ^  0  is  a 
requirement  for  M  to  have  an  inverse. 

If  a  matrix  has  an  inverse  we  say  that  it  is  invertible;  if  it  doesn’t  have  an  inverse, 
it  is  called  singular.  For  simple  numerical  matrices  your  computer  will  easily  produce 
the  inverse  of  an  invertible  matrix.  However,  for  theoretical  purposes,  we  need  a 
formula  for  the  inverse;  let’s  discuss  this.  The  cofactor  of  an  element  in  a  square 
matrix  M  means  exactly  the  same  thing  as  the  cofactor  of  that  element  in  det  M 
[see  (3.3)  and  (3.4)].  Thus,  the  cofactor  of  the  element  ml:j  in  row  i  and  column 
j  is  a  number  equal  to  (— 1  )l+J  times  the  value  of  the  determinant  remaining  when 
we  cross  off  row  i  and  column  j.  Then  to  find  M_1:  Find  the  cofactors  G'y  of  all 
elements,  write  the  matrix  C  whose  elements  are  Gy ,  transpose  it  (interchange  rows 
and  columns),  and  divide  by  detM.  (See  Problem  23.) 


(6.13) 


M  1  =  - — —  CT  where  GY,  =  cofactor  of  m a 
det  M 


Although  (6.13)  is  particularly  useful  in  theoretical  work,  you  should  practice  using 
it  (as  we  said  for  Cramer’s  rule)  on  simple  numerical  problems  in  order  to  learn 
what  the  formula  means. 
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Example  3.  For  the  matrix  M  of  the  coefficients  in  equations  (6.7)  or  (6.9),  find  M 


-l 


We  find  clet  M  =  3.  The  cofactors  of  the  elements  are: 


row  : 


-)iid 


3rd 


row  : 


row  : 


3 

-3 


0  -1 
-3  2 

0  -1 
3  0 


=  6, 
=  3, 
=  3, 


-2 

1 

1 

1 


1 

-2 


0 

2 

-1 

2 

-1 

0 


=  4, 


=  3, 


=  2, 


-2 

1 

1 

1 


3 

-3 

0 

-3 


1 

-2 


=  3. 
=  3. 
=  3. 


Then 


C  = 


so 


M"1  = 


1 


cletM 


C1  = 


Now  we  can  use  M  1  to  solve  equations  (6.9).  By  (6.12),  the  solution  is  given  by 
the  column  matrix  r  =  M-1k,  so  we  have 


1 


V  = 


or  x  =  1,  y  =  1,  z  =  —4.  (See  Problem  12.) 

Rotation  Matrices  As  another  example  of  matrix  multiplication,  let’s  consider 
a  case  where  we  know  the  answer,  just  to  see  that  our  definition  of  matrix  multi¬ 
plication  works  the  way  we  want  it  to.  You  probably  know  the  rotation  equations 
[for  reference,  see  the  next  section,  equation  (7.12)  and  Figure  7.4].  Equation  (7.12) 
gives  the  matrix  which  rotates  the  vector  r  =  ix  +  jy  through  angle  9  to  become 
the  vector  R  =  iX  +  j  Y.  Suppose  we  further  rotate  R  through  angle  <f>  to  become 
R'  =  iX'  +  j Y' .  We  could  write  the  matrix  equations  for  the  rotations  in  the  form 
R  =  Mr  and  R'  =  M'R  where  M  and  M'  are  the  rotation  matrices  (7.12)  for  rotation 
through  angles  9  and  4>.  Then,  solving  for  R/  in  terms  of  r,  we  get  R/  =  M'Mr.  We 
expect  the  matrix  product  M'M  to  give  us  the  matrix  for  a  rotation  through  the 
angle  9  +  (f),  that  is  we  expect  to  find 


(6.14) 


cos  (p  —  sin  c, 
sin  <b  cos  < 


cos  9  —  sin  9 

sin  9  cos  9 


cos  (9  +  <j>)  —  sin(0  +  <j>) 
sin(0  +  <j>)  cos  (9  +  <j>) 


It  is  straightforward  to  multiply  the  two  matrices  (Problem  25)  and  verify  (by  using 
trigonometric  identities)  that  (6.14)  is  correct.  Also  note  that  these  two  rotation 
matrices  commute  (that  is,  rotation  through  angle  9  and  then  through  angle  (j)  gives 
the  same  result  as  rotation  through  <f>  followed  by  rotation  through  9) .  This  is  true 
in  this  problem  in  two  dimensions.  As  we  will  see  in  Section  7,  rotation  matrices  in 
three  dimensions  do  not  in  general  commute  if  the  two  rotation  axes  are  different. 
(See  Problems  7.30  and  7.31.)  But  all  rotations  in  the  (x,  y)  plane  are  rotations 
about  the  2  axis  and  so  they  commute. 
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Functions  of  Matrices  Since  we  now  know  how  to  multiply  matrices  and  how 
to  add  them,  we  can  evaluate  any  power  of  a  matrix  A  and  so  evaluate  a  polynomial 
in  A.  The  constant  term  c  or  cA°  in  a  polynomial  is  defined  to  mean  c  times  the 
unit  matrix  I  [see  (6.16)  below]. 


Example  4. 

(6.15) 


If 


A=(-U  ^)’  the" 

A3  =  -A,  A4  =  I, 


and  so  on. 


-I, 


(Verify  these  powers  and  the  fact  that  higher  powers  simply  repeat  these  four  results: 
A,  —I,  —A,  I,  over  and  over.)  Then  we  can  find  (Problem  28) 

(6.16)  /(A)  =  3  -  2A2  -  A3  -  5A4  +  A6 

-3I  +  2I  +  A  —  5I-I  =  A  —  I=(_^  ^). 


We  can  extend  this  to  other  functions  by  expanding  a  given  f{x)  in  a  power  series 
if  all  the  series  we  need  to  use  happen  to  converge.  For  example,  the  series  for 
e2  converges  for  all  z,  so  we  can  find  ekA  when  A  is  a  given  matrix  and  k  is  any 
number,  real  or  complex.  Let  A  be  the  matrix  in  (6.15).  Then  (Problem  28),  we 
find 


(6.17) 


„fcA 


=l+fcAi 

k2 

=  (1  -  — 

^  2! 


k2  A2  fc3  A3  fc4A4  k5  A5 


2!  3! 

k 4  ,T 

iT  +  "')14 


=  (cos  k)  I  +  (sin  k)  A  = 


4!  5! 

<^3  b,5 

3!  +  5!)A 

cos  k  +  sin  k  \/2  sin  k 
—  a/2  sin  k  cos  k  —  sin  k 


A  word  of  warning  about  functions  of  two  matrices  when  A  and  B  don’t  com¬ 
mute:  Familiar  formulas  may  mislead  you;  see  (6.3)  and  the  discussion  following  it. 
Be  sure  to  write  (A  +  B)2  =  A2  +  AB  +  BA  +  B2;  don’t  write  2AB.  Similarly,  you 
can  show  that  eA+B  is  not  the  same  as  eAeB  when  A  and  B  don’t  commute  (see 
Problem  29  and  Problem  15.34). 


PROBLEMS,  SECTION  6 


In  Problems  1  to  3,  find  AB,  BA,  A  +  B,  A  —  B,  A2,  B2,  5A,  3B.  Observe  that  AB  V  BA. 
Show  that  (A  —  B)(A  +  B)  ±  (A  +  B)(A  —  B)  ±  A2  -  B2.  Show  that  det  AB  =  detBA  = 
(det  A)(det  B),  but  that  det(A  +  B)  ^  det  A  +  detB.  Show  that  det(5A)  ^  5det  A,  and 
find  n  so  that  det(5A)  =  5”  det  A.  Find  similar  results  for  det(3B).  Remember  that  the 
point  of  doing  these  simple  problems  by  hand  is  to  learn  how  to  manipulate  determinants 
and  matrices  correctly.  Check  your  answers  by  computer. 


1.  A  = 

2.  A  = 


B  = 
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A 

0 

2\ 

A 

1 

°\ 

3.  A  =  3 

-1 

°  , 

B  =  0 

2 

1) 

Vo 

5 

1/ 

\3 

-1 

0/ 

4.  Given  the  matrices 


A  = 


3  1 
1  0 


compute  or  mark  as  meaningless  all  products  of  two  of  these  matrices  (AB,  BA,  A2, 
etc.);  of  three  of  them  (ABC,  A2C,  A3,  etc.). 


5.  Compute  the  product  of  each  of  the  matrices  in  Problem  4  with  its  transpose  [see 
(2.2)  or  (9.1)]  in  both  orders,  that  is  AAT  and  ATA,  etc. 

6.  The  Pauli  spin  matrices  in  quantum  mechanics  are 


A  = 


(You  will  probably  find  these  called  ax,  ay,  az  in  your  quantum  mechanics  texts.) 
Show  that  A2  =  B2  =  C2  =  a  unit  matrix.  Also  show  that  any  two  of  these  matrices 
anticommute,  that  is,  AB  =  —BA,  etc.  Show  that  the  commutator  of  A  and  B,  that 
is,  AB  —  BA,  is  2iC,  and  similarly  for  other  pairs  in  cyclic  order. 

7.  Find  the  matrix  product 


By  evaluating  this  in  two  ways,  verify  the  associative  law  for  matrix  multiplication, 
that  is,  A(BC)  =  (AB)C,  which  justifies  our  writing  just  ABC. 

8.  Show,  by  multiplying  the  matrices,  that  the  following  equation  represents  an  ellipse. 


(x 


=  30. 


9.  Find  AB  and  BA  given 


A  = 


Observe  that  AB  is  the  null  matrix;  if  we  call  it  0,  then  AB  =  0,  but  neither  A  nor 
B  is  0.  Show  that  A  is  singular. 


10.  Given 


and  A  as  in  Problem  9,  show  that  AC  =  AD,  but  C  ^  D  and  A  ^  0. 


11.  Show  that  the  unit  matrix  I  has  the  property  that  we  associate  with  the  number  1, 
that  is,  IA  =  A  and  AI  =  A,  assuming  that  the  matrices  are  conformable. 

12.  For  the  matrices  in  Example  3,  verify  that  MM-1  and  M_1M  both  equal  a  unit 
matrix.  Multiply  M^k  to  verify  the  solution  of  equations  (6.9). 


In  Problems  13  to  16,  use  (6.13)  to  find  the  inverse  of  the  given  matrix. 


6  9 
3  5 


13. 
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/- 1 

2  3\ 

~2 

0 

15. 

2 

1 

o 

16. 

1 

-1 

V-i 

-1  1 J 

V  3 

1 

17. 

Given 

the  matrices 

A  = 


1 

A 

/I 

0 

0 

-1  > 

II 

m 

1 

1 

2 

0/ 

\2 

1 

2/ 

(a)  Find  A"1,  B"1,  B^AB,  and  B-1A-1B. 

(b)  Show  that  the  last  two  matrices  are  inverses,  that  is,  that  their  product  is  the 
unit  matrix. 


18.  Problem  17(b)  is  a  special  case  of  the  general  theorem  that  the  inverse  of  a  product 
of  matrices  is  the  product  of  the  inverses  in  reverse  order.  Prove  this.  Hint:  Multiply 
ABCD  times  D_1C_1B~1A_1  to  show  that  you  get  a  unit  matrix. 


In  Problems  19  to  22,  solve  each  set  of  equations  by  the  method  of  finding  the  inverse  of 
the  coefficient  matrix.  Hint:  See  Example  3. 


19. 


f  x  -  2  y  =  5 
\  3x  +  y  =  15 


20. 


f  2x  +  3y  =  —  1 
(  5x  +  4y  =  8 


(  x  +  2z  —  8 

21.  I  2x  -  y  =  -5 

\  x  +  y  +  z  =  4 


{x  —  y  +  z  =  4 

2x  +  y  -  z  =  -1 
3x  +  2y  +  2z  =  5 


23.  Verify  formula  (6.13).  Hint:  Consider  the  product  of  the  matrices  MCT.  Use 
Problem  3.8. 

24.  Use  the  method  of  solving  simultaneous  equations  by  finding  the  inverse  of  the 
matrix  of  coefficients,  together  with  the  formula  (6.13)  for  the  inverse  of  a  matrix, 
to  obtain  Cramer’s  rule. 

25.  Verify  (6.14)  by  multiplying  the  matrices  and  using  trigonometric  addition  formulas. 

26.  In  (6.14),  let  6  =  <j>  =  7r/2  and  verify  the  result  numerically. 

27.  Do  Problem  26  if  9  =  7r/2,  <j>  =  7r/4. 

28.  Verify  the  calculations  in  (6.15),  (6.16),  and  (6.17). 

29.  Show  that  if  A  and  B  are  matrices  which  don’t  commute,  then  eA+B  eAeB,  but 
if  they  do  commute  then  the  relation  holds.  Hint:  Write  out  several  terms  of  the 
infinite  series  for  eA,  eB,  and  eA+B  and  do  the  multiplications  carefully  assuming 
that  A  and  B  don’t  commute.  Then  see  what  happens  if  they  do  commute. 

30.  For  the  Pauli  spin  matrix  A  in  Problem  6,  find  the  matrices  sin  fcA,  cos  fcA,  efcA,  and 
elkA  where  i  =  \J—  1. 

31.  Repeat  Problem  30  for  the  Pauli  spin  matrix  C  in  Problem  6.  Hint:  Show  that  if  a 

matrix  is  diagonal,  say  D  =  ^  ^  ,  then  /(D)  = 

32.  For  the  Pauli  spin  matrix  B  in  Problem  6,  find  elSB  and  show  that  your  result  is  a 
rotation  matrix.  Repeat  the  calculation  for  e_l9B. 


/(«) 

0 


0 

mj ■ 
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7.  LINEAR  COMBINATIONS,  LINEAR  FUNCTIONS,  LINEAR  OPERATORS 

Given  two  vectors  A  and  B,  the  vector  3 A  —  2B  is  called  a  “linear  combination” 
of  A  and  B.  In  general,  a  linear  combination  of  A  and  B  means  a  A  +  6B  where 
a  and  b  are  scalars.  Geometrically,  if  A  and  B  have  the  same  tail  and  do  not  lie 
along  a  line,  then  they  determine  a  plane.  You  should  satisfy  yourself  that  all  linear 
combinations  of  A  and  B  then  lie  in  the  plane.  It  is  also  true  that  every  vector  in 
the  plane  can  be  written  as  a  linear  combination  of  A  and  B;  we  shall  consider  this 
in  Section  8.  The  vector  r  =  ix  +  jy  +  with  tail  at  the  origin  (which  we  used 
in  writing  equations  of  lines  and  planes)  is  a  linear  combination  of  the  unit  basis 
vectors  i,  j,  k. 


A  function  of  a  vector,  say  /( r),  is  called  linear  if 

(7.1)  /( ri  +  r2)  =  /( ri)  +  /(: r2),  and  /(or)  =  af(  r), 

where  a  is  a  scalar. 


For  example,  if  A  =  2i  +  3j  —  k  is  a  given  vector,  then  /( r)  =  A  ■  r  =  2x  +  3y  —  z 
is  a  linear  function  because 

/( ri  +  r2)  =  A  ■  (ri  +  r2)  =  A  ■  ri  +  A  ■  r2  =  /( iq)  +  /(r2),  and 
/(ar)  =  A  •  (ar)  =  aA  •  r  =  a/( r). 

On  the  other  hand,  /( r)  =  |r|  is  not  a  linear  function,  because  the  length  of  the 
sum  of  two  vectors  is  not  in  general  the  sum  of  their  lengths.  That  is, 

/( ri  +r2)  =  |ri  +  r2|  ^  |ri|  +  |r2|  =  /( iq)  +  /( r2), 

as  you  can  see  from  Figure  7.1.  Also  note  that  although  we  <*■ 

call  y  =  mx  +  b  a  linear  equation  (it  is  the  equation  of  a 
straight  line),  the  function  f(x)  =  mx  +  b  is  not  linear  (unless  irp 

6  =  0)  because 

Figure  7.1 

f{x i  +  x2)  =  m(x i  +  x2)  +  byb  (mx i  +  b)  +  (mx2  +  b)  =  f(x i)  +  f(x2). 

We  can  also  consider  vector  functions  of  a  vector  r.  The  magnetic  field  at  each 
point  (x,  y,  z),  that  is,  at  the  head  of  the  vector  r,  is  a  vector  B  =  i Bx  + j By  +  k Bz. 
The  components  Bx,  By ,  Bz  may  vary  from  point  to  pint,  that  is,  they  are  functions 
of  (x,  y,  z )  or  r.  Then 


F(r)  is  a  linear  vector  function  if 

(7.2)  F(ri  +  r2)  =  F(ri)  +  F(r2)  and  F(ar)  =  aF(r), 

where  a  is  a  scalar. 


For  example,  F(r)  =  br  (where  b  is  a  scalar)  is  a  linear  vector  function  of  r. 


Section  7 


Linear  Combinations,  Linear  Functions,  Linear  Operators  125 


You  know  from  calculus  that 


(7.3) 


where  k  is  a  constant.  We  say  that  d/dx  is  a  “linear  operator”  [compare  (7.3)  with 
(7.1)  and  (7.2)].  An  “operator”  or  “operation”  simply  means  a  rule  or  some  kind  of 
instruction  telling  us  what  to  do  with  whatever  follows  it.  In  other  words,  a  linear 
operator  is  a  linear  function.  Then 


O  is  a  linear  operator  if 

(7.4)  0(A  +  B)  =  O(A)  +  O(B)  and  0{kA)  =  kO(A), 


where  k  is  a  number,  and  A  and  B  are  numbers,  functions,  vectors,  and  so  on. 
Many  of  the  errors  people  make  happen  because  they  assume  that  operators  are 
linear  when  they  are  not  (see  problems). 


Example  1.  Is  square  root  a  linear  operator?  We  are  asking,  is  \J A  +  B  the  same  as 
\J~A  +  VB  ?  The  answer  is  no;  taking  the  square  root  is  not  a  linear  operation. 


Example  2.  Is  taking  the  complex  conjugate  a  linear  operation?  We  want  to  know  whether 
A  +  B  =  A  +  B  and  kA  =  kA.  The  first  equation  is  true;  the  second  equation  is 
true  if  we  restrict  k  to  real  numbers. 


Matrix  Operators,  Linear  Transformations  Consider  the  set  of  equations 


(7.5) 


(  X  =  ax  +  by ,  / X\ 

(  Y  =  cx  +  dy,  \Y J 


or  R  =  Mr, 


where  a,  b,  c,  d,  are  constants.  For  every  point  ( x,y ),  these  equations  give  us  a 
point  (A,  Y).  If  we  think  of  each  point  of  the  (x,  y)  plane  being  moved  to  some 
other  point  (with  some  points  like  the  origin  not  being  moved),  we  can  call  this 
process  a  mapping  or  transformation  of  the  plane  into  itself.  All  the  information 
about  this  transformation  is  contained  in  the  matrix  M.  We  say  that  this  matrix  is 
an  operator  which  maps  the  plane  into  itself.  Any  matrix  can  be  thought  of  as  an 
operator  on  (conformable)  column  matrices  r.  Since 


(7.6)  M(ri  +  r2)  =  Mri  +  Mr2  and  M(fcr)  =  fc(Mr), 


the  matrix  M  is  a  linear  operator. 
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Equations  (7.5)  can  be  interpreted  geometrically  in  two  ways.  In  Figure  7.2,  we 
have  one  set  of  coordinate  axes  and  the  vector  r  has  been  changed  to  the  vector 
Rby  the  transformation  (7.5).  In  Figure  7.3,  we  have  two  sets  of  coordinate  axes, 


(. x,y )  and  ( x',y '),  and  one  vector  r  =  r'  with  coordinates  relative  to  each  set  of 
axes.  This  time  the  transformation 


(7.7) 


J  x'  =  ax  +  by ,  f  x'\ 

\  y'  =  cx  +  dy ,  or  \yy 


or  r'  =  Mr, 


tells  us  how  to  get  the  components  of  the  vector  r  =  r'  relative  to  axes  ( x' ,  y')  when 
we  know  its  components  relative  to  axes  ( x,y ). 


Orthogonal  Transformations  We  shall  be  particularly  interested  in  the  special 
case  of  a  linear  transformation  which  preserves  the  length  of  a  vector.  We  call  (7.7) 
an  orthogonal  transformation  if 


(7.8) 


/2  /2  2 
x  +  y  =  x 


and  similarly  for  (7.5).  You  can  see  from  the  figures  that  this  requirement  says 
that  the  length  of  a  vector  is  not  changed  by  an  orthogonal  transformation.  In 
Figure  7.2,  the  vector  would  be  rotated  (or  perhaps  reflected)  with  its  length  held 
fixed  (that  is  R  =  r  for  an  orthogonal  transformation).  In  Figure  7.3,  the  axes  are 
rotated  (or  reflected),  while  the  vector  stays  fixed.  The  matrix  M  of  an  orthogonal 
transformation  is  called  an  orthogonal  matrix.  Let’s  show  that  the  inverse  of  an 
orthogonal  matrix  equals  its  transpose;  in  symbols 

(7.9)  M^1  =  Mt,  M  orthogonal. 


From  (7.8)  and  (7.7)  we  have 

x'2  +  y 2  =  (ax  +  by)2  +  (cx  +  dy)2 

=  (a2  +  c2)x2  +  2  (ab  +  cd)xy  +  (b2  +  d2)y2  =  x2  +  y2 . 


Thus  we  must  have  a2  +  c2  =  1,  b2  +  d2  =  1,  ab  +  cd  =  0.  Then 

(,10)  =  ;)(•  J) 

_  / a2  +  c2  ab  +  cd\  _  { 1  0\ 
yab  +  cd  b2  +  d2  J  —  1^0  lj 
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Since  MTM  is  the  unit  matrix,  M  and  MT  are  inverse  matrices  as  we  claimed  in 
(7.9).  We  have  defined  an  orthogonal  transformation  in  two  dimensions  and  we  have 
proved  (7.9)  for  the  2-dimensional  case.  However,  a  square  matrix  of  any  order  is 
called  orthogonal  if  it  satisfies  (7.9),  and  you  can  easily  show  that  the  corresponding 
transformation  preserves  the  lengths  of  vectors  (Problem  9.24). 

Now  if  we  write  (7.9)  as  MTM  =  I  and  use  the  facts  from  Section  3  that 
det(MTM)  =  (det  MT)(det  M)  and  det  MT  =  det  M,  we  have  (det  M)2  =  det(MTM)  = 
detl  =  1,  so 

(7.11)  cletM  =  ±l,  M  orthogonal. 

This  is  true  for  M  of  any  order  since  we  have  used  only  the  definition  (7.9)  of  an 
orthogonal  matrix  and  some  properties  of  determinants.  As  we  shall  see,  det  M  =  1 
corresponds  geometrically  to  a  rotation,  and  detM  =  —1  means  that  a  reflection  is 
involved. 


Rotations  in  2  Dimensions  In  Figure  7.4,  we  have  sketched  the  vector  r  = 
( x,y ),  and  the  vector  R  =  (X,Y)  which  is  the  vector  r  rotated  by  angle  9.  We 
write  in  matrix  form  the  equations  relating  the  components  of  r  and  R  (Problem  19) . 


(7.12) 


f cos  6  —  sin  6\  f x\ 

^sin  6  cos  9J  \  y )  ’ 


vector  rotated. 


In  Figure  7.5,  we  have  sketched  two  sets  of  axes  with  the  primed  axes  rotated  by 
angle  9  with  respect  to  the  unprimed  axes.  The  vector  r  =  (x,y),  and  the  vector 
r'  =  (x',  y')  are  the  same  vector,  but  with  components  relative  to  different  axes. 
These  components  are  related  by  the  equations  (Problem  20). 


(7.13) 


cos  9  sin#\  /x\ 
—  sin#  cos 9 )  \y )  ’ 


axes  rotated. 


Both  equations  (7.12)  and  equations  (7.13)  are  referred  to  as  “rotation  equations” 
and  the  9  matrices  are  called  “rotation  matrices” .  To  distinguish  them,  we  refer  to 
the  rotation  (7.12)  as  an  “active”  transformation  (vectors  rotated),  and  to  (7.13) 
as  a  “passive”  transformation  (vectors  not  moved  but  their  components  changed 
because  the  axes  are  rotated).  Equations  (7.7)  or  (7.13)  are  also  referred  to  as  a 
“change  of  basis”.  (Remember  that  we  called  i,  j,  k  unit  basis  vectors;  here  we 
have  changed  from  the  i,  j  ,  k  basis  to  the  i',  j',  k'  basis.  Also  see  Section  10.) 
Observe  that  the  matrices  in  (7.12)  and  (7.13)  are  inverses  of  each  other.  You  can 
see  from  the  figures  why  this  must  be  so.  The  rotation  of  a  vector  in,  say,  the 
counterclockwise  direction  produces  the  same  result  as  the  rotation  of  the  axes  in 
the  opposite  (clockwise)  direction. 
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We  note  that  clet  M  =  cos2  6  +  sin2  6  =  1  for  a  rotation  matrix.  Any  2  by  2 
orthogonal  matrix  with  determinant  1  corresponds  to  a  rotation,  and  any  2  by  2 
orthogonal  matrix  with  determinant  =  —  1  corresponds  to  a  reflection  through  a 
line. 


Example  3.  Find  what  transformation  corresponds  to  each  of  the  following  matrices. 


(7.14) 


C  =  AB,  D  =  BA. 


First  we  can  show  that  all  these  matrices  are  orthogonal,  and  that  det  A  =  1,  but 
the  determinants  of  the  other  three  are  —1  (Problem  21).  Thus  A  is  a  rotation 
and  B,  C  and  D  are  reflections.  Let’s  view  these  as  active  transformations  (fixed 
axes,  vectors  rotated  or  reflected).  Then  by  comparing  A  with  (7.12),  we  have 
cosd  =  —1/2,  sin 6  =  ,  so  this  is  a  rotation  of  240°  (or  —120°).  Alternatively, 

we  could  ask  what  happens  to  the  vector  i.  We  multiply  matrix  A  times  the  column 
matrix  ( J )  and  get 


1 

2 


v/3 

-1 


H-a)  “  -5(i+j'/5)' 


which  is  i  rotated  by  240°  as  we  had  before. 

Now  B  operating  on  (y)  leaves  x  fixed  and  changes  the  sign  of  y  (check  this); 
that  is,  B  corresponds  to  a  reflection  through  the  x  axis. 

We  find  C  =  AB  and  D  =  BA  by  multiplying  the  matrices  (Problem  21). 


(7.15) 


C  =  AB 


1 

2 


D  =  BA  = 


1 

2 


We  know  that  these  are  reflections  since  they  have  determinant  =  —  1.  To  find  the 
line  through  which  the  plane  is  reflected,  we  realize  that  the  vectors  along  that  line 
are  unchanged  by  the  reflection,  so  we  want  to  find  x  and  y,  that  is  vector  r,  which 
is  mapped  to  itself  by  the  transformation.  For  matrix  C  we  write  Cr  =  r. 


(7.16) 


-(-1 
2  V-\/3 


You  can  verify  (Problem  21)  that  the  two  equations  in  (7.16)  are  really  the  same 
equation,  namely  y  =  —x\/3.  Vectors  along  this  line,  say  i  —  jv^,  are  not  changed 
by  the  reflection  [see  (7.17)]  so  this  is  the  reflection  line.  As  further  verification  we 
can  show  [see  (7.17)]  that  a  vector  perpendicular  to  this  line,  say  i-s/3+j,  is  changed 
into  its  negative,  that  is,  it  is  reflected  through  the  line. 


(7.17) 


1  (  -1 


2 

1  (  -1 


2  I— \/3 


Comment:  The  solution  of  the  equation  Cr  =  r  is  an  example  of  an  eigenvalue, 
eigenvector  problem.  We  shall  discuss  such  problems  in  detail  in  Section  11. 

We  can  analyze  the  transformation  D  in  the  same  way  we  did  C  to  find  (Prob¬ 
lem  21)  that  the  reflection  line  is  y  =  x\/3.  Note  that  matrices  A  and  B  do  not 
commute  and  the  transformations  C  and  D  are  different. 
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Rotations  and  Reflections  in  3  Dimensions  Let’s  consider  3  by  3  orthogonal 
matrices  as  active  transformations  rotating  or  reflecting  vectors  r  =  ( x,  y,z ).  A 
simple  form  for  a  rotation  matrix  is 

(cos  9  —  sin  9  0\ 

sin  9  cos  9  0  j  . 

0  0  1 J 

You  should  satisfy  yourself  that  this  transformation  produces  a  rotation  of  vectors 
about  the  2  axis  through  angle  9.  We  can  then  find  the  rotation  angle  from  (7.12) 
as  we  did  in  2  dimensions.  Similarly  the  matrix 

(cos  9  —  sin  9  0\ 

sin#  cos  9  0  J 

0  0  -l) 

produces  a  rotation  about  the  z  axis  of  angle  9  together  with  a  reflection  through 
the  ( x,y )  plane,  and  again  we  can  find  the  rotation  angle  as  in  2  dimensions. 

We  will  show  in  Section  11  that  any  3  by  3  orthogonal  matrix  with  determinant 
=  1  can  be  written  in  the  form  (7.18)  by  choosing  the  2  axis  as  the  rotation  axis,  and 
any  3  by  3  orthogonal  matrix  with  determinant  =  —  1  can  be  written  in  the  form 
(7.19).  For  now,  let’s  look  at  a  few  simple  problems  we  can  do  just  by  considering 
how  the  matrix  maps  certain  vectors. 


Example  4. 

The  matrix  for  a  rotation  about  the 

y 

axis  is 

/  cos  9 

0 

sin  9' 

(7.20) 

F  =  0 

1 

0 

\  —  sin  9 

0 

cos  9 

You  should  satisfy  yourself  that  the  entry  —  sin  9  is  in  the  right  place  for  an  active 
transformation.  Let  9  =  90°;  then  the  matrix  F  in  (7.20)  maps  the  vector  i  =  (1,0,0) 
to  the  vector  — k  =  (0,  0,  —1);  this  is  correct  for  a  90°  rotation  around  the  y  axis. 
Check  that  (0,  0, 1)  is  mapped  to  (1,  0, 0). 


Example  5.  Find  the  mappings  produced  by  the  matrices 


(7.21) 


/ 0  0  l\  /  0  0  l\ 

G  =  0  -1  0  ,  K  =  -1  0  0 

\1  00/  \  0  -1  0/ 


First  we  find  that  the  determinants  are  1  so  these  are  rotations.  For  G,  either 
by  inspection  or  by  solving  Gr  =  r  as  in  (7.16),  we  find  that  the  vector  (1,0,1) 
is  unchanged  and  so  i  +  k  is  the  rotation  axis.  Now  G2  is  the  identity  matrix 
(corresponding  to  a  360°  rotation);  thus  the  rotation  angle  for  G  is  180°. 

Similarly  for  K,  we  find  that  the  vector  (1,  —1, 1)  is  unchanged  by  the  transfor¬ 
mation  so  i  —  j  +  k  is  the  rotation  axis.  Now  verify  that  K  maps  i  to  — j ,  and  —  j 
to  k,  and  k  to  i  (or,  alternatively  that  K3  is  the  identity  matrix)  so  the  rotation 
angle  for  K3  is  ±360°.  From  the  geometry  we  see  that  the  rotation  i  — >  —  j  — ■>  k  — >  i 
is  a  rotation  of  —120°  about  i  —  j  +  k.  (Also  see  Section  11.) 
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Example  6.  Find  the  mapping  produced  by  the  matrix 

/  0  -1  0 

L  =  -1  0  0 

\  0  0  1 

Since  detL  =  —1,  this  is  a  reflection  through  some  plane.  The  vector  perpendicular 
to  the  reflection  plane  is  reversed  by  the  reflection,  so  we  ask  for  a  vector  satisfying 
Lr  =  — r.  Either  by  solving  these  equations  or  by  inspection  we  find  r  =  (1, 1,0)  = 
i  +  j .  The  reflecting  plane  is  the  plane  through  the  origin  perpendicular  to  this 
vector,  that  is,  the  plane  x  +  y  =  0  (see  Section  5). 


PROBLEMS,  SECTION  7 

Are  the  following  linear  functions?  Prove  your  conclusions  by  showing  that  /( r)  satisfies 
both  of  the  equations  (7.1)  or  that  it  does  not  satisfy  at  least  one  of  them. 

1.  /( r)  =  A  •  r  +  3,  where  A  is  a  given  vector. 

2.  /(r)  =  A  •  (r  -  kz). 

3.  r  •  r. 


Are  the  following  linear  vector  functions?  Prove  your  conclusions  using  (7.2). 

4.  F(r)  =  r  —  ix  =  jy  +  k  z. 

5.  F(r)  =  A  X  r,  where  A  is  a  given  vector. 

6.  F(r)  =  r  +  A,  where  A  is  a  given  vector. 

Are  the  following  operators  linear? 


7.  Definite  integral  with  respect  to  x  from  0  to  1;  the  objects  being  operated  on  are 
functions  of  x. 


8.  Find  the  logarithm;  operate  on  positive  real  numbers. 

9.  Find  the  square;  operate  on  numbers  or  on  functions. 

10.  Find  the  reciprocal;  operate  on  numbers  or  on  functions. 


11. 

12. 

13. 


Find  the  absolute  value;  operate  on  complex  numbers. 


d  d2 

Let  D  stand  for  — ,  D2  for  - . 

dx  dx z 


d3 

D 3  =  —— ,  and  so  on.  Are  D,  D2 ,  D3  linear? 
dxA 


Operate  on  functions  of  x  which  can  be  differentiated  as  many  times  as  needed. 

(a)  As  in  Problem  12,  is  D2  +  2D  +  1  linear? 

(b)  Is  x2 D2  —  2 xD  +  7  a  linear  operator? 


14.  Find  the  maximum;  operate  on  functions  of  x. 

15.  Find  the  transpose;  operate  on  matrices. 


16.  Find  the  inverse;  operate  on  square  matrices. 

17.  Find  the  determinant;  operate  on  square  matrices. 
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18.  With  the  cross  product  of  two  vectors  defined  by  (4.14),  show  that  finding  the  cross 
product  is  a  linear  operation,  that  is,  show  that  (4.18)  is  valid.  Warning  hint:  Don’t 
try  to  prove  it  by  writing  out  components:  Writing,  for  example,  iAx  X  ( }By+\tBz )  = 
iAx  X  j By  +  i Ax  X  k Bz  would  be  assuming  what  you’re  trying  to  prove.  Further 
hints:  First  show  that  (4.18)  is  valid  if  B  and  C  are  both  perpendicular  to  A  by 
sketching  (in  the  plane  perpendicular  to  A)  the  vectors  B,  C,  B  +  C,  and  their 
vector  products  with  A.  Then  do  the  general  case  by  first  showing  that  A  X  B  and 
A  X  Bj^  (where  Bj_  is  the  vector  component  of  B  perpendicular  to  A)  have  the 
same  magnitude  and  the  same  direction. 

19.  If  we  multiply  a  complex  number  z  =  re ^  by  e*e,  we  get  e*ez  =  rel^+e\  that  is, 
a  complex  number  with  the  same  r  but  with  its  angle  increased  by  9.  We  can  say 
that  the  vector  r  from  the  origin  to  the  point  z  =  x  +  iy  has  been  rotated  by  angle  9 
as  in  Figure  7.4  to  become  the  vector  R  from  the  origin  to  the  point  Z  =  X  +  iY. 
Then  we  can  write  X  +  iY  =  el9z  =  el6(x  +  iy ).  Take  real  and  imaginary  parts  of 
this  equation  to  obtain  equations  (7.12). 

20.  Verify  equations  (7.13)  using  Figure  7.5.  Hints:  Write  r'  =  r  as  i'x'  +  j'y'  =  ix  +  j y 
and  take  the  dot  product  of  this  equation  with  i'  and  with  j'  to  get  x1  and  y' . 
Evaluate  the  dot  products  of  the  unit  vectors  in  terms  of  9  using  Figure  7.5.  For 
example,  i'  ■  j  is  the  cosine  of  the  angle  between  the  x '  axis  and  the  y  axis. 

21.  Do  the  details  of  Example  3  as  follows: 

(a)  Verify  that  the  four  matrices  in  (7.14)  are  all  orthogonal  and  verify  the  stated 
values  of  their  determinants. 

(b)  Verify  the  products  C  =  AB  and  D  =  BA  in  (7.15). 

(c)  Solve  (7.16)  to  find  the  reflection  line. 

(d)  Analyze  the  transformation  D  as  we  did  C. 

Let  each  of  the  following  matrices  represent  an  active  transformation  of  vectors  in  the 
(x,y)  plane  (axes  fixed,  vectors  rotated  or  reflected).  As  in  Example  3,  show  that  each 
matrix  is  orthogonal,  find  its  determinant,  and  find  the  rotation  angle,  or  find  the  line  of 
reflection. 


22. 

24. 

26. 


1  /  1  1 
1 


0  -1 

-1  0 


If 3  4 

5  U  -3 


23. 


25. 


27. 


1  (~V 3 


2  \ 

_  -1 

-A 

1  ( 

'  -1 

2^2\ 

3  \2\/2 

1  ) 

1 

-A 

\/2  V  1 

-J 

28.  Write  the  matrices  which  produce  a  rotation  9  about  the  x  axis,  or  that  rotation 
combined  with  a  reflection  through  the  ( y,z )  plane.  [Compare  (7.18)  and  (7.19)  for 
rotation  about  the  a  axis.] 

29.  Construct  the  matrix  corresponding  to  a  rotation  of  90°  about  the  y  axis  together 
with  a  reflection  through  the  (x,  z)  plane. 

30.  For  the  matrices  G  and  I<  in  (7.21),  find  the  matrices  R  =  GK  and  S  =  KG.  Note 
that  R  ^  S.  (In  3  dimensions,  rotations  about  two  different  axes  do  not  in  general 
commute.)  Find  what  geometric  transformations  are  produced  by  R  and  S. 
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31.  To  see  a  physical  example  of  non-commuting  rotations,  do  the  following  experiment. 
Put  a  book  on  your  desk  and  imagine  a  set  of  rectangular  axes  with  the  x  and  y 
axes  in  the  plane  of  the  desk  with  the  z  axis  vertical.  Place  the  book  in  the  first 
quadrant  with  the  x  and  y  axes  along  the  edges  of  the  book.  Rotate  the  book  90° 
about  the  x  axis  and  then  90°  about  the  z  axis;  note  its  position.  Now  repeat  the 
experiment,  this  time  rotating  90°  about  the  a  axis  first,  and  then  90°  about  the 
x  axis;  note  the  different  result.  Write  the  matrices  representing  the  90°  rotations 
and  multiply  them  in  both  orders.  In  each  case,  find  the  axis  and  angle  of  rotation. 


For  each  of  the  following  matrices,  find  its  determinant  to  see  whether  it  produces  a 
rotation  or  a  reflection.  If  a  rotation,  find  the  axis  and  angle  of  rotation.  If  a  reflection, 
find  the  reflecting  plane  and  the  rotation  (if  any)  about  the  normal  to  this  plane. 


32. 


/  0 

0 

-A 

/  ° 

0 

-A 

0 

-1 

0 

33. 

-1 

0 

0 

V-i 

0 

0/ 

V  0 

1 

0/ 

34. 


A 

0 

°\ 

o 

-l 

°\ 

0 

0 

-1 

35. 

1 

0 

0 

\o 

-1 

0/ 

Vo 

0 

-1/ 

8.  LINEAR  DEPENDENCE  AND  INDEPENDENCE 


We  say  that  the  three  vectors  A  =  i+j,  B  =  i  +  k,  and  C  =  2i  +  j  +  k  are 
linearly  dependent  because  A  +  B  —  C  =  0.  The  two  vectors  i  and  j  are  linearly 
independent  because  there  are  no  numbers  a  and  b  (not  both  zero)  such  that  the 
linear  combination  ai  +  bj  is  zero.  In  general,  a  set  of  vectors  is  linearly  dependent  if 
some  linear  combination  of  them  is  zero  (with  not  all  the  coefficients  equal  to  zero) . 
In  the  simple  examples  above,  it  was  easy  to  see  by  inspection  whether  the  vectors 
were  linearly  independent  or  not.  In  more  complicated  cases,  we  need  a  method  of 
determining  linear  dependence.  Consider  the  set  of  vectors 


(8.1) 


(1,4, -5),  (5, 2,1),  (2, -1,3), 


and  (3,  —6, 11); 


We  want  to  know  whether  they  are  linearly  dependent,  and  if  so,  we  want  to  find  a 
smaller  linearly  independent  set.  Let  us  row  reduce  the  matrix  whose  rows  are  the 
given  vectors  (see  Section  2): 


(1 

4 

-5\ 

(9 

0 

7\ 

5 

2 

1 

0 

-9 

13 

2 

-1 

3 

0 

0 

0 

\3 

-6 

n) 

V> 

0 

0/ 

In  row  reduction,  we  are  forming  linear  combinations  of  the  rows  by  elementary 
row  operations  [see  (2.8)].  All  these  operations  are  reversible,  so  we  could,  if  we 
liked,  reverse  our  calculations  and  combine  the  two  vectors  (9, 0,  7)  and  (0,  —9, 13) 
to  obtain  each  of  the  four  original  vectors  (Problem  1).  Thus  there  are  only  two 
independent  vectors  in  (8.1);  we  refer  to  these  independent  vectors  as  basis  vectors 
since  all  the  original  vectors  can  be  written  in  terms  of  them  (see  Section  10). 
Note  that  the  rank  (see  Section  2)  of  the  matrix  in  (8.2)  is  equal  to  the  number  of 
independent  or  basis  vectors. 
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Linear  Independence  of  Functions  By  a  definition  similar  to  that  for  vectors, 
we  say  that  the  functions  fi(x),  f2(x),  •  •  • ,  fn(x)  are  linearly  dependent  if  some 
linear  combination  of  them  is  identically  zero,  that  is,  if  there  are  constants  k\ ,  k2, 
•  •  • ,  kni  not  all  zero,  such  that 

(8.3)  kifx(x)  +  k2f2(x)  H - h  knfn(x)  =  0. 

For  example,  siu2  x  and  (1  —  cos2  x)  are  linearly  dependent  since 

sin2  x  —  (1  —  cos2  x)  =  0. 

But  sin  a’  and  cos  a;  are  linearly  independent  since  there  are  no  numbers  fci  and 
not  both  zero,  such  that 

(8.4)  fci  sin  x  +  cos  x 
is  zero  for  all  x  (Problem  8). 

We  shall  be  particularly  interested  in  knowing  that  a  given  set  of  functions  is 
linearly  independent.  For  this  purpose  the  following  theorem  is  useful  (Problems  8 
to  16,  and  Chapter  8,  Section  5). 


If  fi(x),  f2(x),  •  •  • ,  fn{x)  have  derivatives  of  order  n  —  1,  and  if  the  determinant 


fi(x) 

fz(x) 

fn(x) 

f[(x) 

fdx) 

fn(X ) 

(8.5) 

W  = 

fi'ix) 

••  /"(*) 

/l(n_1)(*) 

if'1}(  X)  ■ 

then  the  functions  are  linearly  independent.  (See  Problem  16.)  The  determi¬ 
nant  W  is  called  the  Wronskian  of  the  functions. 


Example  1.  Using  (8.5),  show  that  the  functions  1,  x,  sin  a;  are  linearly  independent. 
We  write  and  evaluate  the  Wronskian, 


W  = 


1 

0 

0 


x 

1 

0 


sin  x 
cos  a; 
—  sin  x 


—  sin  ax 


Since  —  sin  a;  is  not  identically  equal  to  zero,  the  functions  are  linearly  independent. 


Example  2.  Now  let’s  compute  the  Wronskian  for  a  case  when  the  functions  are  linearly 
dependent. 


X 

sinx 

2x  —  3  sin  x 

X 

sinx 

2x 

w  = 

1 

cosx 

2  —  3  cos  x 

= 

1 

cosx 

2 

=  (sin  x)  (2x  —  2x)  =  0, 

0 

—  sinx 

3  sin  a: 

0 

—  sinx 

0 

as  we  expected.  However,  note  that  “functions  dependent”  implies  W  =  0,  but 
W  =  0  does  not  necessarily  imply  “functions  dependent”.  (See  Problem  16.) 
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Homogeneous  Equations  In  Section  2  we  considered  sets  of  linear  equations. 
Here  we  want  to  consider  the  special  case  of  such  equations  when  the  constants  on 
the  right  hand  sides  are  all  zero;  these  are  called  homogeneous  equations.  We  write 
the  homogeneous  equations  corresponding  to  (2.12)  and  (2.13)  together  with  the 
row  reduced  matrices: 


i  o 
0  1 

1  1 
0  0 


We  can  draw  several  conclusions  from  these  examples.  Note  that  in  (8.6)  the  only 
solution  is  x  =  y  =  0;  the  rank  of  the  matrix  is  2,  the  same  as  the  number  of 
unknowns.  In  (8.7),  the  rank  of  the  matrix  is  1;  this  is  less  than  the  number  of 
unknowns.  This  reflects  what  we  could  see  in  (8.7),  that  we  really  have  just  one 
equation  in  two  unknowns;  all  the  points  on  a  line  satisfy  x  +  y  =  0.  In  (8.8)  we 
summarize  the  facts  for  homogeneous  equations: 


Homogeneous  equations  are  never  inconsistent;  they  always  have 
the  solution  “all  unknowns  =  0”  (often  called  the  “trivial  solu¬ 
tion”).  If  the  number  of  independent  equations  (that  is,  the  rank 
of  the  matrix)  is  the  same  as  the  number  of  unknowns,  this  is  the 
only  solution.  If  the  rank  of  the  matrix  is  less  than  the  number  of 
unknowns,  there  are  infinitely  many  solutions. 


A  very  important  special  case  is  a  set  of  n  homogeneous  equations  in  n  unknowns. 
By  (8.8),  these  equations  have  only  the  trivial  solution  unless  the  rank  of  the  matrix 
is  less  than  n.  This  means  that  at  least  one  row  of  the  row  reduced  n  by  n  matrix  of 
the  coefficients  is  a  zero  row.  But  then  the  determinant  D  of  the  coefficients  is  zero. 
Thus  we  have  an  important  result  (see  Problems  21  to  25;  also  see  Section  11): 


A  system  of  n  homogeneous  equations  in  n  unknowns  has  solutions 
(8.9)  other  than  the  trivial  solution  if  and  only  if  the  determinant  of  the 
coefficients  is  zero. 


Solutions  in  Vector  Form  Geometrically,  solutions  of  sets  of  linear  equations 
may  be  points  or  lines  or  planes. 

Example  3.  In  Section  2,  Example  4,  we  solved  equations  (2.15): 

(8.10)  £  =  3  +  2  z,  y  =  A  —  z. 

This  solution  set  consists  of  all  points  on  the  line  which  is  the  intersection  of  these 
two  planes.  An  interesting  way  to  write  the  solution  is  the  vector  form 


(8.11) 


r  =  (x,  y,  z)  =  (3  +  2z,  4  -z,z)  =  (3, 4, 0)  +  (2,  -1, 1  )z. 


Section  8 


Linear  Dependence  and  Independence  135 


If  we  put  z  =  t,  this  is  the  parametric  form  of  the  equations  of  a  straight  line, 
r  =  r0  +  At  [see  (5.8)]. 

Now  let’s  consider  the  homogeneous  equations  (zero  right  hand  sides)  corre¬ 
sponding  to  equations  (2.15).  The  equations  and  the  row  reduced  matrix  are: 


(8.12) 


/  1 

1 

2 

-1 

-5 

4 

V  3 

-1 

so  the  solutions  are 


M 

0 

0  ’ 

w 


/ 1  0  -2  0\ 

0  1  10 
0  0  0  0’ 
\0  0  0  0/ 


(8.13) 


x  =  2z,  y  =  —z,  or  r  =  (2,  — 1,1  )z. 


Comparing  (8.11)  and  (8.13),  we  see  that  the  solution  of  the  homogeneous  equations 
Mr  =  0  is  a  straight  line  through  the  origin;  the  solution  of  the  equations  Mr  =  k 
is  a  parallel  straight  line  through  the  point  (3, 4, 0).  We  could  say  that  the  solution 
of  Mr  =  k  is  the  solution  of  the  corresponding  homogeneous  equations  plus  the 
particular  solution  r  =  (3,4,0). 

Here  is  an  example  of  an  important  use  of  (8.9). 


Example  4.  For  what  values  of  A  does  the  following  set  of  equations  have  nontrivial 
solutions  for  x  and  y?  For  each  value  of  A  find  the  corresponding  relation  between  x 
and  y.  This  is  an  example  of  an  eigenvalue  problem;  we  shall  discuss  such  problems 
in  detail  in  Sections  11  and  12.  The  values  of  A  are  called  eigenvalues  and  the 
corresponding  vectors  (x,  y)  are  called  eigenvectors. 


(8.14) 


f  (1  -  A)x  +  2y  =  0, 
\  2x  +  (4  —  X)y  =  0. 


By  (8.9),  we  set  the  determinant  M  of  the  coefficients  equal  to  zero.  Then  we  solve 
for  A,  and  for  each  value  of  A  we  solve  for  x  and  y. 


1- A 
2 


2 

4- A 


=  A2  -  5A  +  4  -  4  =  A(A  -  5)  =  0, 


A  =  0,5. 


For  A  =  0,  we  find  x  +  2y  =  0.  For  A  =  5,  we  find  2x  —  y  =  0.  In  vector  notation 
the  eigenvectors  are:  For  A  =  0,  r  =  (2,  —  l)s,  and  for  A  =  5,  r  =  (l,2)f,  where  s 
and  t  are  parameters  in  these  vector  equations  of  straight  lines  through  the  origin. 


PROBLEMS,  SECTION  8 

1.  Write  each  of  the  vectors  (8.1)  as  a  linear  combination  of  the  vectors  (9,0,7)  and 
(0,-9, 13).  Hint:  To  get  the  right  x  component  in  (1,4, —5),  you  have  to  use 
( 1/ 9) (9,  0,  7).  How  do  you  get  the  right  y  component?  Is  the  z  component  now 
correct? 

In  Problems  2  to  4,  find  out  whether  the  given  vectors  are  dependent  or  independent;  if 
they  are  dependent,  find  a  linearly  independent  subset.  Write  each  of  the  given  vectors  as 
a  linear  combination  of  the  independent  vectors. 

2.  (1,-2,  3),  (1,1,1),  (-2, 1,-4),  (3,0,5) 
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3.  (0,1,1),  (-1,5,3),  (1,0,2),  (2, -15,1) 

4.  (3, 5,-1),  (1,4,2),  (-1,0,5),  (6,14,5) 

5.  Show  that  any  vector  V  in  a  plane  can  be  written  as  a  linear  combination  of  two 
non-parallel  vectors  A  and  B  in  the  plane;  that  is,  find  a  and  b  so  that  V  =  oA  +  &B. 
Hint:  Find  the  cross  products  A  X  V  and  B  X  V;  what  are  A  X  A  and  B  X  B? 
Take  components  perpendicular  to  the  plane  to  show  that 

_  (B  X  V)  -n 
°  (B  X  A)  •  n 

where  n  is  normal  to  the  plane,  and  a  similar  formula  for  b. 

6.  Use  Problem  5  to  write  V  =  3i  +  5j  as  a  linear  combination  of  A  =  2i  +  j  and 
B  =  3i  —  2j.  Show  that  the  formulas  in  Problem  5,  written  as  a  quotient  of  2  by  2 
determinants,  are  just  the  Cramer’s  rule  solution  of  simultaneous  equations  for  a 
and  b. 

7.  As  in  Problem  6,  write  V  =  4i  —  5j  in  terms  of  the  basis  vectors  i  —  4j  and  5i  +  2j. 
In  Problems  8  to  15,  use  (8.5)  to  show  that  the  given  functions  are  linearly  independent. 


8. 

sinx,  cos  x 

9. 

ix 

e  ,  sin  x 

10. 

X  X 

x,  e  ,  xe 

11. 

sinx,  cosx,  xsinx,  xcosx 

12. 

1,  X2 ,  X4,  X6 

13. 

sinx,  sin2x 

14. 

ix  —ix 

e  ,  e 

15. 

x  ix  i 

e  ,  e  ,  coshx 

16. 

(a)  Prove  that  if  the  Wronskian  (8.5) 

is  not  identically  zero,  then  the  functions  /1 , 

/2,  . . . ,  fn  are  linearly  independent.  Note  that  this  is  equivalent  to  proving 
that  if  the  functions  are  linearly  dependent,  then  W  is  identically  zero.  Hints: 
Suppose  (8.3)  were  true;  you  want  to  find  the  fc’s.  Differentiate  (8.3)  repeatedly 
until  you  have  a  set  of  n  equations  for  the  n  unknown  fc’s.  Then  use  (8.9). 

(b)  In  part  (a)  you  proved  that  if  W  ^  0,  then  the  functions  are  linearly  inde¬ 
pendent.  You  might  think  that  if  IT  =  0,  the  functions  would  be  linearly 
dependent.  This  is  not  necessarily  true;  if  W  =  0,  the  functions  might  be 
either  dependent  or  independent.  For  example,  consider  the  functions  x 3  and 
| a:3 1  on  the  interval  (—1, 1).  Show  that  W  =  0,  but  the  functions  are  not 
linearly  dependent  on  (—1,1).  (Sketch  them.)  On  the  other  hand,  they  are 
linearly  dependent  (in  fact  identical)  on  (0, 1). 

In  Problems  17  to  20,  solve  the  sets  of  homogeneous  equations  by  row  reducing  the  matrix. 


(  x  —  2y  +  3z  =  0 
17.  <  x  +  Ay  —  6z  =  0 

(  2x  +  2y  —  3z  =  0 


(  2x  +  3z  =  0 
18.  <  4x  +  2y  +  5z  =  0 

^  *  —  y  +  2z  =  0 


'  3x  +  y  +  3z  +  6w  =  0 
Ax  —  7y  —  3z  +  5  w  =  0 
x  +  3y  +  Az  —  3w  =  0 
3x  +  2z  +  7w  =  0 


20. 


"  2x  —  3y  +  5z  =  0 
x  +  2y  —  z  =  0 
x  —  5y  +  6z  =  0 
t  4x  +  y  +  3z  =  0 


21.  Find  a  condition  for  four  points  in  space  to  lie  in  a  plane.  Your  answer  should  be 
in  the  form  a  determinant  which  must  be  equal  to  zero.  Hint:  The  equation  of  a 
plane  is  of  the  form  ax  +  by  +  cz  =  d,  where  a,  b,  c,  d  are  constants.  The  four  points 
(*i,  j/i,  zi),  {x2,V2,  Z2),  etc.,  are  all  to  satisfy  this  equation.  When  can  you  find  a, 
b,  c,  d  not  all  zero? 
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22.  Find  a  condition  for  three  lines  in  a  plane  to  intersect  in  one  point.  Hint:  See 
Problem  21.  Write  the  equation  of  a  line  as  ax  +  by  =  c.  Assume  that  no  two  of  the 
lines  are  parallel. 

Using  (8.9),  find  the  values  of  A  such  that  the  following  equations  have  nontrivial  solutions, 
and  for  each  A,  solve  the  equations.  (See  Example  4.) 


(4  -  A)*  —  2y  =  0 
-2x  +  (7  -  X)y  =  0 


(6  —  A)*  +  3y  =  0 
3*  —  (2  +  A  )y  =  0 


(  — (1  +  A)x  +  y  +  3z  —  0, 

25.  I  x+  (2  -  A )y  =  0, 

(  3*  +  (2  —  A  )z  =  0. 

For  each  of  the  following,  write  the  solution  in  vector  form  [see  (8.11)  and  (8.13)]. 


26. 


"  2x  —  3y  +  5z 
x  +  2  y  —  z 
x  —  5y  +  &z 
s  4x  +  y  +  3z 


3 

5 

-2 

13 


|  x  —  y  +  2z  =  3 
27.  I  —2x  +  2 y  -  z  =  0 
[  4x  —  4y  +  5z  =  6 


{2x  +  y  —  5z  =  7 
x  -  2  y  =1 

3*  —  5  y  —  z  =  4 


9.  SPECIAL  MATRICES  AND  FORMULAS 

In  this  section  we  want  to  discuss  various  terms  used  in  work  with  matrices,  and 


prove  some  important  formulas.  First  we  list  for  reference  needed  definitions  and 
facts  about  matrices. 

There  are  several  special  matrices  which  are  related  to  a  given  matrix  A.  We 
outline  in  (9.1)  what  these  matrices  are  called,  what  notations  are  used  for  them, 
and  how  we  get  them  from  A. 

(9.1)  Name  of  Matrix 

Notations  for  it 

How  to  get  it  from  A 

Transpose  of  A,  or 

A  transpose 

AT  or  A  or  A'  or  A4 

Interchange  rows  and 
columns  in  A. 

Complex  conjugate 
of  A 

A  or  A* 

Take  the  complex 
conjugate  of  each 
element. 

Transpose  conjugate, 
Hermitian  conjugate, 
adjoint  (Problem  9), 
Hermitian  adjoint. 

A'  (A  dagger) 

Take  the  complex 
conjugate  of  each 
element  and  transpose. 

Inverse  of  A 

A"1 

See  Formula  (6.13). 

There  is  another  set  of  names  for  special  types  of  matrices.  In  (9.2),  we  list 
these  and  their  definitions  for  reference. 
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(9.2) 

A  matrix  is  called 

if  it  satisfies  the  condition(s) 

real 

A  =  A 

symmetric 

A  =  AT,  A  real 
(matrix  =  its  transpose) 

skew-symmetric  or 
antisymmetric 

A  =  —  AT,  A  real 

orthogonal 

A-1  =  AT,  A  real 
(inverse  =  transpose) 

pure  imaginary 

A  =  -A 

Hermitian 

A  =  At 

(matrix  =  its  transpose 
conjugate) 

anti-Hermitian 

A  =  -At 

unitary 

A"1  =  At 

(inverse  =  transpose 
conjugate) 

normal 

AAf  =  At  A 

(A  and  At  commute) 

Now  let’s  consider  some  examples  and  proofs  using  these  terms. 

Index  Notation  We  are  going  to  need  index  notation  in  some  of  our  work  below, 
so  for  reference  we  restate  the  rule  in  (6.2b)  for  matrix  multiplication. 

(9.3)  (AB  )ij  =  YJAikBk:j. 

k 

Study  carefully  the  index  notation  for  “row  times  column”  multiplication.  To  find 
the  element  in  row  i  and  column  j  of  the  product  matrix  AB,  we  multiply  row  i 
of  A  times  column  j  of  B.  Note  that  the  fc’s  (the  sum  is  over  k)  are  next  to  each 
other  in  (9.3).  If  we  should  happen  to  have  Yk^kjAik,  we  should  rewrite  it  as 
Yk  AikBkj  (with  the  fc’s  next  to  each  other)  to  recognize  it  as  an  element  of  the 
matrix  AB  (not  BA).  We  will  see  an  example  of  this  in  (9.10)  below. 

Kronecker  6  The  Kronecker  5  is  defined  by 

<9-4> 

For  example,  An  =  1,  5 12  =  0,  622  =  1,  A31  =  0,  and  so  on.  In  this  notation  a  unit 
matrix  is  one  whose  elements  are  Sij  and  we  can  write 

I  =  {Sij)- 


(9.5) 
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(Also  see  Chapter  10,  Section  5.)  The  Kronecker  6  notation  is  useful  for  other 
purposes.  For  example,  since  (for  positive  integers  m  and  n ) 


(9.6a) 


cos  nx  cos  mx  dx  = 


7 r,  if  m  =  n, 
0,  if  m  ^  n, 


we  can  write 
(9.6b) 


cos  nx  cos  mx  dx 


7T 


■  s 


nm  • 


This  is  the  same  as  (9.6a)  because  Snm  =  0  if  m  ^  n,  and  dnm  =  1  if  m  =  n. 

Using  the  Kronecker  6,  we  can  give  a  formal  proof  that  for  any  matrix  M  and  a 
conformable  unit  matrix  I,  the  product  of  I  and  M  is  just  M.  Using  index  notation 
and  equations  (9.3)  and  (9.4),  we  have 


(9.7)  (IM)y  -  SikMkj  =  Mij  or  IM  =  M 

k 

since  Sik  =  0  unless  k  =  i. 


More  Useful  Theorems  Let’s  use  index  notation  to  prove  the  associative  law 
for  matrix  multiplication,  that  is 

(9.8)  A(BC)  =  (AB)C  =  ABC. 

First  we  write  (BC)fcj  =  YZi  BkiCij.  Then  we  have 

(9.9)  [A(BC)]y  =  Y  Aik( BC)fcj  =  ^  Aik  Y  BkiCtj 

k  k  l 

=  EE  AikBuCij  =  (ABC)y 

k  i 

which  is  the  index  notation  for  A(BC)  =  ABC  as  in  (9.8).  We  can  prove  (AB)C  = 
ABC  in  a  similar  way  (Problem  1). 

In  formulas  we  may  want  the  transpose  of  the  product  of  two  matrices.  First 
note  that  Ajk  =  Aki  [see  (2.1)  or  (9.1)].  Then 

(AB)T  =  (AB)fcj  =  YAkjBji  =  YAJkB7j 

3  3 

(9-10)  =YBIjAJk  =  (BTAT)ifc,  or, 

3 

(AB)t  =  btat. 

The  theorem  applies  to  a  product  of  any  number  of  matrices  (see  Problem  8b).  For 
example 


(9.11)  (ABCD)t  =  DtCtBtAt. 

The  transpose  of  a  product  of  matrices  is  equal  to  the  product  of  the  transposes 
in  reverse  order. 
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A  similar  theorem  is  true  for  the  inverse  of  a  product  (see  Section  6,  Problem  18). 


(9.12)  (ABCD)-1  =  D-1C-1B-1A-1. 

The  inverse  of  a  product  of  matrices  is  equal  to  the  product  of  the  inverses  in 
reverse  order. 


Trace  of  a  Matrix  The  trace  (or  spur)  or  a  square  matrix  A  (written  Tr  A)  is 
the  sum  of  the  elements  on  the  main  diagonal.  Thus  the  trace  of  a  unit  n  by  n 
matrix  is  rz,  and  the  trace  of  the  matrix  M  in  (6.10)  is  6.  It  is  a  theorem  that  the 
trace  of  a  product  of  matrices  is  not  changed  by  permuting  them  in  cyclic  order. 
For  example 

(9.13)  Tr(ABC)  =  Tr(BCA)  =  Tr(CAB). 

We  can  prove  this  as  follows: 

Tr(ABC)  =  5>BC)ii  =  EEE- a  ij  B3kCki 

i  i  j  k 

=  EEE  BjkCkiAij  =  Tr(BCA) 

i  j  k 

=  EEE  CkiAijBjk  =  Tr(CAB). 

i  j  k 

Warning:  Tr(ABC)  is  not  equal  to  Tr(ACB)  in  general. 

Theorem:  If  H  is  a  Hermitian  matrix,  then  U  =  e*H  is  a  unitary  matrix.  (This 
is  an  important  relation  in  quantum  mechanics.)  By  (9.2)  we  need  to  prove  that 
=  U-1  if  Ht  =  H.  First,  e*He_lH  =  e*H_lH  since  H  commutes  with  itself — see 
Problem  6.29.  But  this  is  e°  which  is  the  unit  matrix  [see  Section  6]  so  U-1  =  e~lH. 
To  find  td  =  (elH)^,  we  expand  U  =  e‘H  in  a  power  series  to  get  U  =  ]T)fe(z H)k/k! 
and  then  take  the  transpose  conjugate.  To  do  this  we  just  need  to  realize  that  the 
transpose  of  a  sum  of  matrices  is  the  sum  of  the  transposes,  and  that  the  transpose 
of  a  power  of  a  matrix,  say  (M")T  is  equal  to  (MT)n  (Problem  9.21).  Also  recall 
from  Chapter  2  that  you  find  the  complex  conjugate  of  an  expression  by  changing 
the  signs  of  all  the  i’s.  This  means  that  (zHp  =  — zH^  =  — zH  since  H  is  Hermitian. 
Then  summing  the  series  we  get  =  e~tH,  which  is  just  what  we  found  for  IW1 
above.  Thus  =  V  so  U  is  a  unitary  matrix.  (Also  see  Problem  11.61.) 

PROBLEMS,  SECTION  9 

1.  Use  index  notation  as  in  (9.9)  to  prove  the  second  part  of  the  associative  law  for 
matrix  multiplication:  (AB)C  =  ABC. 

2.  Use  index  notation  to  prove  the  distributive  law  for  matrix  multiplication,  namely: 
A(B  +  C)  =  AB  +  AC. 
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3.  Given  the  following  matrix,  find  the  transpose,  the  inverse,  the  complex  conjugate, 
and  the  transpose  conjugate  of  A.  Verify  that  AA-1  =  A-1A  =  the  unit  matrix. 

/  1  0  5i\ 

A  =  — 2i  2  0  , 

V  1  1 +*  0/ 

4.  Repeat  Problem  3  given 

/  0  2  i  -1\ 

A  =  j  —i  2  0  1. 

V  3  0  0/ 

5.  Show  that  the  product  AAT  is  a  symmetric  matrix. 

6.  Give  numerical  examples  of:  a  symmetric  matrix;  a  skew-symmetric  matrix;  a  real 
matrix;  a  pure  imaginary  matrix. 

7.  Write  each  of  the  items  in  the  second  column  of  (9.2)  in  index  notation. 

8.  (a)  Prove  that  (AB)^  =  B^A^.  Hint:  See  (9.10). 

(b)  Verify  (9.11),  that  is,  show  that  (9.10)  applies  to  a  product  of  any  number  of 
matrices.  Hint:  Use  (9.10)  and  (9.8). 

9.  In  (9.1)  we  have  defined  the  adjoint  of  a  matrix  as  the  transpose  conjugate.  This  is 
the  usual  definition  except  in  algebra  where  the  adjoint  is  defined  as  the  transposed 
matrix  of  cofactors  [see  (6.13)].  Show  that  the  two  definitions  are  the  same  for  a 
unitary  matrix  with  determinant  =  +1. 

10.  Show  that  if  a  matrix  is  orthogonal  and  its  determinant  is  +1,  then  each  element 
of  the  matrix  is  equal  to  its  own  cofactor.  Hint:  Use  (6.13)  and  the  definition  of  an 
orthogonal  matrix. 

11.  Show  that  a  real  Hermitian  matrix  is  symmetric.  Show  that  a  real  unitary  matrix  is 
orthogonal.  Note:  Thus  we  see  that  Hermitian  is  the  complex  analogue  of  symmetric, 
and  unitary  is  the  complex  analogue  of  orthogonal.  (See  Section  11.) 

12.  Show  that  the  definition  of  a  Hermitian  matrix  (A  =  A1)  can  be  written  a,ij  =  aji 
(that  is,  the  diagonal  elements  are  real  and  the  other  elements  have  the  property 
that  ai2  =  a2i,  etc.).  Construct  an  example  of  a  Hermitian  matrix. 

13.  Show  that  the  following  matrix  is  a  unitary  matrix. 

\l+W 3)/4  ^L(l  +  *)> 

V27f(1  +  i)  (V5  +  i)/4y 

14.  Use  (9.11)  and  (9.12)  to  simplify  (ABTC)T,  (C-1MC)-1,  (AH)-1  (AHA-1)3 (HA-1)-1. 

15.  (a)  Show  that  the  Pauli  spin  matrices  (Problem  6.6)  are  Hermitian. 

(b)  Show  that  the  Pauli  spin  matrices  satisfy  the  Jacobi  identity  [A,  [B,  C]]  + 
[B,  [C,  A]]  +  [C,  [A,  B]]  =  0  where  [A,  B]  is  the  commutator  of  A,  B  [see  (6.3)]. 

(c)  Generalize  (b)  to  prove  the  Jacobi  identity  for  any  (conformable)  matrices  A, 
B,  C.  Also  see  Chapter  6,  Problem  3.14. 

16.  Let  Cij  =  (—iy+J Mij  be  the  cofactor  of  element  aij  in  the  determinant  A.  Show 
that  the  statement  of  Laplace’s  development  and  the  statement  of  Problem  3.8  can 
be  combined  in  the  equations 

^  ]  aijCkj  —  J/A  ■  det  A,  or  ^  ]  OijCik  —  J; *  det  A. 
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17.  (a)  Show  that  if  A  and  B  are  symmetric,  then  AB  is  not  symmetric  unless  A  and 

B  commute. 

(b)  Show  that  a  product  of  orthogonal  matrices  is  orthogonal. 

(c)  Show  that  if  A  and  B  are  Hermitian,  then  AB  is  not  Hermitian  unless  A  and 
B  commute. 

(d)  Show  that  a  product  of  unitary  matrices  is  unitary. 

18.  If  A  and  B  are  symmetric  matrices,  show  that  their  commutator  is  antisymmetric 
[see  equation  (6.3)]. 

19.  (a)  Prove  that  Tr(AB)  =  Tr(BA).  Hint:  See  proof  of  (9.13). 

(b)  Construct  matrices  A,  B,  C  for  which  Tr(ABC)  ^  Tr(CBA),  but  verify  that 
Tr(ABC)  =  Tr(CAB). 

(c)  If  S  is  a  symmetric  matrix  and  A  is  an  antisymmetric  matrix,  show  that 
Tr(SA)  =  0.  Hint:  Consider  Tr(SA)T  and  prove  that  Tr(SA)  =  —  Tr(SA). 

20.  Show  that  the  determinant  of  a  unitary  matrix  is  a  complex  number  with  absolute 
value  =  1.  Hint:  See  proof  of  equation  (7.11). 

21.  Show  that  the  transpose  of  a  sum  of  matrices  is  equal  to  the  sum  of  the  transposes. 
Also  show  that  (M")T  =  (MT)n.  Hint:  Use  (9.11)  and  (9.8). 

22.  Show  that  a  unitary  matrix  is  a  normal  matrix,  that  is,  that  it  commutes  with  its 
transpose  conjugate  [see  (9.2)].  Also  show  that  orthogonal,  symmetric,  antisymmet¬ 
ric,  Hermitian,  and  anti-Hermitian  matrices  are  normal. 

23.  Show  that  the  following  matrices  are  Hermitian  whether  A  is  Hermitian  or  not: 
AAt,  A  + Af,  i(A- Af). 

24.  Show  that  an  orthogonal  transformation  preserves  the  length  of  vectors.  Hint:  If  r  is 
the  column  matrix  of  vector  r  [see  (6.10)],  write  out  rTr  to  show  that  it  is  the  square 
of  the  length  of  r.  Similarly  RTR  =  |R|2  and  you  want  to  show  that  | R| 2  =  |r|2, 
that  is,  RtR  =  rTr  if  R  =  Mr  and  M  is  orthogonal.  Use  (9.11). 

25.  (a)  Show  that  the  inverse  of  an  orthogonal  matrix  is  orthogonal.  Hint:  Let  A  = 

O-1;  from  (9.2),  write  the  condition  for  O  to  be  orthogonal  and  show  that  A 
satisfies  it. 

(b)  Show  that  the  inverse  of  a  unitary  matrix  is  unitary.  See  hint  in  (a). 

(c)  If  H  is  Hermitian  and  U  is  unitary,  show  that  U_1HU  is  Hermitian. 

10.  LINEAR  VECTOR  SPACES 

We  have  used  extensively  the  vector  r  =  ix+jy+kz  to  mean  a  vector  from  the  origin 
to  the  point  (. x,y,z ).  There  is  a  one-to-one  correspondence  between  the  vectors  r 
and  the  points  (x,y,z)\  the  collection  of  all  such  points  or  all  such  vectors  makes 
up  the  3-dimensional  space  often  called  R3  (R  for  real)  or  V3  (V  for  vector)  or  E3 
(E  for  Euclidean).  Similarly,  we  can  consider  a  2-dimensional  space  V2  of  vectors 
r  =  ix  +  jy  or  points  (x,y)  making  up  the  ( x,y )  plane.  V2  might  also  mean  any 
plane  through  the  origin.  And  V\  means  all  the  vectors  from  the  origin  to  points 
on  some  line  through  the  origin. 

We  also  use  x,  y,  z  to  mean  the  variables  or  unknowns  in  a  problem.  Now 
applied  problems  often  involve  more  than  three  variables.  By  extension  of  the  idea 
of  V3,  it  is  convenient  to  call  an  ordered  set  of  n  numbers  a  point  or  vector  in  the 
n-dimensional  space  Vn.  For  example,  the  4- vectors  of  special  relativity  are  ordered 
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sets  of  four  numbers;  we  say  that  space-time  is  4-dimensional.  A  point  of  the  phase 
space  used  in  classical  and  quantum  mechanics  is  an  ordered  set  of  six  numbers, 
the  three  components  of  the  position  of  a  particle  and  the  three  components  of  its 
momentum;  thus  the  phase  space  of  a  particle  is  the  6-dimensional  space  Ve¬ 
in  such  cases,  we  can’t  represent  the  variables  as  coordinates  of  a  point  in  physical 
space  since  physical  space  has  only  three  dimensions.  But  it  is  convenient  and 
customary  to  extend  our  geometrical  terminology  anyway.  Thus  we  use  the  terms 
variables  and  coordinates  interchangeably  and  speak,  for  example,  of  a  “point  in  5- 
dimensional  space,”  meaning  an  ordered  set  of  values  of  five  variables,  and  similarly 
for  any  number  of  variables.  In  three  dimensions,  we  think  of  the  coordinates  of  a 
point  as  the  components  of  a  vector  from  the  origin  to  the  point.  By  analogy,  we 
call  an  ordered  set  of  five  numbers  a  “vector  in  5-dimensional  space”  or  an  ordered 
set  of  n  numbers  a  “vector  in  n-dimensional  space.” 

Much  of  the  geometrical  terminology  which  is  familiar  in  two  and  three  dimen¬ 
sions  can  be  extended  to  problems  in  n  dimensions  (that  is,  n  variables)  by  using 
the  algebra  which  parallels  the  geometry.  For  example,  the  distance  from  the  origin 
to  the  point  (x,  y ,  z)  is  \J  x2  +  y2  +  z2.  By  analogy  in  a  problem  in  the  five  vari¬ 
ables  x,  y,  z,  u,  v ,  we  define  the  distance  from  the  origin  (0,0, 0,0,0)  to  the  point 
(. x,y,z,u,v )  as  \J  x2  +  y2  +  z2  +  u 2  +  v2.  By  using  the  algebra  which  goes  with 
the  geometry,  we  can  easily  extend  such  ideas  as  the  length  of  a  vector,  the  dot 
product  of  two  vectors,  and  therefore  the  angle  between  the  vectors  and  the  idea  of 
orthogonality,  etc.  We  saw  in  Section  7,  that  an  orthogonal  transformation  in  two 
or  three  dimensions  corresponds  to  a  rotation.  Thus  we  might  say,  in  a  problem  in 
n  variables,  that  a  linear  transformation  (that  is  a  linear  change  of  variables)  satis¬ 
fying  “sum  of  squares  of  new  variables  =  sum  of  squares  of  old  variables”  [compare 
(7.8)]  corresponds  to  a  “rotation  in  n-dimensional  space." 

Example  1.  Find  the  distance  between  the  points  (3, 0,  5,  —2, 1)  and  (0, 1,  —2,  3, 0). 

Generalizing  what  we  would  do  in  three  dimensions,  we  find  d 2  =  (3  —  0)2  + 
(0  -  l)2  +  (5  +  2)2  +  (-2  -  3)2  +  (1  -  0)2  =  9  +  1  +  49  +  25  +  1  =  85,  d  =  V85. 

If  we  start  with  several  vectors,  and  find  linear  combinations  of  them  in  the 
algebraic  way  (by  components),  then  we  say  that  the  original  set  of  vectors  and  all 
their  linear  combinations  form  a  linear  vector  space  (or  just  vector  space  or  linear 
space  or  space).  Note  that  if  r  is  one  of  our  original  vectors,  then  r  —  r  =  0  is  one 
of  the  linear  combinations;  thus  the  zero  vector  (that  is,  the  origin)  must  be  a  point 
in  every  vector  space.  A  line  or  plane  not  passing  through  the  origin  is  not  a  vector 
space. 

Subspace,  Span,  Basis,  Dimension  Suppose  we  start  with  the  four  vectors  in 
(8.1).  We  showed  in  (8.2)  that  they  are  all  linear  combinations  of  the  two  vectors 
(9,  0,  7)  and  (0,  —9, 13).  Now  two  linearly  independent  vectors  (remember  their  tails 
are  at  the  origin)  determine  a  plane;  all  linear  combinations  of  the  two  vectors  lie 
in  the  plane.  [The  plane  we  are  talking  about  in  this  example  is  the  plane  through 
the  three  points  (9,0,7),  (0,-9, 13),  and  the  origin.]  Since  all  the  vectors  making 
up  this  plane  are  also  part  of  3-dimensional  space  V3,  we  call  V2  a  subspace 
of  V3.  Similarly  any  line  lying  in  this  plane  and  passing  through  the  origin  is  a 
subspace  of  V2  and  of  V3.  We  say  that  either  the  original  four  vectors  or  the  two 
independent  ones  span  the  space  V2;  a  set  of  vectors  spans  a  space  if  all  the  vectors 
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in  the  space  can  be  written  as  linear  combinations  of  the  spanning  set.  A  set  of 
linearly  independent  vectors  which  span  a  vector  space  is  called  a  basis.  Here  the 
vectors  (9,0,7)  and  (0,-9, 13)  are  one  possible  choice  as  a  basis  for  the  space  V2; 
another  choice  would  be  any  two  of  the  original  vectors  since  in  (8.2)  no  two  of  the 
vectors  are  dependent. 

The  dimension  of  a  vector  space  is  equal  to  the  number  of  basis  vectors.  Note 
that  this  statement  implies  (correctly — see  Problem  8)  that  no  matter  how  you  pick 
the  basis  vectors  for  a  given  vector  space,  there  will  always  be  the  same  number 
of  them.  This  number  is  the  dimension  of  the  space.  In  3  dimensions,  we  have 
frequently  used  the  unit  basis  vectors  i,  j,  k  which  can  also  be  written  as  (1,0,0), 
(0,1,0),  (0,0,1).  Then  in,  say  5  dimensions,  a  corresponding  set  of  unit  basis 
vectors  would  be  (1,0,  0,0,0),  (0,1, 0,0,0),  (0,0, 1,0,0),  (0,0, 0,1,0),  (0,0,  0,0,1). 
You  should  satisfy  yourself  that  these  five  vectors  are  linearly  independent  and  span 
a  5  dimensional  space. 

Example  2.  Find  the  dimension  of  the  space  spanned  by  the  following  vectors,  and  a  basis 
for  the  space:  (1, 0, 1,  5,  -2),  (0, 1,0,6,  -3),  (2,  -1,  2, 4, 1),  (3, 0,  3, 15,  -6). 

We  write  the  matrix  whose  rows  are  the  components  of  the  vectors  and  row 
reduce  it  to  find  that  there  are  three  linearly  independent  vectors:  (1,0, 1,5,0), 
(0,1, 0,6,0),  (0,0, 0,0,1).  These  three  vectors  are  a  basis  for  the  space  which  is 
therefore  3-dimensional. 

Inner  Product,  Norm,  Orthogonality  Recall  from  (4.10)  that  the  scalar  (or 
dot  or  inner)  product  of  two  vectors  A  =  (A3,  A2,  A3)  and  B  =  {B3,B2lB3)  is 
A\Bi  +  A2B2  +  A3B3  =  £T_i  AiBi.  This  is  very  easy  to  generalize  to  n  dimensions. 
By  definition,  the  inner  product  of  two  vectors  in  n  dimensions  is  given  by 


(10.1)  A  •  B  =  (Inner  product  of  A  and  B)  =  AiBi. 

i=l 


Similarly,  generalizing  (4.1),  we  can  define  the  length  or  norm  of  a  vector  in  n  di¬ 
mensions  by  the  formula: 


n 

(10.2) 

A  =  Norm  of  A  =  |  |A|  |  =  VA  ■  A  = 

EAi 

\ 

i= 1 

In  3  dimensions,  we  also  write  the  scalar  product  as  AB  cos  9  [see  (4.2)]  so  if  two 
vectors  are  orthogonal  (perpendicular)  their  scalar  product  is  AB  cos7r/2  =  0.  We 
generalize  this  to  n  dimensions  by  saying  that  two  vectors  in  n  dimensions  are 
orthogonal  if  their  inner  product  is  zero. 


n 

(10.3)  A  and  B  are  orthogonal  if  Y.A'R'  =  (]- 

i=  1 
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Schwarz  Inequality  In  2  or  3  dimensions  we  can  find  the  angle  between  two 
vectors  [see  (4.11)]  from  the  formula  A  •  B  =  AB  cos  6.  It  is  tempting  to  use  the 
same  formula  in  n  dimensions,  but  before  we  do  we  should  be  sure  that  the  resulting 
value  of  cos  9  will  satisfy  |  cos(9|  <  1,  that  is 


(10.4) 

A  •  B  <  AB,  or 

n 

J2AtBi 

-\ 

n 

T.AU 

n 

!>?- 

i= 1 

\ 

•=■  \ 

i=l 

This  is  called  the  Schwarz  inequality  (for  n-dimensional  Euclidean  space).  We  can 
prove  it  as  follows.  First  note  that  if  B  =  0,  (10.4)  just  says  0  <  0  which  is  certainly 
true.  For  B  /  0,  we  consider  the  vector  C  =  BA  —  (A  •  B)B /B,  and  find  C  •  C. 
Now  C  •  C  =  Cf  >  0,  so  we  have 

C  •  C  =  B2(A  ■  A)  -  2B(A  •  B)(A  •  B)/B+  (A  •  B)2(B  •  B)/B2 
=  A2B2  -  2(A  •  B)2  +  (A  -B)2  =  A2B2  -  (A  •  B)2  =  C2  >  0, 

which  gives  (10.4).  Thus,  if  we  like,  we  can  define  the  cosine  of  the  angle  between 
two  vectors  in  n  dimensions  by  cos 9  =  A  •  B /(AB).  Note  that  equality  holds  in 
Schwarz’s  inequality  if  and  only  if  cos  6  =  ±1,  that  is,  when  A  and  B  are  parallel 
or  antiparallel,  say  B  =  k:A. 

Example  3.  Find  the  cosine  of  the  angle  between  each  pair  of  the  3  basis  vectors  we  found 
in  Example  2. 

By  (10.2)  we  find  that  the  norms  of  the  first  two  basis  vectors  are  y/1  +  1  +  25  = 
x/27  and  yjl  +  36  =  \/37.  By  (10.1),  the  inner  product  of  these  two  vectors  is 
l-0  +  0-  l  +  l-  0  +  5-  6  +  0-  0  =  30.  Thus  cos<2  =  30/(V27  •  37)  ~  0.949,  which, 
we  note,  is  <  1  as  Schwarz’s  inequality  says.  The  third  basis  vector  in  Example  2 
is  orthogonal  to  the  other  two  since  the  inner  products  are  zero,  that  is,  cos0  =  0. 

Orthonormal  Basis;  Gram-Schmidt  Method  We  call  a  set  of  vectors  or¬ 
thonormal  if  they  are  all  mutually  orthogonal  (perpendicular),  and  each  vector  is 
normalized  (that  is,  its  norm  is  one — it  has  unit  length).  For  example,  the  vectors 
i,  j,  k,  form  an  orthonormal  set.  If  we  have  a  set  of  basis  vectors  for  a  space,  it  is 
often  convenient  to  take  combinations  of  them  to  form  an  orthonormal  basis.  The 
Gram-Schmidt  method  is  a  systematic  process  for  doing  this.  It  is  very  simple  in 
idea  although  the  details  of  carrying  it  out  can  get  messy.  Suppose  we  have  basis 
vectors  A,  B,  C.  Normalize  A  to  get  the  first  vector  of  a  set  of  orthonormal  basis 
vectors.  To  get  a  second  basis  vector,  subtract  from  B  its  component  along  A;  what 
remains  is  orthogonal  to  A.  [See  equation  (4.4)  and  Figure  4.10.]  Normalize  this 
remainder  to  find  the  second  vector  of  an  orthonormal  basis.  Similarly,  subtract 
from  C  its  components  along  A  and  B  to  find  a  third  vector  orthogonal  to  both  A 
and  B  and  normalize  this  third  vector.  We  now  have  3  mutually  orthogonal  unit 
vectors;  this  is  the  desired  set  of  orthonormal  basis  vectors.  In  a  space  of  higher 
dimension,  this  process  can  be  continued.  (We  will  see  a  use  for  this  method  in 
Section  11;  see  degeneracy,  pages  152-153.) 
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Example  4.  Given  the  basis  vectors  A,  B,  C,  below,  use  the  Gram-Schmidt  method  to 
find  an  orthonormal  set  of  basis  vectors  ei,  e2,  ©3.  Following  the  outline  above,  we 
find 

A=  (0,0, 5,0);  ei  =  A/A  =(0,0, 1,0); 

B  =  (2,  0, 3,  0);  B  -  (ei  •  B)ei  =  B  —  3ei  =  (2, 0, 0,  0); 

e2  =  (1,0, 0,0); 

C  =  (7, 1,  -5, 3);  C  -  (ei  ■  C)ei  -  (e2  •  C)e2  =  C  -  (-5)ei  -  7e2 

=  (0,1,  0,3); 
e3  =  (0,  l,0,3)/v/10. 

Complex  Euclidean  Space  In  applications  it  is  useful  to  allow  vector  compo¬ 
nents  to  be  complex.  For  example,  in  three  dimensions  we  might  consider  vectors 
like  (5  +  2i,  3  —  i,  1  +  i).  Let’s  go  back  and  see  what  modifications  are  needed  in 
this  case.  In  (10.2),  we  want  the  quantity  under  the  square  root  sign  to  be  positive. 
To  assure  this,  we  replace  the  square  of  Ai  by  the  absolute  square  of  A,,  that  is  by 
\Ai\2  =  A*Ai  where  A*  is  the  complex  conjugate  of  At  (see  Chapter  2).  Similarly, 
in  (10.1)  and  (10.3),  we  replace  At B,  by  A*Bi.  Thus  we  define 

n 

(10.6)  (Inner  product  of  A  and  B)  =  A*Bi 

i= 1 


(10.7)  (Norm  of  A)  = 


(10.8)  A  and  B  are  orthogonal  if  Ea^=°. 

i= 1 

The  Schwarz  inequality  becomes  (see  Problem  6) 

n  n 

(10.9)  ^A^Bi  <  ^A*A, 

i=l  \  i= 1 

Note  that  we  can  write  the  inner  product  in  matrix  form.  If  A  is  a  column  matrix 
with  elements  Ai,  then  the  transpose  conjugate  matrix  At  is  a  row  matrix  with 
elements  A*.  Using  this  notation  we  can  write  A*Bi  =  A^B  (Problem  9). 

Example  5.  Given  A  =  (3 i,  1  —  i,  2  +  3t,  1  +  2 i),  B  =  (—1, 1  +  2i,  3  —  i,  i),  C  =  (4  —  2i,  2  — 
i,  1,  i  —  2),  we  find  by  (10.6)  to  (10.8): 

(Inner  product  of  A  and  B)  =  (— 3i)(— 1)  +  (1  +  i)(l  +  2 i) 

+  (2  -  3i)(3  —  *)  +  (!  —  2 i)i  =  4  -  Ai. 
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(Norm  of  A)2  =  (— 3z)(3z)  +  (1  +  i)(l  -  i)  +  (2  -  3i)(2  +  3 i)  +  (1  -  2i)(l  +  2 i) 

=  9  +  2  +  13  +  5  =  29,  1 1  A|  |  =  y/29. 

(Norm  of  B)2  =  1  +  5  +  10+  1  =  17,  ||B||  =  Vl7. 

Note  that  |4  —  4z|  =  4\/2  <  v/29v/17  in  accord  with  the  Schwarz  inequality  (10.9). 

(Inner  product  of  B  and  C)  =  (— 1)(4  —  2 i)  +  (1  —  2z)(2  —  i)  +  (3  +  i)(l) 

+  ( — z)(z  —  2)  =  — 4  +2  i  —  5i  +  3  +  i  +  1  +  2z  =  0. 

Thus  by  (10.8),  B  and  C  are  orthogonal. 

PROBLEMS,  SECTION  10 

1.  Find  the  distance  between  the  points 

(a)  (4, -1,2,  7)  and  (2,  3, 1,9); 

(b)  (-1,5, -3,  2, 4)  and  (2,  6,  2,  7,  6); 

(c)  (5,  -2,  3,  3, 1,  0)  and  (0, 1, 5,  7,  2, 1). 

2.  For  the  given  sets  of  vectors,  find  the  dimension  of  the  space  spanned  by  them  and 
a  basis  for  this  space. 

(a)  (1,-1, 0,0),  (0,-2, 5,1),  (1,-3, 5,1),  (2, -4, 5,1); 

(b)  (0,1,  2,  0,0, 4),  (1,1, 3,  5, -3,  5),  (1,  0,  0,  5,  0, 1),  (-1, 1,  3, -5, -3, 3), 

(0,0, 1,0, -3,0); 

(c)  (0, 10,  -1, 1, 10),  (2,  -2,  -4, 0,  -3),  (4,  2,  0, 4,  5),  (3,  2,  0, 3, 4),  (5,  -4,  5,  6,  2). 

3.  (a)  Find  the  cosines  of  the  angles  between  pairs  of  vectors  in  Problem  2(a). 

(b)  Find  two  orthogonal  vectors  in  Problem  2(b). 

4.  For  each  given  set  of  basis  vectors,  use  the  Gram-Schmidt  method  to  find  an  or¬ 
thonormal  set. 

(a)  A  =  (0,  2,  0,  0),  B  =  (3,  -4,  0,  0),  C  =  (1,  2, 3, 4). 

(b)  A  =  (0,  0, 0,  7),  B  =  (2,  0,  0,  5),  C  =  (3, 1, 1, 4). 

(c)  A  =  (6,  0,  0,  0),  B  =  (1,  0,  2,  0),  C  =  (4, 1,  9,  2). 

5.  By  (10.6)  and  (10.7),  find  the  norms  of  A  and  B  and  the  inner  product  of  A  and  B, 

and  note  that  the  Schwarz  inequality  (10.9)  is  satisfied: 

(a)  A  =f;(3  +  i,  1, 2  —  i,  —5 i,  i  +  1),  B  =  (2 i,  4  —  3i,  1  +  i,  3 i,  1); 

(b)  A  =  (2, 2i  —  3, 1  +  *,  5i,  i  —  2),  B  =  (5*  -  2, 1, 3  +  i,  2 i,  4). 

6.  Write  out  the  proof  of  the  Schwarz  inequality  (10.9)  for  a  complex  Euclidean  space. 
Hint:  Follow  the  proof  of  (10.4)  in  (10.5),  replacing  the  definitions  of  norm  and  inner 
product  in  (10.1)  and  (10.2)  by  the  definitions  in  (10.6)  and  (10.7).  Remember  that 
norms  are  real  and  >  0. 

7.  Show  that,  in  n-dimensional  space,  any  n  +  1  vectors  are  linearly  dependent. 

Hint:  See  Section  8. 

8.  Show  that  two  different  sets  of  basis  vectors  for  the  same  vector  space  must  contain 
the  same  number  of  vectors.  Hint:  Suppose  a  basis  for  a  given  vector  space  contains 
n  vectors.  Use  Problem  7  to  show  that  there  cannot  be  more  than  n  vectors  in  a 
basis  for  this  space.  Conversely,  if  there  were  a  correct  basis  with  less  than  n  vectors, 
what  can  you  say  about  the  claimed  n-vector  basis? 
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9.  Write  equations  (10.6)  to  (10.9)  in  matrix  form  as  discussed  just  after  (10.9). 

10.  Prove  that  1 1 A  +  B|  <  ||Aj|  +  ||B||.  This  is  called  the  triangle  inequality;  in  two 
or  three  dimensions,  it  simply  says  that  the  length  of  one  side  of  a  triangle  < 
sum  of  the  lengths  of  the  other  2  sides.  Hint:  To  prove  it  in  n-dimensional  space, 
write  the  square  of  the  desired  inequality  using  (10.2)  and  also  use  the  Schwarz 
inequality  (10.4).  Generalize  the  theorem  to  complex  Euclidean  space  by  using 
(10.7)  and  (10.9). 

11.  EIGENVALUES  AND  EIGENVECTORS;  DIAGONALIZING  MATRICES 

We  can  give  the  following  physical  interpretation  to  Figure  7.2  and  equations  (7.5). 
Suppose  the  ( x ,  y)  plane  is  covered  by  an  elastic  membrane  which  can  be  stretched, 
shrunk,  or  rotated  (with  the  origin  fixed).  Then  any  point  (x,y)  of  the  membrane 
becomes  some  point  (X,  Y)  after  the  deformation,  and  we  can  say  that  the  matrix  M 
describes  the  deformation.  Let  us  now  ask  whether  there  are  any  vectors  such  that 
Ft  =  Ar  where  A  =  const.  Such  vectors  are  called  eigenvectors  (or  characteristic 
vectors)  of  the  transformation,  and  the  values  of  A  are  called  the  eigenvalues  (or 
characteristic  values)  of  the  matrix  M  of  the  transformation. 


Eigenvalues  To  illustrate  finding  eigenvalues,  let’s  consider  the  transformation 


(11.1) 


The  eigenvector  condition  R  =  Ar  is,  in  matrix  notation, 


or  written  out  in  equation  form: 


(11.2) 


5x  —  2  y  =  Xx, 
-2x  +  2  y  =  A  y, 


(5  -  A)x  -  2y  =  0, 
—2x  +  (2  —  A  )y  =  0. 


These  equations  are  homogeneous.  Recall  from  (8.9)  that  a  set  of  homogeneous 
equations  has  solutions  other  than  x  =  y  =  0  only  if  the  determinant  of  the  coeffi¬ 
cients  is  zero.  Thus  we  want 


(11.3) 


5  -  A 
-2 


-2 
2- A 


=  0. 


This  is  called  the  characteristic  equation  of  the  matrix  M,  and  the  determinant  in 
(11.3)  is  called  the  secular  determinant. 


To  obtain  the  characteristic  equation  of  a  matrix  M,  we  subtract  A  from  the 
elements  on  the  main  diagonal  of  M,  and  then  set  the  determinant  of  the  resulting 
matrix  equal  to  zero. 


We  solve  (11-3)  for  A  to  find  the  characteristic  values  of  M: 

(5  -  A) (2  -  A)  -  4  =  A2  -  7A  +  6  =  0, 
A  =  1  or  A  =  6. 


(11.4) 
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Eigenvectors  Substituting  the  A  values  from  (11.4)  into  (11.2),  we  get: 

,  ,  2x  —  y  =  0  from  either  of  the  equations  (11.2)  when  A  =  1; 

^  x  +  2y  =  0  from  either  of  the  equations  (11.2)  when  A  =  6. 

We  were  looking  for  vectors  r  =  \x  +  j y  such  that  the  transformation  (11.1)  would 
give  an  R  parallel  to  r.  What  we  have  found  is  that  any  vector  r  with  x  and  y  com¬ 
ponents  satisfying  either  of  the  equations  (11.5)  has  this  property.  Since  equations 
(11.5)  are  equations  of  straight  lines  through  the  origin,  such  vectors  lie  along  these 
lines  (Figure  11.1).  Then  equations  (11.5)  show  that  any  vector  r  from  the  origin 
to  a  point  on  x  +  2y  =  0  is  changed  by  the  transformation  (11.1)  to  a  vector  in 
the  same  direction  but  six  times  as  long,  and  any  vector  from  the  origin  to  a  point 
on  2a '  —  y  =  0  is  unchanged  by  the  transformation  (11.1).  These  vectors  (along 
x  +  2y  =  0  and  2x  —  y  =  0)  are  the  eigenvectors  of  the  transformation.  Along  these 
two  directions  (and  only  these),  the  deformation  of  the  elastic  membrane  was  a  pure 
stretch  with  no  shear  (rotation). 


Diagonalizing  a  Matrix  We  next  write  (11.2)  once  with  A  =  1,  and  again  with 
A  =  6,  using  subscripts  1  and  2  to  identify  the  corresponding  eigenvectors: 


(11.6) 


5aq  —  2yi  =  x\,  5x2  —  2j/2  =  6x2, 

— 2aq  +  2  j/i  =  yt,  -2x2  +  2  y2  =  6  y2. 


These  four  equations  can  be  written  as  one  matrix  equation,  as  you  can  easily  verify 
by  multiplying  out  both  sides  (Problem  1): 


(11.7) 


(  5  -2\  fXl  x2\  fXl  x2\  (l  0\ 

2  2 J  \yi  y2)  \yi  y2)  VO  6 J  ' 


All  we  really  can  say  about  (xi,yi)  is  that  2x\  —  y  1  =  0;  however,  it  is  convenient  to 
pick  numerical  values  of  x\  and  y\  to  make  ri  =  (aq,  y\)  a  unit  vector,  and  similarly 
for  r2  =  (x2,  y2).  Then  we  have 


(11.8) 
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and  (11.7)  becomes 


(11.9) 


5 

-2 


-2 

2 


y/5  \/5 

(—  —\ 

vz  vz 

2  1 

2  1 

WZ  vz) 

wz  vz) 

/J_ 

M=(J  1).  C  = 

V5 

V5 

2 

1 

wz 

vz) 

the  determinant  of  C  is  not 

zero, 

then 

Representing  these  matrices  by  letters  we  can  write 

MC  =  CD,  where 

(11.10) 


D  = 


1  0 
0  6 


If,  as  here 
us  multiply  (11.10)  by  C-1 
C^MC  =  C_1CD  =  D. 


and  remember  that  C  1C  is  the  unit  matrix; 


,  let 
then 


(11.11)  C_1MC  =  D. 

The  matrix  D  has  elements  different  from  zero  only  down  the  main  diagonal;  it 
is  called  a  diagonal  matrix.  The  matrix  D  is  called  similar  to  M,  and  when  we 
obtain  D  given  M,  we  say  that  we  have  diagonalized  M  by  a  similarity  transfor¬ 
mation. 


We  shall  see  shortly  that  this  amounts  physically  to  a  simplification  of  the  problem 
by  a  better  choice  of  variables.  For  example,  in  the  problem  of  the  membrane,  it  is 
simpler  to  describe  the  deformation  if  we  use  axes  along  the  eigenvectors.  Later  we 
shall  see  more  examples  of  the  use  of  the  diagonalization  process. 

Observe  that  it  is  easy  to  find  D;  we  need  only  solve  the  characteristic  equation 
of  M.  Then  D  is  a  matrix  with  these  characteristic  values  down  the  main  diagonal 
and  zeros  elsewhere.  We  can  also  find  C  (with  more  work),  but  for  many  purposes 
only  D  is  needed. 

Note  that  the  order  of  the  eigenvalues  down  the  main  diagonal  of  D  is  arbitrary; 
for  example  we  could  write  (11.6)  as 

<"•“>  (-2 1)  t ;:) = t ::)  (s  ? 

instead  of  (11.7).  Then  (11.11)  still  holds,  with  a  different  C,  of  course,  and  with 


D  = 


instead  of  as  in  (11.10)  (Problem  1). 


6 

0 


0 

1 
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Meaning  of  C  and  D  To  see  more  clearly  the  meaning  of 
(11.11)  let  us  find  what  the  matrices  C  and  D  mean  physically. 
We  consider  two  sets  of  axes  (x,y)  and  ( x',y ')  with  (x',y') 
rotated  through  9  from  (x,y)  (Figure  11.2).  The  (x,y)  and 
( x ' ,  y')  coordinates  of  one  point  (or  components  of  one  vector 
r  =  r')  relative  to  the  two  systems  are  related  by  (7.13). 
Solving  (7.13)  for  x  and  y,  we  have 


(11.13) 


x  =  xJ  cos  0  —  y'  sin  (9, 
y  =  xJ  sin  9  +  y'  cos  9, 


Figure  11.2 


or  in  matrix  notation 


(11.14) 


r  =  Cr' 


where 


( cos  9 
^sin  9 


—  sin  6\ 
cos  9 )  ‘ 


This  equation  is  true  for  any  single  vector  with  components  given  in  the  two  systems. 
Suppose  we  have  another  vector  R  =  R/  (Figure  11.2)  with  components  X,  Y  and 
X',  Y';  these  components  are  related  by 

(11.15)  R=CR'. 

Now  let  M  be  a  matrix  which  describes  a  deformation  of  the  plane  in  the  ( x ,  y) 
system.  Then  the  equation 


(11.16) 


R  =  Mr 


says  that  the  vector  r  becomes  the  vector  R  after  the  deformation,  both  vectors 
given  relative  to  the  (a:,  y)  axes.  Let  us  ask  how  we  can  describe  the  deformation  in 
the  (x',y')  system,  that  is,  what  matrix  carries  r'  into  R'?  We  substitute  (11.14) 
and  (11.15)  into  (11.16)  and  find  CR/  =  MCr'  or 

(11.17)  R'  =  C_1MCr/. 


Thus  the  answer  to  our  question  is  that 


D  =  C  *MC  is  the  matrix  which  describes  in  the  (x',y')  system  the  same  defor¬ 
mation  that  M  describes  in  the  (x,  y)  system. 


Next  we  want  to  show  that  if  the  ma¬ 
trix  C  is  chosen  to  make  D  =  C-1MC 
a  diagonal  matrix,  then  the  new  axes 
(x\  y')  are  along  the  directions  of  the  eig¬ 
envectors  of  M.  Recall  from  (11.10)  that 
the  columns  of  C  are  the  components  of 
the  unit  eigenvectors.  If  the  eigenvectors 
are  perpendicular,  as  they  are  in  our  ex¬ 
ample  (see  Problem  2)  then  the  new  axes 
(. x',y ')  along  the  eigenvector  directions 
are  a  set  of  perpendicular  axes  rotated 
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from  axes  ( x,y )  by  some  angle  9  (Figure  11.3).  The  unit  eigenvectors  ri  and  r2  are 
shown  in  Figure  11.3;  from  the  figure  we  find 


(11.18) 


x\  =  jri |  cos 6  =  cos6>,  X2  =  —  |r2|  sin 0  =  —  sin6> 

t/i  =  |ri|  sin6>  =  sin#,  2/2  =  |r2|  cos  6  =  cos  6; 

Xi  £2\  _  ( cos  9  —  sin  9 

2/i  2/2 )  V  sin  9  cos  9 


Thus,  the  matrix  C  which  diagonalizes  M  is  the  rotation  matrix  C  in  (11.14) 
when  the  axes  are  along  the  directions  of  the  eigenvectors  of  M. 


Relative  to  these  new  axes,  the  diagonal  matrix  D  describes  the  deformation. 
For  our  example  we  have 


(11.19) 


R'  =  Dr' 


X'  =  x’ 


1  0\  fx' 

0  6 )  U' 


y  =  62/' ■ 


In  words,  (11.19)  says  that  [in  the  {x',y')  system]  each  point  (x1  ,y')  has  its  x '  coor¬ 
dinate  unchanged  by  the  deformation  and  its  y’  coordinate  multiplied  by  6,  that  is, 
the  deformation  is  simply  a  stretch  in  the  y'  direction.  This  is  a  simpler  description 
of  the  deformation  and  clearer  physically  than  the  description  given  by  (11.1). 

You  can  see  now  why  the  order  of  eigenvalues  down  the  main  diagonal  in  D  is 
arbitrary  and  why  (11.12)  is  just  as  satisfactory  as  (11.7).  The  new  axes  (xr,  y')  are 
along  the  eigenvectors,  but  it  is  unimportant  which  eigenvector  we  call  x'  and  which 
we  call  y' .  In  doing  a  problem  we  simply  select  a  D  with  the  eigenvalues  of  M  in 
some  (arbitrary)  order  down  the  main  diagonal.  Our  choice  of  D  then  determines 
which  eigenvector  direction  is  called  the  x'  axis  and  which  is  called  y' . 

It  was  unnecessary  in  the  above  discussion  to  have  the  x'  and  y'  axes  perpendic¬ 
ular,  although  this  is  the  most  useful  case.  If  r  =  Cr'  but  C  is  just  any  (nonsingular) 
matrix  [not  necessarily  the  orthogonal  rotation  matrix  as  in  (11.14)],  then  (11.17) 
still  follows.  That  is,  C-1MC  describes  the  deformation  using  (x',y')  axes.  But  if 
C  is  not  an  orthogonal  matrix,  then  the  (x',  y') 
axes  are  not  perpendicular  (Figure  11.4)  and 
x2  +  y2  yf  x'  +  2/  ,  that  is,  the  transforma¬ 
tion  is  not  a  rotation  of  axes.  Recall  that  C 
is  the  matrix  of  unit  eigenvectors;  if  these  are 
perpendicular,  then  C  is  an  orthogonal  ma¬ 
trix  (Problem  6).  It  can  be  shown  that  this 
will  be  the  case  if  and  only  if  the  matrix  M 
is  symmetric.  [See  equation  (11.27)  and  the 
discussion  just  before  it.  Also  see  Problems 

33  to  35,  and  Problem  15.25.]  Figure  11.4 


Degeneracy  For  a  symmetric  matrix,  we  have  seen  that  the  eigenvectors  corre¬ 
sponding  to  different  eigenvalues  are  orthogonal.  If  two  (or  more)  eigenvalues  are 
the  same,  then  that  eigenvalue  is  called  degenerate.  Degeneracy  means  that  two  (or 
more)  independent  eigenvectors  correspond  to  the  same  eigenvalue. 
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Example  1.  Consider  the  following  matrix: 


The  eigenvalues  of  M  are  A  =  6,  —3,  —3,  and  the  eigenvector  corresponding  to  A  =  6 
is  (2,  —2, 1)  (Problem  36).  For  A  =  —3,  the  eigenvector  condition  is  2x  —  2y  +  z  =  0. 
This  is  a  plane  orthogonal  to  the  A  =  6  eigenvector,  and  any  vector  in  this  plane 
is  an  eigenvector  corresponding  to  A  =  —3.  That  is,  the  A  =  —  3  eigenspace  is  a 
plane.  It  is  convenient  to  choose  two  orthogonal  eigenvectors  as  basis  vectors  in  this 
A  =  — 3  eigenplane,  for  example  (1,1,0)  and  (—1,1,4).  (See  Problem  36.) 

You  may  ask  how  you  find  these  orthogonal  eigenvectors  except  by  inspection. 
Recall  that  the  cross  product  of  two  vectors  is  perpendicular  to  both  of  them.  Thus 
in  the  present  case  we  could  pick  one  vector  in  the  A  =  —3  eigenplane  and  then 
take  its  cross  product  with  the  A  =  6  eigenvector.  This  gives  a  second  vector  in  the 
A  =  —  3  eigenplane,  perpendicular  to  the  first  one  we  picked.  However,  this  only 
works  in  three  dimensions;  if  we  are  dealing  with  spaces  of  higher  dimension  (see 
Section  10),  then  we  need  another  method.  Suppose  we  first  write  down  just  any 
two  (different)  vectors  in  the  eigenplane  not  trying  to  make  them  orthogonal.  Then 
we  can  use  the  Gram-Schmidt  method  (see  Section  10)  to  find  an  orthogonal  set. 
For  example,  in  the  problem  above,  suppose  you  had  thought  of  (or  your  computer 
had  given  you)  the  vectors  A  =  (1, 1, 0)  and  B  =  (—1, 0, 2)  which  are  vectors  in  the 
A  =  —3  eigenplane  but  not  orthogonal  to  each  other.  Following  the  Gram-Schmidt 
method,  we  find 

A  =  (1,1,0),  e  =  A/A  =  (l,l,0)/\/2, 

B  -  (e  •  B)e  =  (-1,0, 2)  -  ^(1, 1,0)  =  ±2 )  , 

or  (—1,1,4)  as  we  had  above.  For  a  degenerate  subspace  of  dimension  m  >  2, 
we  just  need  to  write  down  m  linearly  independent  eigenvectors,  and  then  find  an 
orthogonal  set  by  the  Gram-Schmidt  method. 


Diagonalizing  Hermitian  Matrices  We  have  seen  how  to  diagonalize  symmet¬ 
ric  matrices  by  orthogonal  similarity  transformations.  The  complex  analogue  of 
a  symmetric  matrix  (ST  =  S)  is  a  Hermitian  matrix  (H*  =  H)  and  the  complex 
analogue  of  an  orthogonal  matrix  (0T  =  CU1)  is  a  unitary  matrix  (U^  =  U_1).  So 
let’s  discuss  diagonalizing  Hermitian  matrices  by  unitary  similarity  transformations. 
This  is  of  great  importance  in  quantum  mechanics. 

Although  Hermitian  matrices  may  have  complex  off-diagonal  elements,  the  eigen¬ 
values  of  a  Hermitian  matrix  are  always  real.  Let’s  prove  this.  (Refer  to  Section  9 
for  definitions  and  theorems  as  needed.)  Let  H  be  a  Hermitian  matrix,  and  let  r  be 
the  column  matrix  of  a  non-zero  eigenvector  of  H  corresponding  to  the  eigenvalue  A. 
Then  the  eigenvector  condition  is  Hr  =  Ar.  We  want  to  take  the  transpose  conjugate 
(dagger)  of  this  equation.  Using  the  complex  conjugate  of  equation  (9.10),  we  get 
(Hr)T  =  Uh!  =  UH  since  H*  =  H  for  a  Hermitian  matrix.  The  transpose  conjugate 
of  Ar  is  A*r!  (since  A  is  a  number,  we  just  need  to  take  its  complex  conjugate).  Now 
we  have  the  two  equations 


(11.21) 


Hr  =  Ar  and 


r+H  =  A*rf. 
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Multiply  the  first  equation  in  (11.21)  on  the  left  [see  discussion  following  (10.9)]  by 
the  row  matrix  r'  and  the  second  equation  on  the  right  by  the  column  matrix  r  to 
get 

(11.22)  r^Hr  =  Ar^r  and  r^Hr  =  A*r^r. 

Subtracting  the  two  equations  we  find  (A  —  A*)rir  =  0.  Since  we  assumed  r  /  0,  we 
have  A*  =  A,  that  is,  A  is  real. 

We  can  also  show  that  for  a  Hermitian  matrix  the  eigenvectors  corresponding  to 
two  different  eigenvalues  are  orthogonal.  Start  with  the  two  eigenvector  conditions, 

(11.23)  Hri  =  Airi  and  Hr2  =  A2^. 

From  these  we  can  show  (Problem  37) 

(11.24)  r|Hr2  =  Air|r2  =  A2r{r2,  or  (Ai  -  A2)r}r2  =  0. 

Thus  if  Ai  ^  A2,  then  the  inner  product  of  ri  and  r2  is  zero,  that  is,  they  are 
orthogonal  [see  (10.8)]. 

We  can  also  prove  that  if  a  matrix  M  has  real  eigenvalues  and  can  be  diagonalized 
by  a  unitary  similarity  transformation,  then  it  is  Hermitian.  In  symbols,  we  write 
U_1MU  =  D,  and  find  the  transpose  conjugate  of  this  equation  to  get  (Problem  38) 

(11.25)  (U_1MU)t  =  U_1MtU  =  Df  =  D. 

Thus  U_1MU  =  D  =  U_1M'I'U,  so  M  =  M^,  which  says  that  M  is  Hermitian.  So  we 
have  proved  that 


(11.26) 


A  matrix  has  real  eigenvalues  and  can  be  diagonalized  by  a  unitary 
similarity  transformation  if  and  only  if  it  is  Hermitian. 


Since  a  real  Hermitian  matrix  is  a  symmetric  matrix  and  a  real  unitary  matrix  is  an 
orthogonal  matrix,  the  corresponding  statement  for  symmetric  matrices  is  (Prob¬ 
lem  39). 


(11.27) 


A  matrix  has  real  eigenvalues  and  can  be  diagonalized  by  an  or¬ 
thogonal  similarity  transformation  if  and  only  if  it  is  symmetric. 


Recall  from  (9.2)  and  Problem  9.22  that  normal  matrices  include  symmetric,  Hermi¬ 
tian,  orthogonal,  and  unitary  matrices  (as  well  as  some  others).  It  may  be  useful  to 
know  the  following  general  theorem  which  we  state  without  proof  [see,  for  example, 
Am.  J.  Phys.  52,  513-515  (1984)]. 


(11.28) 


A  matrix  can  be  diagonalized  by  a  unitary  similarity  transformation 
if  and  only  if  it  is  normal. 
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Example  2.  To  illustrate  diagonalizing  a  Hermitian  matrix  by  a  unitary  similarity  trans¬ 
formation,  we  consider  the  matrix 

(11.29)  H=  (3  +  ,  3-7)' 

(Verify  that  H  is  Hermitian.)  We  follow  the  same  routine  we  used  to  find  the 
eigenvalues  and  eigenvectors  of  a  symmetric  matrix.  The  eigenvalues  are  given  by 

(2  -  A)(— 1  -  A)  -  (3  +  *)(. 3  -  *)  =  0, 

A2  —  A  —  12  =  0,  A  =  -3,4. 


For  A  =  —3,  an  eigenvector  satisfies  the  equations 


(  5 
\3  +  * 

5x  +  (3 


or 


—  i)y  =  0,  (3  +  i)x  +  2y  =  0. 


These  equations  are  satisfied  by  x  =  2,  y  =  (— 3— i).  A  choice  for  the  unit  eigenvector 
is  (2,-3  -i)/VU.  For  A  =  4,  we  find  similarly  the  equations 


—2x  +  (3  —  i)y  =  0,  (3  +  i)x  —  5y  =  0, 

which  are  satisfied  by  y  =  2,  x  =  3  —  i,  so  a  unit  eigenvector  is  (3  —  i,  2) /\/T4.  We 
can  verify  that  the  two  eigenvectors  are  orthogonal  (as  we  proved  above  that  they 
must  be)  by  finding  that  their  inner  product  [see  (10.8)]  is  (2,  —3  —  i)*  •  (3  —  i,  2)  = 
2(3  —  *)  +  2(— 3  +  ?')  =  0.  As  in  (11.10)  we  write  the  unit  eigenvectors  as  the  columns 
of  a  matrix  U  which  diagonalizes  H  by  a  similarity  transformation. 


U  = 


1  (  2 


Vu 


ut 


1  f  2  -3  +  A 

\/l4  \3  +  f  2  J 


You  can  easily  verify  that  LRU  =  the  unit  matrix,  so  U  1  =  Ub  Then  (Problem  40) 


(11.30) 


U_1HU  =  l^HU  = 


-3  0 
0  4 


that  is,  H  is  diagonalized  by  a  unitary  similarity  transformation. 


Orthogonal  Transformations  in  3  Dimensions  In  Section  7,  we  considered 
the  active  rotation  and/or  reflection  of  vectors  r  which  was  produced  by  a  given 
3  by  3  orthogonal  matrix.  Study  Equations  (7.18)  and  (7.19)  carefully  to  see  that, 
acting  on  a  column  vector  r,  they  rotate  the  vector  by  angle  9  around  the  2  axis 
and/or  reflect  it  through  the  (x,  y)  plane.  We  would  now  like  to  see  how  to  find 
the  effect  of  more  complicated  orthogonal  matrices.  We  can  do  this  by  using  an 
orthogonal  similarity  transformation  to  write  a  given  orthogonal  matrix  relative  to 
a  new  coordinate  system  in  which  the  rotation  axis  is  the  2  axis,  and/or  the  ( x,y ) 
plane  is  the  reflecting  plane  (in  vector  space  language,  this  is  a  change  of  basis). 
Then  a  comparison  with  (7.18)  or  (7.19)  gives  the  rotation  angle.  Recall  how  we 
construct  a  C  matrix  so  that  C-1MC  describes  the  same  transformation  relative  to 
a  new  set  of  axes  that  M  described  relative  to  the  original  axes:  The  columns  of  the 
C  matrix  are  the  components  of  unit  vectors  along  the  new  axes  [see  (11.18)  and 
Figure  11.3]. 
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Example  3.  Consider  the  following  matrices. 


A=\[ ~V2 

72 

i  h2 

B=3  2 

-1 

—2 

(11.31) 

0 

72  , 

—2 

-1 

2\  1 

-V2 

1/ 

3  V  1 

2 

-2 

You  can  verify  that  A  and  B  are  both  orthogonal,  and  that  clet  A  =  1 ,  det  B  =  —  1 
(Problem  45).  Thus  A  is  a  rotation  matrix  while  B  involves  a  reflection  (and 
perhaps  also  a  rotation).  For  A,  a  vector  along  the  rotation  axis  is  not  affected  by 
the  transformation  so  we  find  the  rotation  axis  by  solving  the  equation  Ar  =  r.  We 
did  this  in  Section  7,  but  now  you  should  recognize  this  as  an  eigenvector  equation. 
We  want  the  eigenvector  corresponding  to  the  eigenvalue  1.  By  hand  or  by  computer 
(Problem  45)  we  find  that  the  eigenvector  of  A  corresponding  to  A  =  1  is  (1, 0, 1)  or 
i  +  k;  this  is  the  rotation  axis.  We  want  the  new  z  axis  to  lie  along  this  direction, 
so  we  take  the  elements  of  the  third  column  of  matrix  C  to  be  the  components  of 
the  unit  vector  u  =  (1, 0,  l)/72.  For  the  first  column  (new  x  axis)  we  choose  a  unit 
vector  perpendicular  to  the  rotation  axis,  say  v  =  (1,  0,  — l)/72,  and  for  the  second 
column  (new  y  axis),  we  use  the  cross  product  u  X  v  =  (0, 1,0)  (so  that  the  new 
axes  form  a  right-handed  orthogonal  triad) .  This  gives  (Problem  45) 

/  1  0  i\  /  °  1  °\ 

(11.32)  C  =  -=\  0  V2  0  ,  C_1AC  =  -1  0  0  . 

72  \-i  0  1/  \  0  0  1/ 

Comparing  this  result  with  (7.18),  we  see  that  cos#  =  0  and  sin#  =  — 1,  so  the 
rotation  is  —90°  around  the  axis  i  +  k  (or,  if  you  prefer,  +90°  around  —  i  k). 


Example  4.  For  the  matrix  B,  a  vector  perpendicular  to  the  reflection  plane  is  reversed 
in  direction  by  the  reflection.  Thus  we  want  to  solve  the  equation  Br  =  — r,  that 
is,  to  find  the  eigenvector  corresponding  to  A  =  —  1.  You  can  verify  (Problem  45) 
that  this  is  the  vector  (1,— 1,1)  or  i  —  j  +  k.  The  reflection  is  through  the  plane 
x  —  y  +  z  =  0,  and  the  rotation  (if  any)  is  about  the  vector  i  —  j  +  k.  As  we  did  for 
matrix  A,  we  construct  a  matrix  C  from  this  vector  and  two  perpendicular  vectors, 
to  get  (Problem  45) 


(11.33) 


/J_ 

1 

J_\ 

V6 

71 

73 

-1 

i 

-1 

71 

71 

-2 

0 

1 

VV6 

7!/ 

fll 

2 

-1 

I  2  ~2~ 

V  o  o 


C-1BC  = 


2 

73 


0 

-1/ 


Compare  this  with  (7.19)  to  get  cos 6  =  —  sin#  =  Ap,  so  matrix  B  produces  a 
rotation  of  120°  around  i  —  j  +  k  and  a  reflection  through  the  plane  x  —  y  +  z  =  0. 


You  may  have  discovered  that  matrices  A  and  B  have  two  complex  eigenvalues 
(see  Problem  46).  The  corresponding  eigenvectors  are  also  complex,  and  we  didn’t 
use  them  because  this  would  take  us  into  complex  vector  space  (see  Section  10, 
and  Problem  47)  and  our  rotation  and  reflection  problems  are  in  ordinary  real 
3-dimensional  space.  (Note  also  that  we  did  not  diagonalize  A  and  B,  but  just 
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used  similarity  transformations  to  display  them  relative  to  rotated  axes.)  However, 
when  all  the  eigenvalues  of  an  orthogonal  matrix  are  real  (see  Problem  48),  then 
this  process  produces  a  diagonalized  matrix  with  the  eigenvalues  down  the  main 
diagonal. 

Example  5.  Consider  the  matrix 


(11.34) 


You  can  verify  (Problem  49)  that  cletF  =  1,  that  the  rotation  axis  (eigenvector 
corresponding  to  the  eigenvalue  A  =  1)  is  3i  +  2j  +  k,  and  that  the  other  two 
eigenvalues  are  -1,-1.  Then  the  diagonalized  F  (relative  to  axes  with  the  new 
z  axis  along  the  rotation  axis)  is 


-1  0  0\ 

0  -1  0  . 

0  0  1/ 

Comparing  this  with  equation  (7.18),  we  see  that  cos 9  =  — 1,  sin#  =  0,  so  F  pro¬ 
duces  a  rotation  of  180°  about  3i  +  2j  +  k. 

An  even  easier  way  to  find  the  rotation  angle  in  this  problem  is  to  use  the 
trace  of  F  (Problem  50).  From  (7.18)  and  (11.34)  we  have  2  cos 9  +  1  =  —1.  Thus 
cos  9  =  —1,  9  =  180°  as  before.  This  method  gives  cos  9  for  any  rotation  or  reflection 
matrix,  but  unless  cos  9  =  ±1,  we  also  need  more  information  (say  the  value  of  sin  9) 
to  determine  whether  9  is  positive  or  negative. 

Powers  and  Functions  of  Matrices  In  Section  6  we  found  functions  of  some 
matrices  A  for  which  it  was  easy  to  find  the  powers  because  they  repeated  period¬ 
ically  [see  equations  (6.15)  to  (6.17)].  When  this  doesn’t  happen,  it  isn’t  so  easy 
to  find  powers  directly  (Problem  58).  But  it  is  easy  to  find  powers  of  a  diagonal 
matrix,  and  you  can  also  show  that  (Problem  57) 

(11.36)  Mn  =  CDnC-1,  where  C_1MC  =  D,  D  diagonal. 

This  result  is  useful  not  just  for  evaluating  powers  and  functions  of  numerical  ma¬ 
trices  but  also  for  proving  theorems  (Problem  60). 

Example  6.  We  can  show  that  if,  as  above,  C_1MC  =  D,  then 

(11.37)  det  eM  =  eTl^M). 

As  in  (6.17)  we  define  eM  by  its  power  series.  For  each  term  of  the  series  M"  = 
CDnC-1  by  (11.36),  so  eM  =  CeDC-1.  By  (6.6),  the  determinant  of  a  product  = 
the  product  of  the  determinants,  and  detCC^1  =  1,  so  we  have  cleteM  =  deteD. 
Now  the  matrix  eD  is  diagonal  and  the  diagonal  elements  are  e *4  where  A i  are  the 
eigenvalues  of  M.  Thus  cleteD  =  eXl  eX2 eXs  ■  ■  ■  =  eTrD.  But  by  (9.13),  TrD  = 
Tr(CC_1M)  =  TrM,  so  we  have  (11.37). 


(11.35) 
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Simultaneous  Diagonalization  Can  we  diagonalize  two  (or  more)  matrices  us¬ 
ing  the  same  similarity  transformation?  Sometimes  we  can,  namely  if,  and  only  if, 
they  commute.  Let’s  see  why  this  is  true.  Recall  that  the  diagonalizing  C  matrix 
has  columns  which  are  mutually  orthogonal  unit  eigenvectors  of  the  matrix  being 
diagonalized.  Suppose  we  can  find  the  same  set  of  eigenvectors  for  two  matrices  F 
and  G;  then  the  same  C  will  diagonalize  both.  So  the  problem  amounts  to  showing 
how  to  find  a  common  set  of  eigenvectors  for  F  and  G  if  they  commute. 

Example  7.  Let’s  start  by  diagonalizing  F.  Suppose  r  (a  column  matrix)  is  the  eigenvector 
corresponding  to  the  eigenvalue  A,  that  is,  Fr  =  Ar.  Multiply  this  on  the  left  by  G 
and  use  GF  =  FG  (matrices  commute)  to  get 

(11.38)  GFr  =  AGr,  or  F(Gr)  =  A(Gr). 

This  says  that  Gr  is  an  eigenvector  of  F  corresponding  to  the  eigenvalue  A.  If  A  is 
not  degenerate  (that  is  if  there  is  just  one  eigenvector  corresponding  to  A)  then  Gr 
must  be  the  same  vector  as  r  (except  maybe  for  length),  that  is,  Gr  is  a  multiple  of  r, 
or  Gr  =  A'r.  This  is  the  eigenvector  equation  for  G;  it  says  that  r  is  an  eigenvector 
of  G.  If  all  eigenvalues  of  F  are  non-degenerate,  then  F  and  G  have  the  same  set  of 
eigenvectors,  and  so  can  be  diagonalized  by  the  same  C  matrix. 

Example  8.  Now  suppose  that  there  are  two  (or  more)  linearly  independent  eigenvec¬ 
tors  corresponding  to  the  eigenvalue  A  of  F.  Then  every  vector  in  the  degenerate 
eigenspace  corresponding  to  A  is  an  eigenvector  of  matrix  F  (see  discussion  of  de¬ 
generacy  above).  Next  consider  matrix  G.  Corresponding  to  all  non-degenerate 
F  eigenvalues  we  already  have  the  same  set  of  eigenvectors  for  G  as  for  F.  So  we 
just  have  to  find  the  eigenvectors  of  G  in  the  degenerate  eigenspace  of  F.  Since  all 
vectors  in  this  subspace  are  eigenvectors  of  F,  we  are  free  to  choose  ones  which  are 
eigenvectors  of  G.  Thus  we  now  have  the  same  set  of  eigenvectors  for  both  matrices, 
and  so  we  can  construct  a  C  matrix  which  will  diagonalize  both  F  and  G.  For  the 
converse,  see  Problem  62. 

PROBLEMS,  SECTION  11 

1.  Verify  (11.7).  Also  verify  (11.12)  and  find  the  corresponding  different  C  in  (11.11). 
Hint:  To  find  C,  start  with  (11.12)  instead  of  (11.7)  and  follow  through  the  method 
of  getting  (11.10)  from  (11.7). 

2.  Verify  that  the  two  eigenvectors  in  (11.8)  are  perpendicular,  and  that  C  in  (11.10) 
satisfies  the  condition  (7.9)  for  an  orthogonal  matrix. 

3.  (a)  If  C  is  orthogonal  and  M  is  symmetric,  show  that  C^1MC  is  symmetric. 

(b)  If  C  is  orthogonal  and  M  antisymmetric,  show  that  C_1MC  is  antisymmetric. 

4.  Find  the  inverse  of  the  rotation  matrix  in  (7.13);  you  should  get  C  in  (11.14).  Replace 
9  by  —9  in  (7.13)  to  see  that  the  matrix  C  corresponds  to  a  rotation  through  —8. 

5.  Show  that  the  C  matrix  in  (11.10)  does  represent  a  rotation  by  finding  the  rotation 
angle.  Write  equations  (7.13)  and  (11.13)  for  this  rotation. 

6.  Show  that  if  C  is  a  matrix  whose  columns  are  the  components  (xi,yi)  and  (£2,1/2) 
of  two  perpendicular  vectors  each  of  unit  length,  then  C  is  an  orthogonal  matrix. 
Hint:  Find  CTC. 
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7.  Generalize  Problem  6  to  three  dimensions;  to  n  dimensions. 

8.  Show  that  under  the  transformation  (11.1),  all  points  (x,  y)  on  a  given  straight  line 
through  the  origin  go  into  points  ( X ,  Y)  on  another  straight  line  through  the  origin. 
Hint:  Solve  (11.1)  for  x  and  y  in  terms  of  X  and  Y  and  substitute  into  the  equation 
y  =  mx  to  get  an  equation  Y  =  kX,  where  k  is  a  constant.  Further  hint:  If  R  =  Mr, 
then  r  =  M-1R. 

9.  Show  that  det(C_1MC)  s=  det  M.  Hints:  See  (6.6).  What  is  the  product  of  det(C_1) 
and  det  C?  Thus  show  that  the  product  of  the  eigenvalues  of  M  is  equal  to  det  M. 

10.  Show  that  Tr(C_1MC)  =  TrM.  Hint:  See  (9.13).  Thus  show  that  the  sum  of  the 
eigenvalues  of  M  is  equal  to  Tr  M. 

11.  Find  the  inverse  of  the  transformation  x'  =  2x  —  3 y,  y'  =  x  +  y,  that  is,  find  x,  y  in 
terms  of  x' ,  y' .  (Hint:  Use  matrices.)  Is  the  transformation  orthogonal? 

Find  the  eigenvalues  and  eigenvectors  of  the  following  matrices.  Do  some  problems  by 
hand  to  be  sure  you  understand  what  the  process  means.  Then  check  your  results  by 
computer. 


12. 


15. 


18. 


21. 


24. 


1  3 

2  2 


13. 


16. 


19. 


22. 


14. 


17. 


20. 


23. 


\ 

/  i 

1 

p 

1 

25. 

! 

1 

! 

26. 

! 

2 

I 

.-1  1  ~h 


a  i  2/ 


Let  each  of  the  following  matrices  M  describe  a  deformation  of  the  (x,  y)  plane.  For 
each  given  M  find:  the  eigenvalues  and  eigenvectors  of  the  transformation,  the  matrix  C 
which  diagonalizes  M  and  specifies  the  rotation  to  new  axes  (x' ,  y')  along  the  eigenvectors, 
and  the  matrix  D  which  gives  the  deformation  relative  to  the  new  axes.  Describe  the 
deformation  relative  to  the  new  axes. 


27. 


28. 


5  2 
2  2 


29. 


3  4 

4  9 


30. 


3  1 
1  3 


31. 


3  2 
2  3 


33.  Find  the  eigenvalues  and  eigenvectors  of  the  real  symmetric  matrix 


M  = 


Show  that  the  eigenvalues  are  real  and  the  eigenvectors  are  perpendicular. 
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34.  By  multiplying  out  M  =  CDC  1  where  C  is  the  rotation  matrix  (11.14)  and  D  is 
the  diagonal  matrix 

Ai  0\ 

0  \2)  ’ 

show  that  if  M  can  be  diagonalized  by  a  rotation,  then  M  is  symmetric. 

35.  The  characteristic  equation  for  a  second-order  matrix  M  is  a  quadratic  equation. 
We  have  considered  in  detail  the  case  in  which  M  is  a  real  symmetric  matrix  and 
the  roots  of  the  characteristic  equation  (eigenvalues)  are  real,  positive,  and  unequal. 
Discuss  some  other  possibilities  as  follows: 

(a)  M  real  and  symmetric,  eigenvalues  real,  one  positive  and  one  negative.  Show 
that  the  plane  is  reflected  in  one  of  the  eigenvector  lines  (as  well  as  stretched 
or  shrunk).  Consider  as  a  simple  special  case 


(b)  M  real  and  symmetric,  eigenvalues  equal  (and  therefore  real).  Show  that  M 
must  be  a  multiple  of  the  unit  matrix.  Thus  show  that  the  deformation  consists 
of  dilation  or  shrinkage  in  the  radial  direction  (the  same  in  all  directions)  with 
no  rotation  (and  reflection  in  the  origin  if  the  root  is  negative). 

(c)  M  real,  not  symmetric,  eigenvalues  real  and  not  equal.  Show  that  in  this  case 
the  eigenvectors  are  not  orthogonal.  Hint:  Find  their  dot  product. 

(d)  M  real,  not  symmetric,  eigenvalues  complex.  Show  that  all  vectors  are  rotated, 
that  is,  there  are  no  (real)  eigenvectors  which  are  unchanged  in  direction  by 
the  transformation.  Consider  the  characteristic  equation  of  a  rotation  matrix 
as  a  special  case. 

36.  Verify  the  eigenvalues  and  eigenvectors  of  matrix  M  in  (11.20).  Find  some  other 
pairs  of  orthogonal  eigenvectors  in  the  A  =  —3  eigenplane. 

37.  Starting  with  (11.23),  obtain  (11.24).  Hints:  Take  the  transpose  conjugate  (dagger) 
of  the  first  equation  in  (11.23),  (remember  that  H  is  Hermitian  and  the  A’s  are  real) 
and  multiply  on  the  right  by  r2.  Multiply  the  second  equation  in  (11.23)  on  the  left 
by  r{. 

38.  Verify  equation  (11.25).  Hint:  Remember  from  Section  9  that  the  transpose  con¬ 
jugate  (dagger)  of  a  product  of  matrices  is  the  product  of  the  transpose  conjugates 
in  reverse  order  and  that  IF  =  U-1.  Also  remember  that  we  have  assumed  real 
eigenvalues,  so  D  is  a  real  diagonal  matrix. 

39.  Write  out  the  detailed  proof  of  (11.27).  Hint:  Follow  the  proof  of  (11.26)  in  equations 
(11.21)  to  (11.25),  replacing  the  Hermitian  matrix  H  by  a  symmetric  matrix  M  which 
is  real.  However,  don’t  assume  that  the  eigenvalues  A  are  real  until  you  prove  it. 

40.  Verify  the  details  as  indicated  in  diagonalizing  H  in  (11.29). 

Verify  that  each  of  the  following  matrices  is  Hermitian.  Find  its  eigenvalues  and  eigenvec¬ 
tors,  write  a  unitary  matrix  U  which  diagonalizes  H  by  a  similarity  transformation,  and 

show  that  U_1HU  is  the  diagonal  matrix  of  eigenvalues. 


41. 


2  i 
-i  2 


42. 


3  1  -  i 

1  +  i  2 


43. 


44. 


(  -2  3  +  4i\ 

\3  -Ai  -2  ) 
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45.  Verify  the  details  in  the  discussion  of  the  matrices  in  (11.31). 

46.  We  have  seen  that  an  orthogonal  matrix  with  determinant  1  has  at  least  one  eigen¬ 
value  =  1,  and  an  orthogonal  matrix  with  determinant  =  —  1  has  at  least  one  eigen¬ 
value  =  —1.  Show  that  the  other  two  eigenvalues  in  both  cases  are  elS,  e~l8 ,  which, 
of  course,  includes  the  real  values  1  (when  9  =  0),  and  —1  (when  9  =  n).  Hint:  See 
Problem  9,  and  remember  that  rotations  and  reflections  do  not  change  the  length 
of  vectors  so  eigenvalues  must  have  absolute  value  =  1. 

47.  Find  a  unitary  matrix  U  which  diagonalizes  A  in  (11.31)  and  verify  that  U^1AU  is 
diagonal  with  the  eigenvalues  down  the  main  diagonal. 

48.  Show  that  an  orthogonal  matrix  M  with  all  real  eigenvalues  is  symmetric.  Hints : 
Method  1.  When  the  eigenvalues  are  real,  so  are  the  eigenvectors,  and  the  unitary 
matrix  which  diagonalizes  M  is  orthogonal.  Use  (11.27).  Method  2.  From  Prob¬ 
lem  46,  note  that  the  only  real  eigenvalues  of  an  orthogonal  M  are  ±1.  Thus  show 
that  M  =  M-1.  Remember  that  M  is  orthogonal  to  show  that  M  =  MT. 

49.  Verify  the  results  for  F  in  the  discussion  of  (11.34). 

50.  Show  that  the  trace  of  a  rotation  matrix  equals  2cosf?  +  1  where  9  is  the  rotation 
angle,  and  the  trace  of  a  reflection  matrix  equals  2cosf?  —  1.  Hint:  See  equations 
(7.18)  and  (7.19),  and  Problem  10. 


Show  that  each  of  the  following  matrices  is  orthogonal  and  find  the  rotation  and/or  reflec¬ 
tion  it  produces  as  an  operator  acting  on  vectors.  If  a  rotation,  find  the  axis  and  angle; 
if  a  reflection,  find  the  reflecting  plane  and  the  rotation,  if  any,  about  the  normal  to  that 
plane. 


51. 


53. 


55. 


6 

7 

-6 

2 

-1 

2 

8 

-4 

1 


52.  - 


54. 


56. 


-1 

-1 

\/2\ 

1 

1 

^2 

\/2 

-y/2 

0/ 

1 

V2 

-!\ 

\/2 

0 

V2\ 

1 

-V2 

-1  / 

( 

2 

s/2 

V2  ' 

=  - 

-s/2  1  +  s/2 

1  -  s/2 

*  V- 

-s/2  1 

-y/2 

l  +  s/2, 

57.  Show  that  if  D  is  a  diagonal  matrix,  then  D™  is  the  diagonal  matrix  with  elements 
equal  to  the  nth  power  of  the  elements  of  D.  Also  show  that  if  D  =  C_1MC,  then 
Dn  =  C_1M"C,  so  M"  =  CD"C_1.  Hint:  For  n  =  2,  (C^MC)2  =  C^MCC^MC; 
what  is  CC_1? 


58.  Note  in  Section  6  [see  (6.15)]  that,  for  the  given  matrix  A,  we  found  A2  =  —I,  so  it 
was  easy  to  find  all  the  powers  of  A.  It  is  not  usually  this  easy  to  find  high  powers 
of  a  matrix  directly.  Try  it  for  the  square  matrix  M  in  equation  (11.1).  Then  use 
the  method  outlined  in  Problem  57  to  find  M4,  M10,  eM. 

59.  Repeat  the  last  part  of  Problem  58  for  the  matrix  M  = 

60.  The  Caley-Hamilton  theorem  states  that  “A  matrix  satisfies  its  own  characteristic 
equation.”  Verify  this  theorem  for  the  matrix  M  in  equation  (11.1).  Hint:  Substitute 
the  matrix  M  for  A  in  the  characteristic  equation  (11.4)  and  verify  that  you  have  a 
correct  matrix  equation.  Further  hint:  Don’t  do  all  the  arithmetic.  Use  (11.36)  to 
write  the  left  side  of  your  equation  as  C(D2  —  7D  +  6)C_1  and  show  that  the  paren¬ 
thesis  =  0.  Remember  that,  by  definition,  the  eigenvalues  satisfy  the  characteristic 
equation. 
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61.  At  the  end  of  Section  9  we  proved  that  if  H  is  a  Hermitian  matrix,  then  the  matrix 
elH  is  unitary.  Give  another  proof  by  writing  H  =  GDC-1,  remembering  that  now 
C  is  unitary  and  the  eigenvalues  in  D  are  real.  Show  that  elD  is  unitary  and  that 
elH  is  a  product  of  three  unitary  matrices.  See  Problem  9.17d. 

62.  Show  that  if  matrices  F  and  G  can  be  diagonalized  by  the  same  C  matrix,  then  they 
commute.  Hint:  Do  diagonal  matrices  commute? 


12.  APPLICATIONS  OF  DIAGONALIZATION 


We  next  consider  some  examples  of  the  use  of  the  diagonalization  process.  A  central 
conic  section  (ellipse  or  hyperbola)  with  center  at  the  origin  has  the  equation 


(12.1) 


Ax 2  +  2  Hxy  +  By 2  =  K, 


where  A,  B ,  H  and  K  are  constants.  In  matrix  form  this  can  be  written 


(12.2) 


or 


(* 


=  K 


if  we  call 


=  M 


(as  you  can  verify  by  multiplying  out  the  matrices) .  We  want  to  choose  the  principal 
axes  of  the  conic  as  our  reference  axes  in  order  to  write  the  equation  in  simpler  form. 
Consider  Figure  11.2;  let  the  axes  ( x',y ')  be  rotated  by  some  angle  9  from  ( x,y ). 
Then  the  ( x,y )  and  ( x',y ')  coordinates  of  a  point  are  related  by  (11.13)  or  (11.14): 


(12.3) 


By  (9.11)  the  transpose  of  (12.3)  is 


(12.4) 


(x  y) 
(x  y) 


(x‘ 

(x‘ 


sin0\ 
cos  9  )  °r 

2/')  C-1 


since  C  is  an  orthogonal  matrix.  Substituting  (12.3)  and  (12.4)  into  (12.2),  we  get 


(12.5) 


(*'  y 0  C-1MC  =  K. 


If  C  is  the  matrix  which  diagonalizes  M,  then  (12.5)  is  the  equation  of  the  conic 
relative  to  its  principal  axes. 


Example  1.  Consider  the  conic 

(12.6)  5x2  —  4xy  +  2y2  =  30. 


In  matrix  form  this  can  be  written 

(IM)  (X  1)(0=3O. 
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We  have  here  the  same  matrix, 


whose  eigenvalues  we  found  in  Section  11.  In  that  section  we  found  a  C  such  that 

C-1MC  =  D  =  (q  “)■ 


Then  the  equation  (12.5)  of  the  conic  relative  to  principal  axes  is 

(12.8)  (xf  y')  (g  (*!)  =  x'2  +  6y'2  =  30. 

Observe  that  changing  the  order  of  1  and  6  in  D  would  give  6x'2  +  y’2  =  30  as  the 
new  equation  of  the  ellipse  instead  of  (12.8).  This  amounts  simply  to  interchanging 
the  x'  and  y'  axes. 

By  comparing  the  matrix  C  of  the  unit  eigenvectors  in  (11.10)  with  the  rotation 
matrix  in  (11.14),  we  see  that  the  rotation  angle  6  (Figure  11.3)  from  the  original 
axes  (x,y)  to  the  principal  axes  (x',  y')  is 


(12.9) 


9  =  arc  cos  —=■ . 

V5 


Notice  that  in  writing  the  conic  section  equation  in  matrix  form  (12.2)  and  (12.7), 
we  split  the  xy  term  evenly  between  the  two  nondiagonal  elements  of  the  matrix; 
this  made  M  symmetric.  Recall  (end  of  Section  11)  that  M  can  be  diagonalized 
by  a  similarity  transformation  C_1MC  with  C  an  orthogonal  matrix  (that  is,  by 
a  rotation  of  axes)  if  and  only  if  M  is  symmetric.  We  choose  M  symmetric  (by 
splitting  the  xy  term  in  half)  to  make  our  process  work. 

Although  for  simplicity  we  have  been  working  in  two  dimensions,  the  same  ideas 
apply  to  three  (or  more)  dimensions  (that  is,  three  or  more  variables).  As  we  have 
said  (Section  10),  although  we  can  represent  only  three  coordinates  in  physical 
space,  it  is  very  convenient  to  use  the  same  geometrical  terminology  even  though 
the  number  of  variables  is  greater  than  three.  Thus  if  we  diagonalize  a  matrix  of 
any  order,  we  still  use  the  terms  eigenvalues,  eigenvectors,  principal  axes,  rotation 
to  principal  axes,  etc. 


Example  2.  Rotate  to  principal  axes  the  quadric  surface 

x2  +  6  xy  —  2  y2  —  2  yz  +  z2  =  24. 

In  matrix  form  this  equation  is 

(x  y  z)  ^3  -2  -1  j  Q  =  24. 

The  characteristic  equation  of  this  matrix  is 
1  -  A  3  0 

3  -2  -  A  -1  =0  =  -A3  +  13A-12 

0  -1  1  -  A 


(A  —  1)(A  +  4)(A  —  3). 
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The  characteristic  values  are 

A  =  1,  A  =  —4,  A  =3. 

Relative  to  the  principal  axes  (x',y',z')  the  quadric  surface  equation  becomes 

A  0  0\  fx'\ 

(aJ  y’  z’)  0  -4  0  /  =24 

\0  0  3 j  \z'J 


or 

x'2  -  4 y'2  +  3 z'2  =  24. 

From  this  equation  we  can  identify  the  quadric  surface  (hyperboloid  of  one  sheet) 
and  sketch  its  size  and  shape  using  (x' ,  y' ,  z')  axes  without  finding  their  relation  to 
the  original  (x,  y,  z)  axes.  However,  if  we  do  want  to  know  the  relation  between  the 
two  sets  of  axes,  we  find  the  C  matrix  in  the  following  way.  Recall  from  Section  11 
that  C  is  the  matrix  whose  columns  are  the  components  of  the  unit  eigenvectors. 
One  of  the  eigenvectors  can  be  found  by  substituting  the  eigenvalue  A  =  1  into  the 
equations 

(i :!  -■)  (!)  ■  (3 

and  solving  for  x,  y,  z.  Then  ia:  +  j y  +  kr  is  an  eigenvector  corresponding  to 
A  =  1,  and  by  dividing  it  by  its  magnitude  we  get  a  unit  eigenvector  (Problem  8). 
Repeating  this  process  for  each  of  the  other  values  of  A,  we  get  the  following  three 
unit  eigenvectors: 


(tTo,0’7To) 

when 

A  =  1; 

-3  5  1  A 

when 

A  =  - 

735’  735’  735/ 

-3-2  1  A 

when 

A  =  3. 

Til’  7i4’  Vu) 

Then  the  rotation  matrix  C  is 

(—  —  —\ 

Vio  735  Vu 


735  th 

3  11 

V7n>  vM  7 n/ 


The  numbers  in  C  are  the  cosines  of  the  nine  angles  between  the  (x,  y7  z)  and 
( x',y',z ')  axes.  (Compare  Figure  11.3  and  the  discussion  of  it.) 

A  useful  physical  application  of  this  method  occurs  in  discussing  vibrations.  We 
illustrate  this  with  a  simple  problem. 
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Example  3.  Find  the  characteristic  vibration  frequencies  for  the  system  of  masses  and 
springs  shown  in  Figure  12.1. 


1 

k 

m  rrrt. cm 

m  ^ 

_f 
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— 'liWfflT' — 
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1  w 

—• - '  00001)1)  — 

1 

Figure  12.1 


Let  x  and  y  be  the  coordinates  of  the  two  masses  at  time  t  relative  to  their 
equilibrium  positions,  as  shown  in  Figure  12.1.  We  want  to  write  the  equations  of 
motion  (mass  times  acceleration  =  force)  for  the  two  masses  (see  Chapter  2,  end 
of  Section  16).  We  can  just  write  the  forces  by  inspection  as  we  did  in  Chapter  2, 
but  for  more  complicated  problems  it  is  useful  to  have  a  systematic  method.  First 
write  the  potential  energy;  for  a  spring  this  is  V  =  \ky 2  where  y  is  the  compression 
or  extension  of  the  spring  from  its  equilibrium  length.  Then  the  force  exerted  on 
a  mass  attached  to  the  spring  is  —  ky  =  — dV/dy .  If  V  is  a  function  of  two  (or 
more)  variables,  say  x  and  y  as  in  Figure  12.1,  then  the  forces  on  the  two  masses 
are  —dV/dx  and  —dV/dy  (and  so  on  for  more  variables).  For  Figure  12.1,  the 
extension  or  compression  of  the  middle  spring  is  x  —  y  so  its  potential  energy  is 
\k{x  —  y)2 .  For  the  other  two  springs,  the  potential  energies  are  \kx2  and  \ky2  so 
the  total  potential  energy  is 

(12.10)  V  =  ^ kx 2  +  ^ k(x  -  y)2  +  ^ ky 2  =  k{x2  -  xy  +  y2). 

In  writing  the  equations  of  motion  it  is  convenient  to  use  a  dot  to  indicate  a  time 
derivative  (as  we  often  use  a  prime  to  mean  an  x  derivative).  Thus  x  =  dx/dt , 
x  =  d2x/dt2,  etc.  Then  the  equations  of  motion  are 

11.  (  mx  =  —dV/dx  =  —2 kx  +  ky, 

\  my  =  —dV/dy  =  kx  —  2 ky. 

In  a  normal  or  characteristic  mode  of  vibration,  the  x  and  y  vibrations  have  the 
same  frequency.  As  in  Chapter  2,  equations  (16.22),  we  assume  solutions  x  =  Xoezwt, 
V  =  yo^lLUt i  with  the  same  frequency  u>  for  both  x  and  y.  [Or,  if  you  prefer,  we  could 
replace  elut  by  sin  u>t  or  cos  u>t  or  sin(wf  +  a),  etc.]  Note  that  (for  any  of  these 
solutions), 

(12.12)  x  —  —ijj2x,  and  y  =  —to2y. 


Substituting  (12.12)  into  (12.11)  we  get  (Problem  10) 

—mu2x  =  —2kx  +  ky, 
— mu>2y  =  kx  —  2  ky. 

In  matrix  form  these  equations  are 


(12.13) 


(12.14) 


A 


2  -1 

-1  2 


with  A  = 


Note  that  this  is  an  eigenvalue  problem  (see  Section  11).  To  find  the  eigenvalues  A, 
we  write 


(12.15) 


2  -  A  -1 


-1  2-  A 
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and  solve  for  A  to  find  A  =  1  or  A  =  3.  Thus  [by  the  definition  of  A  in  (12.14)]  the 
characteristic  frequencies  are 


(12.16) 


LO\  = 


and 


0J2  = 


The  eigenvectors  (not  normalized)  corresponding  to  these  eigenvalues  are: 
(12.17)  For  A  =  1:  y  =  x  or  r  =  (1, 1);  for  A  =  3:  y—  —x  orr  =  (1,  —1). 


Thus  at  frequency  to\  (with  y  =  x),  the  two  masses  oscillate  back  and  forth  to¬ 
gether  like  this  — >— •>  and  then  like  this  .  At  frequency  u>2  (with  y  =  —a;),  they 

oscillate  in  opposite  directions  like  this  < - >  and  then  like  this  — »<— .  These  two 

especially  simple  ways  in  which  the  system  can  vibrate,  each  involving  just  one 
vibration  frequency,  are  called  the  characteristic  (or  normal)  modes  of  vibration; 
the  corresponding  frequencies  are  called  the  characteristic  (or  normal)  frequencies 
of  the  system. 

The  problem  we  have  just  done  shows  an  important  method  which  can  be  used 
in  many  different  applications.  There  are  numerous  examples  of  vibration  problems 
in  physics — in  acoustics:  the  vibrations  of  strings  of  musical  instruments,  of  drum¬ 
heads,  of  the  air  in  organ  pipes  or  in  a  room;  in  mechanics  and  its  engineering  ap¬ 
plications:  vibrations  of  mechanical  systems  all  the  way  from  the  simple  pendulum 
to  complicated  structures  like  bridges  and  airplanes;  in  electricity:  the  vibrations  of 
radio  waves,  of  electric  currents  and  voltages  as  in  a  tuned  radio;  and  so  on.  In  such 
problems,  it  is  often  useful  to  find  the  characteristic  vibration  frequencies  of  the 
system  under  consideration  and  the  characteristic  modes  of  vibration.  More  com¬ 
plicated  vibrations  can  then  be  discussed  as  combinations  of  these  simpler  normal 
modes  of  vibration. 


Example  4.  In  Example  3  and  Figure  12.1,  the  two  masses  were  equal  and  all  the  spring 
constants  were  the  same.  Changing  the  spring  constants  to  different  values  doesn’t 
cause  any  problems  but  when  the  masses  are  different,  there  is  a  possible  difficulty 
which  we  want  to  discuss.  Consider  an  array  of  masses  and  springs  as  in  Figure  12.1 
but  with  the  following  masses  and  spring  constants:  2 k,  2 to,  6k,  3 to,  3 k.  We  want 
to  find  the  characteristic  frequencies  and  modes  of  vibration.  Following  our  work  in 
Example  3,  we  write  the  potential  energy  V,  find  the  forces,  write  the  equations  of 
motion,  and  substitute  x  =  —xrx,  and  y  =  —uj2y,  in  order  to  find  the  characteristic 
frequencies.  (Do  the  details :  Problem  11.) 

V  =  ^2kx2  +  ^6k(x  -  y )2  +  ^ 3ky 2  =  ^ k(8x 2  -  12 xy  +  9 y2) 

(  2 mx  =  —dV/dx,  (  —2 rmo2x  =  —k(8x  —  6 y), 

y  3m y  =  —dV/dy,  °r  \  —3 mco2y  =  — fc(— 6x  +  9 y). 


(12.18) 

(12.19) 


Next  divide  each  equation  by  its  mass  and  write  the  equations  in  matrix  form. 


(12.20) 


With  A  =  mu2 /k,  the  eigenvalues  of  the  square  matrix  are  A  =  1  and  A  =  6.  Thus 
the  characteristic  frequencies  of  vibration  are 


(12.21) 


u>i  = 
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The  corresponding  eigenvectors  are: 

(12.22)  For  A  =  1:  y  =  x  or  r  =  (1, 1);  for  A  =  6:  3 y  =  — 2x  or  r  =  (3,  —2). 

Thus  at  frequency  ui\  the  two  masses  oscillate  back  and  forth  together  with  equal 
amplitudes  like  this  <— <—  and  then  like  this  — >.  At  frequency  u> 2  the  two  masses 

oscillate  in  opposite  directions  with  amplitudes  in  the  ratio  3  to  2  like  this  < - >  and 

then  like  this  — . 

Now  we  seem  to  have  solved  the  problem;  where  is  the  difficulty?  Note  that  the 
square  matrix  in  (12.20)  is  not  symmetric  [and  compare  (12.14)  where  the  square 
matrix  was  symmetric].  In  Section  11  we  discussed  the  fact  that  (for  real  matri¬ 
ces)  only  symmetric  matrices  have  orthogonal  eigenvectors  and  can  be  diagonalized 
by  an  orthogonal  transformation.  Here  note  that  the  eigenvectors  in  Example  3 
were  orthogonal  [dot  product  of  (1,1)  and  (1,-1)  is  zero]  but  the  eigenvectors  for 
(12.20)  are  not  orthogonal  [dot  product  of  (1, 1)  and  (3,  —2)  is  not  zero].  If  we  want 
orthogonal  eigenvectors,  we  can  make  the  change  of  variables  (also  see  Example  6) 

(12.23)  X  =  xV2,  Y  =  yV 3, 


where  the  constants  are  the  square  roots  of  the  numerical  factors  in  the  masses  2 m 
and  3m.  (Note  that  geometrically  this  just  amounts  to  different  changes  in  scale 
along  the  two  axes,  not  to  a  rotation.)  Then  (12.20)  becomes 


(12.24) 


k_(  4  -V6\  (X\ 

m  y/6  3  )  ' 


By  inspection  we  see  that  the  characteristic  equation  for  the  square  matrix  in  (12.24) 
is  the  same  as  the  characteristic  equation  for  (12.20)  so  the  eigenvalues  and  the  char¬ 
acteristic  frequencies  are  the  same  as  before  (as  they  must  be  by  physical  reasoning). 
However  the  (12.24)  matrix  is  symmetric  and  so  we  know  that  its  eigenvectors  are 
orthogonal.  By  direct  substitution  of  (12.23)  into  (12.22),  [or  by  solving  for  the 
eigenvectors  in  the  (12.24)  matrix]  we  find  the  eigenvectors  in  the  X,  Y  coordi¬ 
nates: 


(12.25)  For  A  =  1:  R  =  ( X,Y )  =  (y/2,-y/3);  for  A  =  6:  R=  (3\/2,2\/3). 
As  expected,  these  eigenvectors  are  orthogonal. 


Example  5.  Let’s  consider  a  model  of  a  linear  triatomic  molecule  in  which  we  approximate 
the  forces  between  the  atoms  by  forces  due  to  springs  (Figure  12.2). 

m  k  M  k  m 

Figure  12.2 

As  in  Example  3,  let  x,  y,  z  be  the  coordinates  of  the  three  masses  relative  to  their 
equilibrium  positions.  We  want  to  find  the  characteristic  vibration  frequencies  of 
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the  molecule.  Following  our  work  in  Examples  3  and  4,  we  find  (Problem  12) 

(12.26)  V  =  \k(x  —  y )2  +  \k{y  —  z )2  =  \k{x2  +  2  y2  +  z2  —  2  xy  —  2yz), 

{mi  =  —dV/dx  =  —  k(x  —  y), 

My  =  —dV/dy  =  — fc(2y  —  x  —  z), 
mz  =  —dV/dz  =  —k(z  —  y), 

(12.27)  or 

i—muj2x  =  —  k(x  —  y), 

—Moj2y  =  —k{2y  —  x  —  z), 

—mui2z  =  —k(z  —  y). 

We  are  going  to  consider  several  different  ways  of  solving  this  problem  in  order 
to  learn  some  useful  techniques.  First  of  all,  if  we  add  the  three  equations  we  get 

(12.28)  mi  +  My  +  mz  =  0. 

Physically  (12.28)  says  that  the  center  of  mass  is  at  rest  or  moving  at  constant 
speed  (that  is,  has  zero  acceleration).  Since  we  are  just  interested  in  vibrational 
motion,  let’s  assume  that  the  center  of  mass  is  at  rest  at  the  origin.  Then  we  have 
nix  +  My  +  mz  =  0.  Solving  this  equation  for  y  gives 

(12.29)  y=-™{x  +  z). 

Substitute  (12.29)  into  the  second  set  of  equations  in  (12.27)  to  get  the  x  and  z 
equations 


(12.30) 


2  ,  /,  TXl’  \  .TO 

— mto  x  =  —  k(l  H - )x  —  k — -z, 

v  M  M 
i  ,  to  m 

—tow  2  =  —  k — -x  —  k (1  H - -)z. 

M  v  M 


In  matrix  form  equations  (12.30)  become  [compare  (12.14)] 

"1  + 


M 


M 


m  1  _i_  _m 

M  1  '  M 


(12.31)  A 

We  solve  this  eigenvalue  problem  to  find 


with  A 


For  wi  we  find  z  =  —x,  and  consequently  by  (12.29),  y  =  0.  For  w 2,  we  find  z  =  x 
and  so  y  =  —  jffix.  Thus  at  frequency  Wi,  the  central  mass  M  is  at  rest  and  the 
two  masses  to  vibrate  in  opposite  directions  like  this  <—  to  M  to— >  and  then  like 
this  to — >  M  <—  m.  At  the  higher  frequency  to 2,  the  central  mass  M  moves  in 
one  direction  while  the  two  masses  to  move  in  the  opposite  direction,  first  like  this 
to — >  <—M  to — *  and  then  like  this  <—  to  M— >  <—  m. 

Now  suppose  that  we  had  not  thought  about  eliminating  the  translational  motion 
and  had  set  this  problem  up  as  a  3  variable  problem.  Let’s  go  back  to  the  second  set 
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of  equations  in  (12.27),  and  divide  the  x  and  z  equations  by  to  and  the  y  equation 
by  M.  Then  in  matrix  form  these  equations  can  be  written  as 


(12.33) 


With  A  =  tow2 /k,  the  eigenvalues  of  the  square  matrix  are  A  =  0, 1, 1  +  and 
the  corresponding  eigenvectors  are  (check  these) 


(12.34) 


For  A  =  0,  r  =  (1,1,1); 
for  A  =  1,  r  =  (1, 0,  —1); 
r  ,  2to  2 to 

fotA  =  1+TO  r  =  (1’-17’1)- 


We  recognize  the  A  =  0  solution  as  corresponding  to  translation  both  because  w  =  0 
(so  there  is  no  vibration),  and  because  r  =  (1,1,1)  says  that  any  motion  is  the  same 
for  all  three  masses.  The  other  two  modes  of  vibration  are  the  same  ones  we  had 
above.  We  note  that  the  square  matrix  in  (12.33)  is  not  symmetric  and  so,  as 
expected,  the  eigenvectors  in  (12.34)  are  not  an  orthogonal  set.  However,  the  last 
two  (which  correspond  to  vibrations)  are  orthogonal  so  if  we  are  just  interested  in 
modes  of  vibration  we  can  ignore  the  translation  eigenvector.  If  we  want  to  consider 
all  motion  of  the  molecule  along  its  axis  (both  translation  and  vibration),  and  want 
an  orthogonal  set  of  eigenvectors,  we  can  make  the  change  of  variables  discussed  in 
Example  4,  namely 


(12.35)  X  =  x, 

Then  the  eigenvectors  become 

(12.36)  (1,  y/M/m ,  1) 


(1, 0,  — 1),  (1,  —2^/rn/M ,  1) 


which  are  an  orthogonal  set.  The  first  eigenvector  (corresponding  to  translation) 
may  seem  confusing,  looking  as  if  the  central  mass  M  doesn’t  move  with  the  others 
(as  it  must  for  pure  translation).  But  remember  from  Example  4  that  changes  of 
variable  like  (12.23)  and  (12.35)  correspond  to  changes  of  scale,  so  in  the  XYZ 
system  we  are  not  using  the  same  measuring  stick  to  find  the  position  of  the  central 
mass  as  for  the  other  two  masses.  Their  physical  displacements  are  actually  all  the 
same. 


Example  6.  Let’s  consider  Example  4  again  in  order  to  illustrate  a  very  compact  form 
for  the  eigenvalue  equation.  Satisfy  yourself  (Problem  13)  that  we  can  write  the 
potential  energy  V  in  (12.18)  as 

(12.37)  V  =  -krTVr  where  V  =  ^  g  ,  r  =  ,  rT  =  (x  y)  . 

Similarly  the  kinetic  energy  T  =  i(2 mx2  +  3my2)  can  be  written  as 

(12.38)  T  =  iTOrTTr  where  T  =  ^  ^  ,  r  =  ,  rT  =  (a;  y)  . 
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(Notice  that  the  T  matrix  is  diagonal  and  is  a  unit  matrix  when  the  masses  are  equal; 
otherwise  T  has  the  mass  factors  along  the  main  diagonal  and  zeros  elsewhere.)  Now 
using  the  matrices  T  and  V,  we  can  write  the  equations  of  motion  (12.19)  as 


(12.39) 


ATr  =  Vr  where  A  = - 

k 


We  can  think  of  (12.39)  as  the  basic  eigenvalue  equation.  If  T  is  a  unit  matrix,  then 
we  just  have  Ar  =  Vr  as  in  (12.14).  If  not,  then  we  can  multiply  (12.39)  by  T-1  to 
get 


(12.40) 


Ar  =  T_1Vr  = 


as  in  (12.20).  However,  we  see  that  this  matrix  is  not  symmetric  and  so  the  eigen¬ 
vectors  will  not  be  orthogonal.  If  we  want  the  eigenvectors  to  be  orthogonal  as  in 
(12.23),  we  choose  new  variables  so  that  the  T  matrix  is  the  unit  matrix,  that  is 
variables  X  and  Y  so  that 


(12.41)  T  =  i(2  mi2  +  3  my2)  =  \m(X2  +  V2). 

But  this  means  that  we  want  X2  =  2x2  and  Y2  =  3 y2  as  in  (12.23),  or  in  matrix 
form, 


(12.42) 


R  = 


(y/2 

0  V  ( 

“  Wv  " 

V« 

Vs)  { 

i 

w 

II 

'1/V2 

v  0 

0 

i/Vs, 

=  T1/2r 


or 


Substituting  (12.42)  into  (12.39),  we  get  ATT  1^2R  =  VT  1^2R.  Then  multiplying 
on  the  left  by  T-1/2  and  noting  that  T_1/2TT-1/2  =  I,  we  have 


(12.43) 


AR  =  T-1/2VT"1/2R 


as  the  eigenvalue  equation  in  terms  of  the  new  variables  X  and  Y.  Substituting  the 
numerical  T-1/2  from  (12.42)  into  (12.43)  gives  the  result  we  had  in  (12.24). 

We  have  simply  demonstrated  that  (12.39)  and  (12.43)  give  compact  forms  of 
the  eigenvalue  equations  for  Example  4.  However,  it  is  straightforward  to  show 
that  these  equations  are  just  a  compact  summary  of  the  equations  of  motion  for 
any  similar  vibrations  problem,  in  any  number  of  variables,  just  by  writing  the 
potential  and  kinetic  energy  matrices  and  comparing  the  equations  of  motion  in 
matrix  form. 


Example  7.  Find  the  characteristic  frequencies  and  the  characteristic  modes  of  vibration 
for  the  system  of  masses  and  springs  shown  in  Figure  12.3,  where  the  motion  is 
along  a  vertical  line. 

Let’s  use  the  simplified  method  of  Example  6  for  this  problem.  We  first  write  the 
expressions  for  the  kinetic  energy  and  the  potential  energy  as  in  previous  examples. 
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(Note  carefully  that  we  measure  x  and  y  from  the  equilibrium  positions 
of  the  masses  when  they  are  hanging  at  rest;  then  the  gravitational 
forces  are  already  balanced  and  gravitational  potential  energy  does  not 
come  into  the  expression  for  V.) 

T  =  bm{  4x2  +  y2), 

(12.44')  2  v  y  h 

V  =  \k[  3x2  +  (x  —  y)2]  =  \k{  4x2  —  2  xy  +  y2). 


mum 


The  corresponding  matrices  are  [see  equations  (12.37)  and  (12.38)]: 


(12.45) 


k 


As  in  equation  (12.40),  we  find  T  1V  and  its  eigenvalues  and  eigenvec¬ 
tors. 


y  •  m 
) 


1V  (1/ 4  0\  (  4  -l\  _  /  1  — 1/4\  mw2  _  1  3  Figure  12.3 

T  v"Vo  ljv-i  V^l-i  ij’  2‘ 

/  b  I  3  u 

(12.46)  For  w  =  \J  — ,  r=  (1,2);  for  w  =  ^  ,  r  =  (1, -2). 


As  expected  (since  T-1V  is  not  symmetric),  the  eigenvectors  are  not  orthogonal. 
If  we  want  orthogonal  eigenvectors,  we  make  the  change  of  variables  X  =  2x, 
Y  =  y,  to  find  the  eigenvectors  R  =  (1, 1)  and  R  =  (1,  —1)  which  are  orthogonal. 
Alternatively,  we  can  find  the  matrix  T-1/2VT-1/2 

<->  (T  ?)  (-( -;)  (f  o  -  (-U  -1/2)  • 

and  find  its  eigenvalues  and  eigenvectors. 


PROBLEMS,  SECTION  12 

1.  Verify  that  (12.2)  multiplied  out  is  (12.1). 

Find  the  equations  of  the  following  conics  and  quadric  surfaces  relative  to  principal  axes. 

2.  2*2  +  4a:y  —  y2  =  24 

3.  8*2  +  8xj /  +  2y2  =  35 

4.  3®2  +  8a:y  —  3  y2  =  8 

5.  5x2  +  3  y2  +  2  z2  +  =  14 

6.  x2  +  y2  +  z2  +  4a;y  +  2xz  —  2yz  =  12 

7.  x2  +  3y2  +  3  z2  +  4  xy  +  \xz  =  60 

8.  Carry  through  the  details  of  Example  2  to  find  the  unit  eigenvectors.  Show  that  the 
resulting  rotation  matrix  C  is  orthogonal.  Hint:  Find  CCT. 

9.  For  Problems  2  to  7,  find  the  rotation  matrix  C  which  relates  the  principal  axes  and 
the  original  axes.  See  Example  2. 

10.  Verify  equations  (12.13)  and  (12.14).  Solve  (12.15)  to  find  the  eigenvalues  and  verify 
(12.16).  Find  the  corresponding  eigenvectors  as  stated  in  (12.17). 
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11.  Verify  the  details  of  Example  4,  equations  (12.18)  to  (12.25). 

12.  Verify  the  details  of  Example  5,  equations  (12.26)  to  (12.36). 

13.  Verify  the  details  of  Example  6,  equations  (12.37)  to  (12.43). 

Find  the  characteristic  frequencies  and  the  characteristic  modes  of  vibration  for  systems 
of  masses  and  springs  as  in  Figure  12.1  and  Examples  3,  4,  and  6  for  the  following  arrays. 


14. 

k,  m,  2k,  m,  k 

15. 

5 k,  m,  2k,  m,  2k 

16. 

4 k,  m,  2k,  m,  k 

17. 

3k,  3m,  2k,  Am,  2k 

18. 

2k,  m,  k,  5m,  10/c 

19. 

Ak,  2m,  k,  m,  k 

20. 

Carry  through  the  details  of  Example  7. 

Find  the  characteristic  frequencies  and  the  characteristic  modes  of  vibration  as  in  Ex¬ 
ample  7  for  the  following  arrays  of  masses  and  springs,  reading  from  top  to  bottom  in  a 
diagram  like  Figure  12.3. 

21.  3k,m,2k,m  22.  Ak,3m,k,m  23.  2k,  4m,  k,  2m 

13.  A  BRIEF  INTRODUCTION  TO  GROUPS 

We  will  not  go  very  far  into  group  theory — there  are  whole  books  on  the  subject 
as  well  as  on  its  applications  in  physics.  But  since  so  many  of  the  ideas  we  are 
discussing  in  this  chapter  are  involved,  it  is  interesting  to  have  a  quick  look  at 
groups. 

Example  1.  Think  about  the  four  numbers  ±1,  ±i.  Notice  that  no  matter  what  products 
and  powers  of  them  we  compute,  we  never  get  any  numbers  besides  these  four. 
This  property  of  a  set  of  elements  with  a  law  of  combination  is  called  closure.  Now 
think  about  these  numbers  written  in  polar  form:  e777/2,  e77r,  e 3l7r/2,  e2*77  =  1,  or  the 
corresponding  rotations  of  a  vector  (in  the  xy  plane  with  tail  at  the  origin) ,  or  the 
set  of  rotation  matrices  corresponding  to  these  successive  90°  rotations  of  a  vector 
(Problem  1).  Note  also  that  these  numbers  are  the  four  fourth  roots  of  1,  so  we 
could  write  them  as  A,  A2,  A3,  A4  =  1.  All  these  sets  are  examples  of  groups,  or 
more  precisely,  they  are  all  representations  of  the  same  group  known  as  the  cyclic 
group  of  order  4.  We  will  be  particularly  interested  in  groups  of  matrices,  that  is,  in 
matrix  representations  of  groups,  since  this  is  very  important  in  applications.  Now 
just  what  is  a  group? 

Definition  of  a  Group  A  group  is  a  set  {A,  B,C,-  ■  ■}  of  elements — which  may 
be  numbers,  matrices,  operations  (such  as  the  rotations  above) — together  with  a  law 
of  combination  of  two  elements  (often  called  the  “product”  and  written  as  AB — see 
discussion  below)  subject  to  the  following  four  conditions. 

1.  Closure:  The  combination  of  any  two  elements  is  an  element  of  the  group. 

2.  Associative  law:  The  law  of  combination  satisfies  the  associative  law: 

(AB)C  =  A(BC). 

3.  Unit  element:  There  is  a  unit  element  /  with  the  property  that  I A  =  AI  =  A 
for  every  element  of  the  group. 
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4.  Inverses:  Every  element  of  the  group  has  an  inverse  in  the  group;  that  is,  for 
any  element  A  there  is  an  element  B  such  that  AB  =  BA  =  I. 

We  can  easily  verify  that  these  four  conditions  are  satisfied  for  the  set  ±1,  ±i 
under  multiplication. 

1.  We  have  already  discussed  closure. 

2.  Multiplication  of  numbers  is  associative. 

3.  The  unit  element  is  1. 

4.  The  numbers  i  and  — z  are  inverses  since  their  product  is  1;  —1  is  its  own 
inverse,  and  1  is  its  own  inverse. 

Thus  the  set  ±1,  ±z,  under  the  operation  of  multiplication,  is  a  group.  The  order 
of  a  finite  group  is  the  number  of  elements  in  the  group.  When  the  elements  of  a 
group  of  order  n  are  of  the  form  A ,  A2,  A3,  •  •  • ,  An  =  1,  it  is  called  a  cyclic  group. 
Thus  the  group  ±1,  ±z,  under  multiplication,  is  a  cyclic  group  of  order  4  as  we 
claimed  above. 

A  subgroup  is  a  subset  which  is  itself  a  group.  The  whole  group,  or  the  unit 
element,  are  called  trivial  subgroups ;  any  other  subgroup  is  called  a  proper  subgroup. 
The  group  ±1,  ±z  has  the  proper  subgroup  ±1. 

Product,  Multiplication  Table  In  the  definition  of  a  group  and  in  the  discus¬ 
sion  so  far,  we  have  used  the  term  “product”  and  have  written  AB  for  the  combi¬ 
nation  of  two  elements.  However,  terms  like  “product”  or  “multiplication”  are  used 
here  in  a  generalized  sense  to  refer  to  whatever  the  operation  is  for  combining  group 
elements.  In  applications,  group  elements  are  often  matrices  and  the  operation  is 
matrix  multiplication.  In  general  mathematical  group  theory,  the  operation  might 
be,  for  example,  addition  of  two  elements,  and  that  sounds  confusing  to  say  “prod¬ 
uct”  when  we  mean  sum!  Look  at  one  of  the  first  examples  we  discussed,  namely 
the  rotation  of  a  vector  by  angles  7t/2,  7t,  37t/2,  27t  or  0.  If  the  group  elements  are 
rotation  matrices,  then  we  multiply  them,  but  if  the  group  elements  are  the  angles, 
then  we  add  them.  But  the  physical  problem  is  exactly  the  same  in  both  cases.  So 
remember  that  group  multiplication  refers  to  the  law  of  combination  for  the  group 
rather  than  just  to  ordinary  multiplication  in  arithmetic. 

Multiplication  tables  for  groups  are  very  useful;  equations  (13.1),  (13.2),  and 
(13.4)  show  some  examples.  Look  at  (13.1)  for  the  group  ±1,  ±z.  The  first  column 
and  the  top  row  (set  off  by  lines)  list  the  group  elements.  The  sixteen  possible 
products  of  these  elements  are  in  the  body  of  the  table.  Note  that  each  element  of 
the  group  appears  exactly  once  in  each  row  and  in  each  column  (Problem  3).  At 
the  intersection  of  the  row  starting  with  i  and  the  column  headed  by  —i,  you  find 
the  product  (z)(— i)  =  1,  and  similarly  for  the  other  products. 


i 

i 

-i 

—i 

1 

1 

i 

-1 

—i 

i 

i 

-1 

—i 

1 

-1 

-1 

—i 

1 

i 

—i 

—i 

1 

i 

-1 

(13.1) 
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In  (13.2)  below,  note  that  you  add  the  angles  as  we  discussed  above.  However, 
it’s  not  quite  just  adding — it’s  really  the  familiar  process  of  adding  angles  until 
you  get  to  2n  and  then  starting  over  again  at  zero.  In  mathematical  language  this 
is  called  adding  (mod  27t)  and  we  write  7r/2  +  37r/2  =  0  (mod  27t).  Hours  on  an 
ordinary  clock  add  in  a  similar  way.  If  it’s  10  o’clock  and  then  4  hours  elapse,  the 
clock  says  it’s  2  o’clock.  We  write  10  +  4  =  2  (mod  12).  (See  Problems  6  and  7  for 
more  examples.) 


(13.2) 


Two  groups  are  called  isomorphic  if  their  multiplication  tables  are  identical 
except  for  the  names  we  attach  to  the  elements  [compare  (13.1)  and  (13.2)].  Thus 
all  the  4-element  groups  we  have  discussed  so  far  are  isomorphic  to  each  other,  that 
is,  they  are  really  all  the  same  group.  However,  there  are  two  different  groups  of 
order  4,  the  cyclic  group  we  have  discussed,  and  another  group  called  the  4’s  group 
(see  Problem  4). 

Symmetry  Group  of  the  Equilateral  Triangle  Con¬ 
sider  three  identical  atoms  at  the  corners  of  an  equilateral 
triangle  in  the  xy  plane,  with  the  center  of  the  triangle  at  the 
origin  as  shown  in  Figure  13.1.  What  rotations  and  reflec¬ 
tions  of  vectors  in  the  xy  plane  (as  in  Section  7)  will  produce 
an  identical  array  of  atoms?  By  considering  Figure  13.1,  we 
see  that  there  are  three  possible  rotations:  0°,  120°,  240°, 
and  three  possible  reflections,  through  the  three  lines  F,  G,  Figure  13.1 
H  (lines  along  the  altitudes  of  the  triangle).  Think  of  mov¬ 
ing  just  the  triangle  (that  is,  the  atoms),  leaving  the  axes  and  the  lines  F,  G ,  H 
fixed  in  the  background.  As  in  Section  7,  we  can  write  a  2  by  2  rotation  or  reflection 
matrix  for  each  of  these  six  transformations  and  set  up  a  multiplication  table  to 
show  that  they  do  form  a  group  of  order  6.  This  group  is  called  the  symmetry  group 
of  the  equilateral  triangle.  We  find  (Problem  8) 


(13.3) 


Identity,  0°  rotation 
120°  rotation 
240°  rotation 

Reflection  through  line  F  (y  axis) 
Reflection  through  line  G 
Reflection  through  line  H 
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The  group  multiplication  table  is: 


I 

A 

B 

F 

G 

H 

I 

I 

A 

B 

F 

G 

H 

A 

A 

B 

I 

G 

H 

F 

B 

B 

I 

A 

H 

F 

G 

F 

F 

H 

G 

I 

B 

A 

G 

G 

F 

H 

A 

I 

B 

H 

H 

G 

F 

B 

A 

I 

Note  here  that  GF  =  A,  but  FG  =  B,  not  surprising  since  we  know  that  matrices 
don’t  always  commute.  In  group  theory,  if  every  two  group  elements  commute,  the 
group  is  called  Abelian.  Our  previous  group  examples  have  all  been  Abelian,  but 
the  group  in  (13.4)  is  not  Abelian. 

This  is  just  one  example  of  a  symmetry  group.  Group  theory  is  so  important  in 
applications  because  it  offers  a  systematic  way  of  using  the  symmetry  of  a  physical 
problem  to  simplify  the  solution.  As  we  have  seen,  groups  can  be  represented  by 
sets  of  matrices,  and  this  is  widely  used  in  applications. 

Conjugate  Elements,  Class,  Character  Two  group  elements  A  and  B  are 
called  conjugate  elements  if  there  is  a  group  element  C  such  that  C_1AC  =  B. 
By  letting  C  be  successively  one  group  element  after  another,  we  can  find  all  the 
group  elements  conjugate  to  A.  This  set  of  conjugate  elements  is  called  a  class. 
Recall  from  Section  11  that  if  A  is  a  matrix  describing  a  transformation  (such  as  a 
rotation  or  some  sort  of  mapping  of  a  space  onto  itself),  then  B  =  C_1AC  describes 
the  same  mapping  but  relative  to  a  different  set  of  axes  (different  basis).  Thus  all 
the  elements  of  a  class  really  describe  the  same  mapping,  just  relative  to  different 
bases. 


Example  2.  Find  the  classes  for  the  group  in  (13.3)  and  (13.4).  We  find  the  elements 
conjugate  to  F  as  follows  [use  (13.4)  to  find  inverses  and  products]: 


(13.5) 


I_1FI  =  F; 

A”1  FA  =  BFA  =  BH  =  G; 
B_1FB  =  AFB  =  AG  =  H; 
F  'FF  =  F; 

G_1FG  =  GFG  =  GB  =  H; 
H_1FH  =  HFH  =  HA  =  G. 


Thus  the  elements  F,  G,  and  H  are  conjugate  to  each  other  and  form  one  class.  You 
can  easily  show  (Problem  12)  that  elements  A  and  B  are  another  class,  and  the  unit 
element  I  is  a  class  by  itself.  Now  notice  what  we  observed  above.  The  elements 
F,  G,  and  H  all  just  interchange  two  atoms,  that  is,  all  of  them  do  the  same  thing, 
just  seen  from  a  different  viewpoint.  The  elements  A  and  B  rotate  the  atoms,  A  by 
120°  and  B  by  240°  which  is  the  same  as  120°  looked  at  upside  down.  And  finally 
the  unit  element  I  leaves  things  unchanged  so  it  is  a  class  by  itself.  Notice  that  a 
class  is  not  a  group  (except  for  the  class  consisting  of  I)  since  a  group  must  contain 
the  unit  element.  So  a  class  is  a  subset  of  a  group,  but  not  a  subgroup. 
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Recall  from  (9.13)  and  Problem  11.10  that  the  trace  of  a  matrix  (sum  of  diagonal 
elements)  is  not  changed  by  a  similarity  transformation.  Thus  all  the  matrices  of  a 
class  have  the  same  trace.  Observe  that  this  is  true  for  the  group  (13.3):  Matrix  I 
has  trace  =  2,  A  and  B  have  trace  =  —  ^  =  —  1,  and  F,  G,  and  H  have  trace 

=  0.  In  this  connection,  the  trace  of  a  matrix  is  called  its  character ,  so  we  see  that 
all  matrices  of  a  class  have  the  same  character.  Also  note  that  we  could  write  the 
matrices  (13.3)  in  (infinitely)  many  other  ways  by  rotating  the  reference  axes,  that 
is,  by  performing  similarity  transformations.  But  since  similarity  transformations 
do  not  change  the  trace,  that  is,  the  character,  we  now  have  a  number  attached 
to  each  class  which  is  independent  of  the  particular  choice  of  coordinate  system 
(basis).  Classes  and  their  associated  character  are  very  important  in  applications 
of  group  theory. 

One  more  number  is  important  here,  and  that  is  the  dimension  of  a  representa¬ 
tion.  In  (13.3),  we  used  2  by  2  matrices  (2  dimensions),  but  it  would  be  possible 
to  work  in  3  dimensions.  Then,  for  example,  the  A  matrix  would  describe  a  120° 
rotation  around  the  z  axis  and  would  be 

/— 1/2  — v/3/2  0\ 

(13.6)  A  =  I  v/3/2  -1/2  0  , 

\0  0  1/ 

and  the  other  matrices  in  (13.3)  would  have  a  similar  form,  called  block  diagonalized. 
But  now  the  traces  of  all  the  matrices  are  increased  by  1.  To  avoid  having  any 
ambiguity  about  character,  we  use  what  are  called  “irreducible  representations”  in 
finding  character;  let’s  discuss  this. 

Irreducible  Representations  A  2-dimensional  representation  is  called  reducible 
if  all  the  group  matrices  can  be  diagonalized  by  the  same  unitary  similarity  transfor¬ 
mation  (that  is,  the  same  change  of  basis).  For  example,  the  matrices  in  Problem  1 
and  the  matrices  in  Problem  4  both  give  2-dimensional  reducible  representations 
of  their  groups  (see  Problems  13,  15,  and  16).  On  the  other  hand,  the  matrices  in 
(13.3)  cannot  be  simultaneously  diagonalized  (see  Problem  13),  so  (13.3)  is  called  a 
2-dimensional  irreducible  representation  of  the  equilateral  triangle  symmetry  group. 
If  a  group  of  3  by  3  matrices  can  all  be  either  diagonalized  or  put  in  the  form  of 

(13.6)  (block  diagonalized)  by  the  same  unitary  similarity  transformation,  then  the 
representation  is  called  reducible;  if  not,  it  is  a  3-dimensional  irreducible  representa¬ 
tion.  For  still  larger  matrices,  imagine  the  matrices  block  diagonalized  with  blocks 
along  the  main  diagonal  which  are  the  matrices  of  irreducible  representations. 

Thus  we  see  that  any  representation  is  made  up  of  irreducible  representations. 
For  each  irreducible  representation,  we  find  the  character  of  each  class.  Such  lists 
are  known  as  character  tables,  but  their  construction  is  beyond  our  scope. 

Infinite  Groups  Here  we  survey  some  examples  of  infinite  groups  as  well  as  some 
sets  which  are  not  groups. 

(13.7) 

(a)  The  set  of  all  integers,  positive,  negative,  and  zero,  under  ordinary  addition, 
is  a  group.  Proof :  The  sum  of  two  integers  is  an  integer.  Ordinary  addition 
obeys  the  associative  law.  The  unit  element  is  0.  The  inverse  of  the  integer  N 
is  — N  since  N  +  (— N)  =  0. 


Section  13 


A  Brief  Introduction  to  Groups  177 


(b)  The  same  set  under  ordinary  multiplication  is  not  a  group  because  0  has  no 
inverse.  But  even  if  we  omit  0,  the  inverses  of  the  other  integers  are  fractions 
which  are  not  in  the  set. 

(c)  Under  ordinary  multiplication,  the  set  of  all  rational  numbers  except  zero,  is 
a  group.  Proof:  The  product  of  two  rational  numbers  is  a  rational  number. 
Ordinary  multiplication  is  associative.  The  unit  element  is  1,  and  the  inverse 
of  a  rational  number  is  just  its  reciprocal. 

Similarly,  you  can  show  that  the  following  sets  are  groups  under  ordinary  mul¬ 
tiplication  (Problem  17):  All  real  numbers  except  zero,  all  complex  numbers 
except  zero,  all  complex  numbers  re1 6  with  r  =  1. 

(d)  Ordinary  subtraction  or  division  cannot  be  group  operations  because  they  don’t 
satisfy  the  associative  law;  for  example,  x—(y—z )  ^  (x  —  y)  —  z.  (Problem  18.) 

(e)  The  set  of  all  orthogonal  2  by  2  matrices  under  matrix  multiplication  is  a  group 
called  0(2).  If  the  matrices  are  required  to  be  rotation  matrices,  that  is,  have 
determinant  +1,  the  set  is  a  group  called  SO(2)  (the  S  stands  for  special). 
Similarly,  the  following  sets  of  matrices  are  groups  under  matrix  multiplication: 
The  set  of  all  orthogonal  3  by  3  matrices,  called  0(3);  its  subgroup  SO(3) 
with  determinant  =  1;  or  the  corresponding  sets  of  orthogonal  matrices  of  any 
dimension  n,  called  0(?i)  and  SO(n).  (Problem  19.) 

(f)  The  set  of  all  unitary  n  by  n  matrices,  n  =  1,  2, 3,  •  •  • ,  called  U(n),  is  a  group 
under  matrix  multiplication,  and  its  subgroup  SU(n)  of  unitary  matrices  with 
determinant  =  1  is  also  a  group.  Proof:  We  have  repeatedly  noted  that  matrix 
multiplication  is  associative  and  that  the  unit  matrix  is  the  unit  element  of  a 
group  of  matrices.  So  we  just  need  to  check  closure  and  inverses.  The  product  of 
two  unitary  matrices  is  unitary  (see  Section  9) .  If  two  matrices  have  determinant 
=  1,  their  product  has  determinant  =  1  [see  equation  (6.6)].  The  inverse  of  a 
unitary  matrix  is  unitary  (see  Problem  9.25). 


PROBLEMS,  SECTION  13 

1.  Write  the  four  rotation  matrices  for  rotations  of  vectors  in  the  xy  plane  through 
angles  90°,  180°,  270°,  360°  (or  0°)  [see  equation  (7.12)].  Verify  that  these  4  matrices 
under  matrix  multiplication  satisfy  the  four  group  requirements  and  are  a  matrix 
representation  of  the  cyclic  group  of  order  4.  Write  their  multiplication  table  and 
compare  with  Equations  (13.1)  and  (13.2). 

2.  Following  the  text  discussion  of  the  cyclic  group  of  order  4,  and  Problem  1,  discuss 

(a)  the  cyclic  group  of  order  3  (see  Chapter  2,  Problem  10.32); 

(b)  the  cyclic  group  of  order  6. 

3.  Show  that,  in  a  group  multiplication  table,  each  element  appears  exactly  once  in 
each  row  and  in  each  column.  Hint:  Suppose  that  an  element  appears  twice,  and 
show  that  this  leads  to  a  contradiction,  namely  that  two  elements  assumed  different 
are  the  same  element. 

Show  that  the  matrices 


1  = 


4. 
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under  matrix  multiplication,  form  a  group.  Write  the  group  multiplication  table  to 
see  that  this  group  (called  the  4’s  group)  is  not  isomorphic  to  the  cyclic  group  of 
order  4  in  Problem  1.  Show  that  the  4’s  group  is  Abelian  but  not  cyclic. 

5.  Consider  the  group  of  order  4  with  unit  element  I  and  other  elements  A,  B,  C, 

where  AB  =  BA  =  C,  and  A2  =  B2  =  I.  Write  the  group  multiplication  table  and 

verify  that  it  is  a  group.  There  are  two  groups  of  order  4  (discussed  in  Problems  1 
and  4).  To  which  is  this  one  isomorphic?  Hint:  Compare  the  multiplication  tables. 

6.  Consider  the  integers  0,  1,  2,  3  under  addition  (mod  4).  Write  the  group  “multipli¬ 

cation”  table  and  show  that  you  have  a  group  of  order  4.  Is  this  group  isomorphic 
to  the  cyclic  group  of  order  4  or  to  the  4’s  group? 

7.  Consider  the  set  of  numbers  1,  3,  5,  7  with  multiplication  (mod  8)  as  the  law  of 
combination.  Write  the  multiplication  table  to  show  that  this  is  a  group.  [To 
multiply  two  numbers  (mod  8),  you  multiply  them  and  then  take  the  remainder 
after  dividing  by  8.  For  example,  5  x  7  =  35  =  3(mod  8).]  Is  this  group  isomorphic 
to  the  cyclic  group  of  order  4  or  to  the  4’s  group? 

8.  Verify  (13.3)  and  (13.4).  Hints:  For  the  rotation  and  reflection  matrices,  see  Sec¬ 
tion  7.  In  checking  the  multiplication  table,  be  sure  you  are  multiplying  the  matrices 
in  the  right  order.  Remember  that  matrices  are  operators  on  the  vectors  in  the  plane 
(Section  7),  and  matrices  may  not  commute.  GFA  means  apply  A,  then  F,  then  G. 

9.  Show  that  any  cyclic  group  is  Abelian.  Hint:  Does  a  matrix  commute  with  itself? 

10.  As  we  did  for  the  equilateral  triangle,  find  the  symmetry  group  of  the  square.  Hints: 
Draw  the  square  with  its  center  at  the  origin  and  its  sides  parallel  to  the  x  and  y 
axes.  Find  a  set  of  eight  2  by  2  matrices  (4  rotation  and  4  reflection)  which  map  the 
square  onto  itself,  and  write  the  multiplication  table  to  show  that  you  have  a  group. 

11.  Do  Problem  10  for  a  rectangle.  Note  that  now  only  two  rotations  and  2  reflections 
leave  the  rectangle  unchanged.  So  you  have  a  group  of  order  4.  To  which  is  it 
isomorphic,  the  cyclic  group  or  the  4’s  group? 

12.  Verify  (13.5)  and  then  also  show  that  A,  B  are  the  elements  of  a  class,  and  that  I  is 
a  class  by  itself.  Show  that  it  will  always  be  true  in  any  group  that  I  is  a  class  by 
itself.  Hint:  What  is  C_1IC  for  any  element  C  of  a  group? 

13.  Using  the  discussion  of  simultaneous  diagonalization  at  the  end  of  Section  11,  show 
that  the  2-dimensional  matrices  in  Problems  1  and  4  are  reducible  representations 
of  their  groups,  and  the  matrices  in  (13.5)  give  an  irreducible  representation  of  the 
equilateral  triangle  symmetry  group.  Hint:  Look  at  the  multiplication  tables  to  see 
which  matrices  commute. 

14.  Use  the  multiplication  table  you  found  in  Problem  10  to  find  the  classes  in  the 
symmetry  group  of  a  square.  Show  that  the  2  by  2  matrices  you  found  are  an 
irreducible  representation  of  the  group  (see  Problem  13),  and  find  the  character  of 
each  class  for  that  representation.  Note  that  it  is  possible  for  the  character  to  be 
the  same  for  two  classes,  but  it  is  not  possible  for  the  character  of  two  elements  of 
the  same  class  to  be  different. 

15.  By  Problem  13,  you  know  that  the  matrices  in  Problem  4  are  a  reducible  repre¬ 
sentation  of  the  4’s  group,  that  is  they  can  all  be  diagonalized  by  the  same  unitary 
similarity  transformation  (in  this  case  orthogonal  since  the  matrices  are  symmetric). 
Demonstrate  this  directly  by  finding  the  matrix  C  and  diagonalizing  all  4  matrices. 

16.  Do  Problem  15  for  the  group  of  matrices  you  found  in  Problem  1.  Be  careful  here — 
you  are  working  in  a  complex  vector  space  and  your  C  matrix  will  be  unitary  but 
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not  orthogonal  (see  Sections  10  and  11).  Comment:  Not  surprisingly,  the  numbers 
1,  i,  —1,  —i  give  a  1-dimensional  representation — note  that  a  single  number  can  be 
thought  of  as  a  1-dimensional  matrix. 

17.  Verify  that  the  sets  listed  in  (13.7c)  are  groups. 

18.  Show  that  division  cannot  be  a  group  operation.  Hint:  See  (13. 7d). 

19.  Verify  that  the  sets  listed  in  (13. 7e)  are  groups.  Hint:  See  the  proofs  in  (13. 7f). 

20.  Is  the  set  of  all  orthogonal  3-by-3  matrices  with  determinant  =  —  1  a  group?  If  so, 
what  is  the  unit  element? 

21.  Is  the  group  SO(2)  Abelian?  What  about  SO (3)?  Hint:  See  the  discussion  following 
equation  (6.14). 

14.  GENERAL  VECTOR  SPACES 

In  this  section  we  are  going  to  introduce  a  generalization  of  our  picture  of  vector 
spaces  which  is  of  great  importance  in  applications.  This  will  be  merely  an  intro¬ 
duction  because  the  ideas  here  will  be  used  in  many  of  the  following  chapters  as 
you  will  discover.  The  basic  idea  will  be  to  set  up  an  outline  of  the  requirements  for 
3-dimensional  vector  spaces  (as  we  listed  the  requirements  for  a  group),  and  then 
show  that  these  familiar  3-dimensional  vector  space  requirements  are  satisfied  by 
sets  of  things  like  functions  or  matrices  which  we  would  not  ordinarily  think  of  as 
vectors. 

Definition  of  a  Vector  Space  A  vector  space  is  a  set  of  elements  {U,  V,  W,  •  •  •  } 
called  vectors,  together  with  two  operations:  addition  of  vectors,  and  multiplication 
of  a  vector  by  a  scalar  (which  for  our  purposes  will  be  a  real  or  a  complex  number), 
and  subject  to  the  following  requirements: 

1.  Closure:  The  sum  of  any  two  vectors  is  a  vector  in  the  space. 

2.  Vector  addition  is: 

(a)  commutative:  U  +  V  =  V  +  U, 

(b)  associative:  (U  +  V)  +  W  =  U  +  (V  +  W). 

3.  (a)  There  is  a  zero  vector  0  such  that  0  +  V  =  V  +  0  =  V  for  every  element 

V  in  the  space. 

(b)  Every  element  V  has  an  additive  inverse  (—V)  such  that  V  +  (—V)  =  0. 

4.  Multiplication  of  vectors  by  scalars  has  the  expected  properties: 

(a)  k( U  +  V)  =  fcU  +  fcV; 

(b)  (*!  +  fc2)V  =  foV  +  k2V; 

(c)  (feife)V  =  fcrtfeV); 

(d)  0  •  V  =  0,  and  1  •  V  =  V. 

You  should  go  over  these  and  satisfy  yourself  that  they  are  all  true  for  ordinary  two 
and  three  dimensional  vector  spaces.  Now  let’s  look  at  some  examples  of  things  we 
don’t  usually  think  of  as  vectors  which,  nevertheless,  satisfy  the  above  requirements. 
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Example  1.  Consider  the  set  of  polynomials  of  the  third  degree  or  less,  namely  functions 
of  the  form  f(x )  =  ao  +  a\x  +  a 2X2  +  a^x3.  Is  this  a  vector  space?  If  so,  find  a 
basis.  What  is  the  dimension  of  the  space? 

We  go  over  the  requirements  listed  above: 

1.  The  sum  of  two  polynomials  of  degree  <  3  is  a  polynomial  of  degree  <  3  and 
so  is  a  member  of  the  set. 

2.  Addition  of  algebraic  expressions  is  commutative  and  associative. 

3.  The  “zero  vector”  is  the  polynomial  with  all  coefficients  a*  equal  to  0,  and 
adding  it  to  any  other  polynomial  just  gives  that  other  polynomial.  The 
additive  inverse  of  a  function  /( x)  is  just  —/(#),  and  —f(x)  +  f(x)  =  0  as 
required  for  a  vector  space. 

4.  All  the  listed  familiar  rules  are  just  what  we  do  every  time  we  work  with 
algebraic  expressions. 

So  we  have  a  vector  space!  Now  let’s  try  to  find  a  basis  for  it.  Consider  the  set 
of  functions:  {1,  x,  x2,  x3}.  They  span  the  space  since  any  polynomial  of  degree  <  3 
is  a  linear  combination  of  them.  You  can  easily  show  (Problem  1)  by  computing 
the  Wronskian  [equation  (8.5)]  that  they  are  linearly  independent.  Therefore  they 
are  a  basis,  and  since  there  are  4  basis  vectors,  the  dimension  of  the  space  is  4. 

Example  2.  Consider  the  set  of  linear  combinations  of  the  functions 

{elx,  e~lx,  sinx,  cosx,  rrsina;}. 

It  is  straightforward  to  verify  that  all  our  requirements  above  are  met  (Problem  1). 
To  find  a  basis,  we  must  find  a  linearly  independent  set  of  functions  which  spans  the 
space.  We  note  that  the  given  functions  are  not  linearly  independent  since  elx  and 
e~lx  are  linear  combinations  of  sin  x  and  cos  a:  (Chapter  2,  Section  4).  However,  the 
set  {sin  a;,  cos  a;,  a:  sin  a;}  is  a  linearly  independent  set  and  it  spans  the  space.  So 
this  is  a  possible  basis  and  the  dimension  of  the  space  is  3.  Another  possible  basis 
would  be  {elx,  e~lx,  a;  sin  a;}.  You  will  meet  sets  of  functions  like  these  as  solutions 
of  differential  equations  ( see  Chapter  8,  Problems  5.13  to  5.18). 

Example  3.  Modify  Example  1  to  consider  the  set  of  polynomials  of  degree  <  3  with 
/( 1)  =  1.  Is  this  a  vector  space?  Suppose  we  add  two  of  the  polynomials;  then  the 
value  of  the  sum  at  x  =  1  is  2,  so  it  is  not  an  element  of  the  set.  Thus  requirement  1 
is  not  satisfied  so  this  is  not  a  vector  space.  Note  that  a  subset  of  the  vectors  of 
a  vector  space  is  not  necessarily  a  subspace.  On  the  other  hand,  if  we  consider 
polynomials  of  degree  <  3  with  /( 1)  =  0,  then  the  sum  of  two  of  them  is  zero  at 
x  =  1;  this  is  a  vector  space.  You  can  easily  verify  (Problem  1)  that  it  is  a  subspace 
of  dimension  3  and  a  possible  basis  is  {x  —  1,  x2  —  1,  a:3  —  1}. 

Example  4.  Consider  the  set  of  all  polynomials  of  any  degree  <  N.  The  sum  of  two 
polynomials  of  degree  <  N  is  another  such  polynomial,  and  you  can  easily  verify 
(Problem  1)  that  the  rest  of  the  requirements  are  met,  so  this  is  a  vector  space.  A 
simple  choice  of  basis  is  the  set  of  powers  of  x  from  x°  =  1  to  xN .  Thus  we  see  that 
the  dimension  of  this  space  is  N  +  1. 
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Example  5.  Consider  the  set  of  all  2  by  3  matrices  with  matrix  addition  as  the  law  of 
combination,  and  multiplication  by  scalars  defined  as  in  Section  6.  Recall  that  you 
add  matrices  by  adding  corresponding  elements.  Thus  a  sum  of  two  2  by  3  matrices 
is  another  2  by  3  matrix.  For  matrix  addition  and  multiplication  by  scalars,  it 
is  straightforward  to  show  that  the  other  requirements  listed  above  are  satisfied 
(Problem  1).  As  a  basis,  we  could  use  the  six  matrices: 


10  0 
0  0  0 

0  0  0 
10  0 


/  0  1  0\  (0  0  l\ 

\0  0  oj’  ^0  0  0J  ’ 

/ 0  0  0\  (0  0  0\ 

Vo  1  0)  ’  Vo  0  l)  ■ 


Satisfy  yourself  that  these  are  linearly  independent  and  that  they  span  the  space 
(that  is,  that  you  could  write  any  2  by  3  matrix  as  a  linear  combination  of  these 
six).  Since  there  are  6  basis  vectors,  the  dimension  of  this  space  is  6. 


Inner  Product,  Norm,  Orthogonality  The  definitions  of  these  terms  need  to 
be  generalized  when  our  “vectors”  are  functions,  that  is,  we  want  to  generalize 
equations  (10.1)  to  (10.3).  A  natural  generalization  of  a  sum  is  an  integral,  so 
we  might  reasonably  replace  AiBi  by  f  A(x)B(x )  dx,  and  A?  by  f[A(x)]2  dx. 
However,  in  applications  we  frequently  want  to  consider  complex  functions  of  the 
real  variable  x  (for  example,  elx  as  in  Example  2).  Thus,  given  functions  A(x)  and 
B  (x)  on  a  <  x  <  b,  we  define 


(14.1) 

(14.2) 

(14.3) 


[Inner  Product  of  A (x)  and  B(x)]  =  /  A*(x)B(x)  dx, 

J  a 

[Norm  of  A(x)\  =  ||A(:r)||  =  J  A*{x)A{x)  dx, 

fb 

A(x)  and  B(x)  are  orthogonal  on  (a,  b)  if  /  A*(x)B(x)  dx  =  0. 

J  a 


Let’s  now  generalize  our  definition  (14.1)  of  inner  product  still  further.  Let  A , 
B,  C,  ■  ■  ■  be  elements  of  a  vector  space,  and  let  a,  b,  c,  •  •  •  be  scalars.  We  will  use 
the  bracket  (A\B)  to  mean  the  inner  product  of  A  and  B.  This  vector  space  is  called 
an  inner  product  space  if  an  inner  product  is  defined  subject  to  the  conditions: 


(14.4a)  (A\B)*  =  (B\A); 

(14.4b)  (A|A)  >  0,  (A\A)  =  0  if  and  only  if  A  =  0; 

(14.4c)  {C\aA  +  bB)  =  a(C\A)  +  b(C\B). 

(See  Problem  11.)  It  follows  from  (14.4)  that  (Problem  12) 

(14.5a)  (aA  +  bB\C)  =  a*(A\C)  +  b*(B\C),  and 

(14.5b)  (aA\bB)  =  a*b(A\B). 

You  will  find  various  other  notations  for  the  inner  product,  such  as  ( A ,  B )  or  [A,  B\ 
or  ( A,  B ).  The  notation  (A\B)  is  used  in  quantum  mechanics.  Most  mathematics 
books  put  the  complex  conjugate  on  the  second  factor  in  (14.1)  and  make  the  cor¬ 
responding  changes  in  (14.4)  and  (14.5).  Most  physics  and  mathematical  methods 
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books  handle  the  complex  conjugate  as  we  have.  If  you  are  confused  by  this  notation 
and  equations  (14.4)  and  (14.5),  keep  going  back  to  (14.1)  where  (A\B)  =  f  A* B 
until  you  get  used  to  the  bracket  notation.  Also  study  carefully  our  use  of  the 
bracket  notation  in  the  next  section  and  do  Problems  11  to  14. 


Schwarz’s  Inequality  In  Section  10  we  proved  the  Schwarz  inequality  for  n- 
dimensional  Euclidean  space.  For  an  inner  product  space  satisfying  (14.4),  it  be¬ 
comes  [compare  (10.9)] 

(14.6)  \(A\B)\2<(A\A)(B\B). 


To  prove  this,  we  first  note  that  it  is  true  if  B  =  0.  For  B  ^  0,  let  C  =  A  —  [ iB , 
where  p  =  (B\A)/(B\B),  and  find  {C\C)  which  is  >  0  by  (14.4b).  Using  (14.4)  and 
(14.5),  we  write 

(14.7)  {A  -  fiB\A  -  [iB)  =  (A\A)  -  p*{B\A)  -  p{A\B)  +  n*[i(B\B)  >  0. 


Now  substitute  the  values  of  [i  and  [i*  to  get  (see  Problem  13) 

(A\B)  (B\A) 


(14.8)  (A\A)  -  ^(B\A)  '-(A\B ) 


(B\B) 


(B\B) 


(B\B)  (B\B) 


(B\B) 


(B\B) 


(B\B) 


which  gives  (14.6). 

For  a  function  space  as  in  (14.1)  to  (14.3),  Schwarz’s  inequality  becomes  (see 
Problem  14): 


(14.9) 


A*  ( x)B(x )  dx 


< 


A*{x)A{ x)  dx 


B*  ( x)B(x )  dx 


Orthonormal  Basis;  Gram-Schmidt  Method  Two  functions  are  called  or¬ 
thogonal  if  they  satisfy  (14.3);  a  function  is  normalized  if  its  norm  in  (14.2)  is  1. 
By  a  combination  of  the  two  words,  we  call  a  set  of  functions  orthonormal  if  they 
are  all  mutually  orthogonal  and  they  all  have  norm  1.  It  is  often  convenient  to 
write  the  functions  of  a  vector  space  in  terms  of  an  orthonormal  basis  (compare 
writing  ordinary  vectors  in  three  dimensions  in  terms  of  i,  j,  k).  Let’s  see  how  the 
Gram-Schmidt  method  applies  to  a  vector  space  of  functions  with  inner  product, 
norm,  and  orthogonality  defined  by  (14.1)  to  (14.3).  (Compare  Section  10,  Example 
4  and  the  paragraph  before  it.) 

Example  6.  In  Example  1,  we  found  that  the  set  of  all  polynomials  of  degree  <  3  is  a 
vector  space  of  dimension  4  with  basis  1,  x,  x2,  x3.  Let’s  consider  these  polynomials 
on  the  interval  —  1  <  x  <  1  and  construct  an  orthonormal  basis.  To  keep  track  of 
what  we’re  doing,  let  /o,  /i,  /2,  /3  =  1,  x,  x2,  x3-  let  po,  pi,  P2,  P3  be  a  corresponding 
orthogonal  basis  (which  we  find  by  the  Gram-Schmidt  method);  and  let  eo,  e\,  e-2, 
e3,  be  the  orthonormal  basis  (which  we  get  by  normalizing  the  functions  pi).  Recall 
the  Gram-Schmidt  routine  (see  Section  10,  Example  4):  Normalize  the  first  function 
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to  get  eg.  Then  for  the  rest  of  the  functions,  subtract  from  /,;  each  preceding  e^ 
multiplied  by  the  inner  product  of  and  _/),  that  is,  find 

(14.10)  pi  =  ei(ei  I  fi)  =  fi  -  X]  e-J  /  e^i  dx- 

j<i  j<i 

Finally,  normalize  pi  to  get  e*. 

We  can  save  effort  by  noting  in  advance  that  many  of  the  inner  products  we 
need  are  going  to  be  zero.  You  can  easily  show  (Problem  15)  that  the  integral  of 
an  odd  power  of  x  from  x  =  —1  to  1  is  zero,  and  consequently  any  even  power  of  x 
is  orthogonal  to  any  odd  power.  Observe  that  the  fi  are  alternately  even  and  odd 
powers  of  x.  Then  you  can  show  that  the  corresponding  pi  and  e,  will  also  involve 
just  even  or  just  odd  powers  of  x.  The  Gram-Schmidt  method  gives  the  following 
results  (Problem  16). 

n  f1  1 

/o  =  l=p0,  |  boll2  =  J  l2dx  =  2,  e0  =  —j=- 

fi  =  x:  pi  =  x  because  x  is  orthogonal  to  eg. 


fi  =  x2 .  Since  x 2  is  orthogonal  to  e\  but  not  to  eo, 


f3  =  x3.  Since  x 3  is  orthogonal  to  eo  and  e-2, 


Ibsll2  =  /  i  (t3  -  jjz)  dx=  :j|g,  e3=  (5x3 -3x)^. 

This  process  could  be  continued  for  a  vector  space  with  basis  1,  x,  x2,  •••,  xN 
(but  it  is  not  very  efficient) .  The  orthonormal  functions  e*  are  well-known  functions 
called  (normalized)  Legendre  polynomials.  In  Chapters  12  and  13,  we  will  discover 
these  functions  as  solutions  of  differential  equations  and  see  their  applications  in 
physics  problems. 

Infinite  Dimensional  Spaces  If  a  vector  space  does  not  have  a  finite  basis,  it 
is  called  an  infinite  dimensional  vector  space.  It  is  beyond  our  scope  to  go  into  a 
detailed  mathematical  study  of  such  spaces.  However,  you  should  know  that,  by 
analogy  with  finite  dimensional  vector  spaces,  we  still  use  the  term  basis  functions 
for  sets  of  functions  (like  xn  or  sinjrcc)  in  terms  of  which  we  can  expand  suitably 
restricted  functions  in  infinite  series.  So  far  we  have  discussed  only  power  series 
(Chapter  1).  In  later  chapters  you  will  discover  many  other  sets  of  functions  which 
provide  useful  bases  in  applications:  sines  and  cosines  in  Chapter  7,  various  special 
functions  in  Chapters  12  and  13.  When  we  introduce  them,  we  will  discuss  questions 
of  convergence  of  the  infinite  series,  and  of  completeness  of  sets  of  basis  functions. 
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PROBLEMS,  SECTION  14 

1.  Verify  the  statements  indicated  in  Examples  1  to  5  above. 

For  each  of  the  following  sets,  either  verify  (as  in  Example  1)  that  it  is  a  vector  space, 

or  show  which  requirements  are  not  satisfied.  If  it  is  a  vector  space,  find  a  basis  and  the 

dimension  of  the  space. 

2.  Linear  combinations  of  the  set  of  functions  { ex ,  sinhx,  xexj. 

3.  Linear  combinations  of  the  set  of  functions  {x,  cosx,  xcosx,  ex  cosx,  (2  —  3ex)  cosx, 
x(l  +  5cosx)}. 

4.  Polynomials  of  degree  <  3  with  02  =  0. 

5.  Polynomials  of  degree  <  5  with  ai  =03. 

6.  Polynomials  of  degree  <  6  with  03  =  3. 

7.  Polynomials  of  degree  <  7  with  all  the  even  coefficients  equal  to  each  other  and  all 

the  odd  coefficients  equal  to  each  other. 

8.  Polynomials  of  degree  <  7  but  with  all  odd  powers  missing. 

9.  Polynomials  of  degree  <  10  but  with  all  even  powers  having  positive  coefficients. 

10.  Polynomials  of  degree  <  13,  but  with  the  coefficient  of  each  odd  power  equal  to  half 

the  preceding  coefficient  of  an  even  power. 

11.  Verify  that  the  definitions  in  (14.1)  and  (14.2)  satisfy  the  requirements  for  an  inner 
product  listed  in  (14.4)  and  (14.5).  Hint:  Write  out  all  the  equations  (14.4)  and 
(14.5)  in  the  integral  notation  of  (14.1)  and  (14.2). 

12.  Verify  that  the  relations  in  (14.5)  follow  from  (14.4).  Hints:  For  (14.5a),  take  the 
complex  conjugate  of  (14.4c).  To  take  the  complex  conjugate  of  a  bracket,  use 
(14.4a). 

13.  Verify  (14.7)  and  (14.8)  Hints:  Remember  that  a  norm  squared,  like  (B\B),  is  a 
real  and  non-negative. scalar,  so  its  complex  conjugate  is  just  itself.  But  ( B\A )  is  a 
complex  scalar  and  (B\A)  =  {A\B)*  by  (14.4).  Show  that  p*  =  (A\B) / (B\B) . 

14.  Verify  that  (14.9)  is  (14.6)  with  the  definition  of  scalar  product  as  in  (14.1). 

15.  For  Example  6,  verify  the  claimed  orthogonality  on  (  —  1,1)  of  an  even  power  of  x 
and  an  odd  power  of  x.  Hint:  For  example,  consider  J_*  l  x2x3  dx. 

16.  For  Example  6,  verify  the  details  of  the  terms  omitted  in  the  functions  pi  because  of 
orthogonality.  Hint:  See  Problem  15.  Also  verify  the  calculations  of  inner  products 
and  norms  and  the  orthonormal  set  d. 


15.  MISCELLANEOUS  PROBLEMS 


1.  Show  that  if  each  element  of  one  row  (or  column)  of  a  determinant  is  the  sum  of  two 
terms,  the  determinant  can  be  written  as  a  sum  of  two  determinants;  for  example, 


an 

a  12  +  b  12 

ai3 

Oil 

012 

013 

On 

6l2 

013 

a2i 

022  +  &22 

023 

= 

021 

022 

023 

+ 

021 

622 

023 

031 

032  +  632 

033 

031 

032 

033 

O31 

f>32 

033 

Use  this  result  to  verify  Fact  4b  of  Section  3. 


Section  15 


Miscellaneous  Problems  185 


2.  What  is  wrong  with  the  following  argument?  “If  we  add  the  first  row  of  a  determi¬ 
nant  to  the  second  row  and  the  second  row  to  the  first  row,  then  the  first  two  rows 
of  the  determinant  are  identical,  and  the  value  of  the  determinant  is  zero.  Therefore 
all  determinants  have  the  value  zero.” 

3.  (a)  Find  the  equations  of  the  line  through  the  points  (4,  —1,  2)  and  (3, 1, 4). 

(b)  Find  the  equation  of  the  plane  through  the  points  (0,  0,  0),  (1,  2,  3)  and  (2, 1, 1). 

(c)  Find  the  distance  from  the  point  (1, 1,  1)  to  the  plane  3a;  —  2 y  +  6z  =  12. 

(d)  Find  the  distance  from  the  point  (1,  0,  2)  to  the  line  r  =  2i+j  — k+(i  —  2j  +  2k)f. 

(e)  Find  the  angle  between  the  plane  in  (c)  and  the  line  in  (d). 

4.  Given  the  line  r  =  3i  —  j  +  (2i  +  j  —  2k)f: 

(a)  Find  the  equation  of  the  plane  containing  the  line  and  the  point  (2, 1,  0). 

(b)  Find  the  angle  between  the  line  and  the  (y,  z)  plane. 

(c)  Find  the  perpendicular  distance  between  the  line  and  the  x  axis. 

(d)  Find  the  equation  of  the  plane  through  the  point  (2, 1,  0)  and  perpendicular  to 
the  line. 

(e)  Find  the  equations  of  the  line  of  intersection  of  the  plane  in  (d)  and  the  plane 
V  =  2  z- 

5.  (a)  Write  the  equations  of  a  straight  line  through  the  points  (2,  7,  —1)  and  (5,  7,  3). 

(b)  Find  the  equation  of  the  plane  determined  by  the  two  lines  r  =  (i  —  2j  +  k )t 
and  r  =  (6i  —  3j  +  2k)t. 

(c)  Find  the  angle  which  the  line  in  (a)  makes  with  the  plane  in  (b). 

(d)  Find  the  distance  from  (1, 1, 1)  to  the  plane  in  (b). 

(e)  Find  the  distance  from  (1,6,  —3)  to  the  line  in  (a). 

6.  Derive  the  formula 

n  _  \ax0  +  by0  +  cz0  -  d\ 

Va2  +  b2  +  c2 

for  the  distance  from  (*o,  Vo,  zo)  to  ax  +  by  +  cz  =  d. 


7. 


Given  the  matrices  A,  B,  C  below,  find  or  mark  as  meaningless  the  matrices:  AT, 
A-1,  AB,  A,  AtBt,  BtAt,  BAt,  ABC,  ABTC,  BTAC,  A+,  BTC,  B-1C,  C^A, 
CBT. 


A  = 


8. 


9. 


Given 

/I 

0 

2  i\ 

A  =  * 

-3 

0  )  ,  find  At,  A,  A^,  A~\ 

V 

0 

ij 

The  following  matrix  product 

is  used  in  discussing  a  thick  lens 

(1  (n-l)/R2\(  l  0\  A  —  (n  — 

V0  1  )  \d/n  l)  V0  1  ) 


where  d  is  the  thickness  of  the  lens,  n  is  its  index  of  refraction,  and  Ri  and  R2  are 
the  radii  of  curvature  of  the  lens  surfaces.  It  can  be  shown  that  element  A12  of  A  is 
—  1  //  where  /  is  the  focal  length  of  the  lens.  Evaluate  A  and  det  A  (which  should 
equal  1)  and  find  1//.  [See  Am.  J.  Phys.  48,  397-399  (1980).] 
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10.  The  following  matrix  product  is  used  in  discussing  two  thin  lenses  in  air: 


M  = 


where  /i  and  fo  are  the  focal  lengths  of  the  lenses  and  d  is  the  distance  between 
them.  As  in  Problem  9,  element  M12  is  —1//  where  /  is  the  focal  length  of  the 
combination.  Find  M,  detM,  and  1//. 

11.  There  is  a  one-to-one  correspondence  between  two-dimensional  vectors  and  complex 
numbers.  Show  that  the  real  and  imaginary  parts  of  the  product  Z1Z2  (the  star 
denotes  complex  conjugate)  are  respectively  the  scalar  product  and  ±  the  magnitude 
of  the  vector  product  of  the  vectors  corresponding  to  z\  and  Z2- 

12.  The  vectors  A  =  ai  +  b]  and  B  =  ci  +  dj  form  two  sides  of  a  parallelogram.  Show 
that  the  area  of  the  parallelogram  is  given  by  the  absolute  value  of  the  following 
determinant.  (Also  see  Chapter  6,  Section  3.) 

a  b 
c  d 


13.  The  plane  2x  +  3y  +  6z  =  6  intersects  the  coordinate  axes  at  points  P,  Q,  R,  forming 
a  triangle.  Find  the  vectors  PQ  and  PR.  Write  a  vector  formula  for  the  area  of  the 
triangle  PQR,  and  find  the  area. 


In  Problems  14  to  17,  multiply  matrices  to  find  the  resultant  transformation.  Caution:  Be 
sure  you  are  multiplying  the  matrices  in  the  right  order. 


14. 


15. 


16. 


17. 


f  x'  =  {x  +  yV 3)/2  f  x"  =  (-x1  +  y'V 3)/2 

1  y'  =  (~x\/3  +  y)/2  \  y"  =  ~(x'V3  +  y')/2 

(  x'  =  2x  +  5 y  (  x"  =  x'  -  2y' 

\y'  =  x  +  3y  \  y"  =  3x'  -  by' 

(  x'  =  {x  +  yV2  +  z)/2  (  x"  =  {x'V2  +  z'V2)/2 

\  y'  =  (xV2-zV2)/2  y"  =  {-x'  -y'V2  +  z')/2 

[  z'  =  {-x  +  yV2-  z)/2  [  z"  =  (x1  -  y' V2  -  z')/2 

(  x'  =  {2x  +  y  +  2z)/3  (  x”  =  (2.x'  +  y'  +  2z')/3 

y'  =  (x  +  2y-2z)/3  i  y"  =  {-x'  -2y'  +2z')/3 
[  z' =  (2x-2y  -  z)/3  {  z"  =  (~2x'  +  2y’  +  z’)/3 


Find  the  eigenvalues  and  eigenvectors  of  the  matrices  in  the  following  problems. 
18.  ()  J)  19.  (5  II 


22. 


5  -4 

-4  5 


21. 


3 

0 

^2\ 

/3 

0 

f  2 

-3 

0 

4 

0 

23. 

° 

3 

1  ) 

24. 

-3 

2 

2 

0 

3/ 

U 

1 

2/ 

V  4 

0 

25.  Find  the  C  matrix  which  diagonalizes  the  matrix  M  of  Problem  18.  Observe  that  M 
is  not  symmetric,  and  C  is  not  orthogonal  (see  Section  11).  However,  C  does  have 
an  inverse;  find  C^1  and  show  that  C-1MC  =  D. 

26.  Repeat  Problem  25  for  Problem  19. 

In  Problems  27  to  30,  rotate  the  given  quadric  surface  to  principal  axes.  What  is  the  name 
of  the  surface?  What  is  the  shortest  distance  from  the  origin  to  the  surface? 


Section  15 


Miscellaneous  Problems  187 


27.  x2  +  y2  —  5  z2  +  Axy  =  15 

28.  7x2  +  Ay2  +  z2  —  8  xz  =  36 

29.  3x2  +  5 y2  -  3z2  +  6yz  =  54 

30.  7x2  +  7 y2  +  7z2  +  10*2  -  2Ayz  =  20 

31.  Find  the  characteristic  vibration  frequencies  of  a  system  of  masses  and  springs  as  in 
Figure  12.1  if  the  spring  constants  are  k,  3k,  k. 

32.  Do  Problem  31  if  the  spring  constants  are  6k,  2k,  3k. 

33.  Prove  the  Caley-Hamilton  theorem  (Problem  11.60)  for  any  matrix  M  for  which 
D  =  C_1MC  is  diagonal.  See  hints  in  Problem  11.60. 

34.  In  problems  6.30  and  6.31,  you  found  the  matrices  eA  and  ec  (put  k  =  1)  where  A 
and  C  are  the  Pauli  matrices  from  Problem  6.6.  Now  find  the  matrix  (A  +  C)  and  its 
powers  and  so  find  the  matrix  eA+c  to  show  that  eA+c  ^  eAec.  See  Problem  6.29. 

35.  Show  that  a  square  matrix  A  has  an  inverse  if  and  only  if  A  =  0  is  not  an  eigenvalue 
of  A.  Hint:  Write  the  condition  for  A  to  have  an  inverse  (Section  6),  and  the 
condition  for  A  to  have  the  eigenvalue  A  =  0  (Section  11). 

36.  Write  the  three  3  by  3  matrices  for  180°  rotations  about  the  x,  y,  z  axes.  Show  that 
these  three  matrices  commute  (contrary  to  what  we  usually  expect — see  Problems 
7.30  and  7.31).  By  writing  the  multiplication  table,  show  that  these  three  matrices 
with  the  unit  matrix  form  a  group.  To  which  order  4  group  is  it  isomorphic?  Hint: 
See  Problem  13.5. 

37.  Show  that  for  a  given  irreducible  representation  of  a  group,  the  character  of  the  class 
consisting  of  the  identity  is  always  the  dimension  of  the  irreducible  representation. 
Hint:  What  is  the  trace  of  a  unit  n-by-n  matrix? 

38.  For  a  cyclic  group,  show  that  every  element  is  a  class  by  itself.  Show  this  also  for 
an  Abelian  group. 
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Partial  Differentiation 


1.  INTRODUCTION  AND  NOTATION 

If  y  =  /( x),  then  dy/dx  can  be  thought  of  either  as  the  slope  of  the  curve  y  =  f(x)  or 
as  the  rate  of  change  of  y  with  respect  to  x.  Rates  occur  frequently  in  physics;  time 
rates  such  as  velocity,  acceleration,  and  rate  of  cooling  of  a  hot  body  are  obvious 
examples.  There  are  also  other  rates:  rate  of  change  of  volume  of  a  gas  with  applied 
pressure,  rate  of  decrease  of  the  fuel  in  your  automobile  tank  with  distance  traveled, 
and  so  on.  Equations  involving  rates  (differential  equations)  often  need  to  be  solved 
in  applied  problems.  Derivatives  are  also  used  in  finding  maximum  and  minimum 
points  of  a  curve  and  in  finding  the  power  series  of  a  function.  All  these  applications, 
and  more,  occur  also  when  we  consider  a  function  of  several  variables. 

Let  z  be  a  function  of  two  variables  x  and  y ;  we  write  z  =  f(x,y).  Just  as  we 
think  of  y  =  f(x)  as  a  curve  in  two  dimensions,  so  it  is  useful  to  interpret  z  =  f(x,  y) 
geometrically.  If  x,  y,  z  are  rectangular  coordinates,  then  for  each  x,  y  the  equation 
gives  us  a  value  of  z,  and  so  determines  a  point  (x,  y,  z)  in  three  dimensions.  All 
the  points  satisfying  the  equation  ordinarily  form  a  surface  in  three-dimensional 
space  (see  Figure  1.1).  (It  might  happen  that  an  equation  would  not  be  satisfied 
by  any  real  points,  for  example  x2  +  y2  +  z2  =  —  1,  but  we  shall  be  interested 
in  equations  whose  graphs  are  real  surfaces.)  Now  suppose  x  is  constant;  think 
of  a  plane  x  =  const,  intersecting  the  sur¬ 
face  (see  Figure  1.1).  The  points  satisfying 
z  =  f(x,y)  and  x  =  const,  then  lie  on  a 
curve  (the  curve  of  intersection  of  the  sur¬ 
face  and  the  x  =  const,  plane;  this  is  AB 
in  Figure  1.1).  We  might  want  the  slope, 
maximum  and  minimum  points,  etc.,  of  this 
curve.  Since  z  is  a  function  of  y  (on  this 
curve),  we  might  write  dz/dy  for  the  slope. 

However,  to  show  that  z  is  actually  a  func¬ 
tion  of  two  variables  x  and  y  with  one  of 
them  (x)  temporarily  a  constant,  we  write 
dz/dy,  we  call  dz/dy  the  partial  derivative 
of  z  with  respect  to  y.  Similarly,  we  can 


z 


Figure  1.1 
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hold  y  constant  and  find  dz/dx,  the  partial  derivative  of  z  with  respect  to  x.  If 
these  partial  derivatives  are  differentiated  further,  we  write 

d  dz  d2z  d  dz  d2z  d  d2z  d3z 

dx  dx  dx 2  ’  dx  dy  dxdy  ’  dx  dxdy  dx2dy  ’ 

Other  notations  are  often  useful.  If  z  =  f(x,y),  we  may  use  zx  or  fx  or  /j  for 
df/dx,  and  corresponding  notations  for  the  higher  derivatives. 


Example.  Given  z  =  f(x,  y)  =  x3y  —  exy ,  then 

=  tt-  =  fx  =  zx  =  fi  =  3x2y  -  yexv, 
OX  ox 

—  =  —  =  f  =  z  =  fo  —  x3  -  xexv 
dy  ~  dy  ~  h  ~  v  ~  h  ~  ’ 

d2f  _  d2z  _ 


dx  dy  dx  dy 


—  fyx  —  Zyx  —  ,/*21  —  6  ^  XyC 


xy 


=  zxx  =  fu  =  6a :y  -  y2exy, 

dxz  dxz 

d3f  _  d3z  _  _  _  _  3  xy 

'dy3  =  dy3  =  *yvv  =  Zyvv  =  222  ~  ~x  e  ’ 


d3f 


d3z 


—  fyxx  —  Zyxx  —  /211  —  6x  2 yc  y  xy  e 


dx2  dy  dx2  dy 

We  can  also  consider  functions  of  more  variables  than  two,  although  in  this  case 
it  is  not  so  easy  to  give  a  geometrical  interpretation.  For  example,  the  temperature 
T  of  the  air  in  a  room  might  depend  on  the  point  (a ’,y,  z)  at  which  we  measured  it 
and  on  the  time  t:  we  would  write  T  =  T(x,  y,  z,  t).  We  could  then  find,  say,  dT /dy, 
meaning  the  rate  at  which  T  is  changing  with  y  for  fixed  x  and  z  at  one  instant  of 
time  t. 

A  notation  which  is  frequently  used  in  applications  (particularly  thermodynam¬ 
ics)  is  ( dz/dx)y ,  meaning  dz/dx  when  z  is  expressed  as  a  function  of  x  and  y.  (Note 
two  different  uses  of  the  subscript  y;  in  the  example  above,  fy  meant  df  /dy.  A 
subscript  on  a  partial  derivative ,  however,  does  not  mean  another  derivative,  but 
just  indicates  the  variable  being  held  constant  in  the  indicated  partial  differentia¬ 
tion.)  For  example,  let  z  =  x2  —  y2.  Then  using  polar  coordinates  r  and  9,  (recall 
that  x  =  rcosO,  y  =  rsin0,  x2  +  y2  =  r2),  we  can  write  z  in  several  other  ways. 
For  each  new  expression  let  us  find  dz/dr. 


2  2 

z  =  x  -  y  , 

z  =  r  cos  6  —  r  sm  t/, 
z  =  2x2  -  x2  -y2  =  2x2  -  r2, 
z  =  x2  +  y2  -  2y2  =  r2  -  2 y2 , 


2r(cos2  8 


—2r, 


=  +2r. 


sin2  9) , 


These  three  expressions  for  dz  / dr  have  different  values  and  are  derivatives  of  three 
different  functions,  so  we  distinguish  them  as  indicated  by  writing  the  second  inde¬ 
pendent  variable  as  a  subscript.  Note  that  we  do  not  write  z{x,y)  or  z{r,9)-,  z  is 
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one  variable,  but  it  is  equal  to  several  different  functions.  Pure  mathematics  books 
usually  avoid  the  subscript  notation  by  writing,  say,  z  =  f(r,  9)  =  g(r,  x)  =  h{r,  y), 
etc.;  then  ( dz/dr)g  can  be  written  as  just  df  /dr,  and  similarly 

dz\  dg  ( dz\  dh 

fr)x=fr  “  \Jh)y=lh' 

However,  this  multiplicity  of  notation  (z  =  /  =  g  =  h,  etc.)  would  be  inconve¬ 
nient  and  confusing  in  applications  where  the  letters  have  physical  meanings.  For 
example,  in  thermodynamics,  we  might  need 

fdT\  fdT\  f  dT\  fdT\ 

\dp)v’  UJ/  \dp)u’  \ds)p'  6tC‘’ 

as  well  as  many  other  similar  partial  derivatives.  Now  T  means  temperature  (and 
the  other  letters  similarly  have  physical  meanings  which  must  be  recognized) .  If  we 
wrote  T  =  A(p,v)  =  B(v,s)  =  C(p,u)  =  D(s,p)  and  similar  formulas  for  the  eight 
commonly  used  quantities  in  thermodynamics,  each  as  functions  of  pairs  from  the 
other  seven,  we  would  not  only  have  an  unwieldy  system,  but  the  physical  meaning 
of  equations  would  be  lost  until  we  translated  them  back  to  standard  letters.  Thus 
the  subscript  notation  is  essential. 


The  symbol  ( dz/dr)x  is  usually  read  “the  partial  of  z  with  respect  to  r,  with  x 
held  constant.”  However,  the  important  point  to  understand  is  that  the  notation 
means  that  z  has  been  written  as  a  function  of  the  variables  r  and  x  only,  and 
then  differentiated  with  respect  to  r. 


A  little  experimenting  with  various  functions  f(x,  y)  will  probably  convince  you 
that  (d/dx)(df  jdy)  =  (d/dy)(df /dx);  this  is  usually  (but  not  always)  true  in 
applied  problems.  It  can  be  proved  (see  advanced  calculus  texts)  that  if  the  first  and 
second  order  partial  derivatives  of  /  are  continuous,  then  d2f/dxdy  and  d2  f  /dydx 
are  equal.  In  many  applied  problems,  these  conditions  are  met;  for  example,  in 
thermodynamics  they  are  normally  assumed  and  are  called  the  reciprocity  relations. 

PROBLEMS,  SECTION  1 

1.  If  u  =  x2 /(x2  +  y2),  find  du/dx,  du/dy. 

2.  If  s  =  tu,  find  ds/dt,  ds/du. 

3.  If  z  =  lny/u2  +  v2  +  w2,  find  dz/du,  dz/dv,  dz/dw. 

4.  For  w  =  x3  —  y3  —  2 xy  +  6,  find  d2w/dx 2  and  d2w/dy 2  at  the  points  where 
dw/dx  =  dw/dy  =  0. 

5.  For  w  =  8ie4  +  y4  —  2xy2 ,  find  d2w/dx 2  and  d2w/dy2  at  the  points  where 
dw/dx  =  dw/dy  =  0. 

6.  For  u  =  ex  cos  y, 

(a)  verify  that  d2u/dxdy  =  d2u/dydx\ 

(b)  verify  that  d2u/dx2  +  d2u/dy 2  =  0. 


Section  2 


Power  Series  in  Two  Variables  191 


If  z  =  x2  +  2 y2,  x  =  r  cos  9,  y  =  r  sin  find  the  following  partial  derivatives. 


7. 


13. 


19. 


I)  8' 

/  v 


dz 

de 

d2z 
dr  dy 


14. 


20. 


dz\ 

dx), 

dz 

de 


d2z 
dx  de 


9. 


15. 


21. 


dx) 


dz 

de 

d2z 
dy  d6 


]  10. 

e 

16. 


22. 


d2z 
dr  dx 


11. 


17. 


23. 


d2z 
dr  de 


12. 


18. 


24. 


7'  to  24'.  Repeat  Problems  7  to  24  if  z  =  r2tan2  9. 


d2z 
dx  dy 


2.  POWER  SERIES  IN  TWO  VARIABLES 

Just  as  in  the  one-variable  case  discussed  in  Chapter  1,  the  power  series  (about  a 
given  point)  for  a  function  of  two  variables  is  unique,  and  we  may  use  any  convenient 
method  of  finding  it  (see  Chapter  1  for  methods). 


Example  1 .  Expand  /(x,  y )  =  sin  x  cos  y  in  a  two-variable  Maclaurin  series.  We  write  and 
multiply  the  series  for  sin  x  and  cos  y.  This  gives 


sin  x  cos  y 


Example  2.  Find  the  two- variable  Maclaurin  series  for  ln(l  +  x  —  y).  We  replace  x  in 
equation  (13.4)  of  Chapter  1  by  x  —  y  to  get 

ln(l  +  x-y)  =  (x-y)-(x-  y)2  /  2  +  (x  -  y)3/ 3  H - 

=  x  —  y  —  x2/2  +  xy  —  y2 /2  +  x3/3  —  x2y  +  xy2  —  y3 / 3  H —  •  . 

The  methods  of  Chapter  1,  used  as  we  have  just  shown,  provide  an  easy  way 
of  obtaining  the  power  series  for  many  simple  functions  /(x,j/).  However,  it  is 
also  convenient,  for  theoretical  purposes,  to  have  formulas  for  the  coefficients  in 
the  Taylor  series  or  the  Maclaurin  series  for  /(x,y);  see,  for  example,  Problem  8.2. 
Following  a  process  similar  to  that  used  in  Chapter  1,  Section  12,  we  can  find  the 
coefficients  of  the  power  series  for  a  function  of  two  variables  /(x,  y)  (assuming  that 
it  can  be  expanded  in  a  power  series).  To  find  the  series  expansion  of  /(x,  y)  about 
the  point  (a,  b)  we  write  /(x,  y)  as  a  series  of  powers  of  (x  —  a)  and  ( y  —  b)  and  then 
differentiate  this  equation  repeatedly  as  follows. 

/(x,  y)  =  a00  +  ai0(x  -  a)  +  a0i(y  -  b)  +  a2 o(x  -  a)2  +  an(x  -  a)(y  -  b) 

+  ao2 (y  -  b)2  +  a30(x  -  a)3  +  a2i(x  -  a)2(y  -  b) 

+  a12(x  -  a)(y  -  b)2  +  a03(y  —  b)3  H - . 

(2-1)  fx  =  aio  +  2 a20(x  -  a)  +  an(y  -  b)  H - , 

fv  =  a0i  +  an(x  -  a)  +  2 a02(y  -  b)  H - , 

fxx  =  2a2o  +  terms  containing  (x  —  a)  and/or  (y  —  6) , 
fxy  =  clu  +  terms  containing  (x  —  a)  and/or  (y  —  b). 

[We  have  written  only  a  few  derivatives  to  show  the  idea.  You  should  be  able  to 
calculate  others  in  the  same  way  (Problem  7).]  Now  putting  x  =  a,  y  =  b  in  (2.1), 
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we  get 

^  f(a,b)  =  aoo,  fx(a,b)  =  a10,  fy{a,b)  =  a0 1, 

fxx(a,  b)  =  2a20,  fxy(a,b)  =  a±1,  etc. 

[Remember  that  fx(a ,  6)  means  that  we  are  to  find  the  partial  derivative  of  /  with 
respect  to  x  and  then  put  x  =  a,  y  =  b,  and  similarly  for  the  other  derivatives.] 
Substituting  the  values  for  the  coefficients  into  (2.1),  we  find 

(2.3) 

f{x,  y)  =  f(a,  b)  +  fx(a,  b){x  -  a)  +  fy(a,  b){y  -  b) 

+  7fi[fxx(a,  b){x  -  a)2  +  2 fxy(a,  b){x  -  a){y  -  b)  +  fyy(a,  b)(y  -  b )2] 

This  can  be  written  in  a  simpler  form  if  we  put  x  —  a  =  h  and  y  —  b  =  k.  Then  the 
second-order  terms  (for  example)  become 

(2.4)  7fi[fxx(a,  b)h2  +  2 fxy(a,  b)hk  +  fyy(a ,  b)k2}. 

We  can  write  this  in  the  form 

<2-5'  h{hl+kb)2  !(a'b) 

if  we  understand  that  the  parenthesis  is  to  be  squared  and  then  a  term  of  the  form 
h(d/dx)k(d/dy)f(a,b)  is  to  mean  hkfxy(a,b).  It  can  be  shown  (Problem  7)  that 
the  third-order  terms  can  be  written  in  this  notation  as 

i/O  r\  \  3  i 

(2.6)  y\h]h'+k~dy)  =  +  ^b?kfxxy{a,b)  d - ] 

and  so  on  for  terms  of  any  order.  Thus  we  can  write  the  series  (2.3)  in  the  form 


The  numbers  appearing  in  the  nth  order  terms  are  the  familiar  binomial  coefficients 
[in  the  expansion  of  (p  +  q)n]  divided  by  (n!).  (See  Chapter  1,  Section  13C.) 

PROBLEMS,  SECTION  2 

Find  the  two- variable  Maclaurin  series  for  the  following 

1.  cos®  sinhy  2.  cos(®  +  y) 

4.  exy  5.  s/l  +  xy 

7.  Verify  the  coefficients  of  the  third-order  terms  [(2.6)  or  n  =  3  in  (2.7)]  of  the  power 
series  for  f{x,y)  by  finding  the  third-order  partial  derivatives  in  (2.1)  and  substi¬ 
tuting  x  =  a,  y  =  b. 

8.  Find  the  two- variable  Maclaurin  series  for  ex  cos  y  and  ex  siny  by  finding  the  series 
for  ez  =  ex+ly  and  taking  real  and  imaginary  parts.  (See  Chapter  2.) 


functions. 

3. 

6. 


ln(l  +  x ) 

l  +  y 

x+y 
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3.  TOTAL  DIFFERENTIALS 


The  graph  (Figure  3.1)  of  the  equation  y  =  f{x)  is  a  curve  in  the  (s,y)  plane  and 


(3.1) 


is  the  slope  of  the  tangent  to  the  curve  at  the  point  (s,  y).  In  calculus,  we  use  Ax  to 
mean  a  change  in  x,  and  Ay  means  the  corresponding  change  in  y  (see  Figure  3.1). 
By  definition 


(3.2) 


dy  Ay 

—  =  lim  — — . 

ax  Ai-<o  As 


We  shall  now  define  the  differential  dx  of  the  independent  variable  as 

(3.3)  dx  =  Ax. 

However,  dy  is  not  the  same  as  Ay.  From  Figure  3.1  and  equation  (3.1),  we  can 
see  that  Ay  is  the  change  in  y  along  the  curve,  but  dy  =  y'dx  is  the  change  in 
y  along  the  tangent  line.  We  say  that  dy  is  the  tangent  approximation  (or  linear 
approximation)  to  Ay. 

Example.  If  y  =  f(t)  represents  the  distance  a  particle  has  gone  as  a  function  of  t,  then 
dy/dt  is  the  speed.  The  actual  distance  the  particle  has  gone  between  time  t  and 
time  t  +  dt  is  Ay.  The  tangent  approximation  dy  =  ( dy/dt)dt  is  the  distance  it 
would  have  gone  if  it  had  continued  with  the  same  speed  dy/dt  which  it  had  at 
time  t. 

You  can  see  from  the  graph  (Figure  3.1)  that  dy  is  a  good  approximation  to 
Ay  if  dx  is  small.  We  can  say  this  more  exactly  using  (3.2).  Saying  that  dy/dx  is 
the  limit  of  Ay/Ax  as  Ax  — >  0  means  that  the  difference  Ay/Ax  —  dy/dx  — »  0  as 
As  — >  0.  Let  us  call  this  difference  e;  then  we  can  say 

(3.4)  — —  =  +  e,  where  e  — *  0  as  As  — *  0, 

As  dx 

or  since  dx  =  Ax 

(3.5)  Ay  =  (yr  +  e)dx,  where  e  — >  0  as  As  — >  0. 
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The  differential  dy  =  y'dx  is  called  the  principal  part  of  Ay ;  since  e  is  small  for 
small  dx ,  you  can  see  from  (3.5)  that  dy  is  then  a  good  approximation  to  Ay. 

In  our  example,  suppose  y  =  t2 ,  t  =  1,  dt  =  0.1.  Then 

Ay 
dy 
€ 

Ay 

Thus  dy  is  a  good  approximation  to  Ay. 

For  a  function  of  two  variables,  z  =  f(x,  y),  we  want  to  do  something  similar  to 
this.  We  have  said  that  this  equation  represents  a  surface  and  that  the  derivatives 
df  /dx ,  df  /dy ,  at  a  point,  are  the  slopes  of  the  two  tangent  lines  to  the  surface 
in  the  x  and  y  directions  at  that  point.  The  symbols  A*  =  dx  and  Ay  =  dy 
represent  changes  in  the  independent  variables  x  and  y.  The  quantity  A z  means 
the  corresponding  change  in  z  along  the  surface.  We  define  dz  by  the  equation 


=  (l.l)2  -  l2  =  0.21, 

=  C^-dt  =  2  -  1  •  (0.1)  =  0.2, 

=  *  2.1  — 2  =  0.1, 

At  dt 

=  ( y '  +  e)dt  =  (2  +  0.1)(0.1)  =  dy  +  edt  =  0.2  +  0.01. 


dz  dz 

(3.6) 

dz  =  —  dx  +  —  dy. 
ox  oy 

Q 


Si**1 


»f  j 
Ty  d> 


ER  =  &2 
Eli  =  dz 


Figure  3.2 

The  differential  dz  is  called  the  total  differential  of  z.  Let  us  consider  the  geometrical 
meaning  of  dz.  Recall  (Figure  3.1)  that  for  y  =  f(x),  dy  was  the  change  in  y  along 
the  tangent  line;  here  we  shall  see  that  dz  is  the  change  in  z  along  the  tangent  plane. 
In  Figure  3.2,  PQRS  is  a  surface,  PABC  is  the  plane  tangent  to  the  surface  at  P, 
and  PDEF  is  a  horizontal  plane  through  P.  Thus  PSCF  is  the  plane  y  =  const, 
(through  P),  PS  is  the  curve  of  intersection  of  this  plane  with  the  surface,  and 
PC  is  the  tangent  line  to  this  curve  and  so  has  the  slope  df  /dx\  then  (just  as  in 
Figure  3.1),  if  PF  =  dx,  we  have  CF  =  (df  /dx)  dx.  Similarly,  PQAD  is  a  plane 
x  =  const.,  intersecting  the  surface  in  the  curve  PQ,  whose  tangent  is  PA:  with 
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PD  =  dy ,  we  have  DA  =  (df  /dy)  dy.  From  the  figure,  GE  =  CF,  and  BG  =  AD, 
so 

EB  =  CF  +  DA  =  ^-dx+^-dy  =  dz. 

ox  dy 

Thus,  as  we  said,  dz  is  the  change  in  z  along  the  tangent  plane  when  x  changes  by 
dx  and  y  by  dy.  In  the  figure,  ER=  A z,  the  change  in  2  along  the  surface. 

From  the  geometry,  we  can  reasonably  expect  dz  to  be  a  good  approximation 
to  A2  if  dx  and  dy  are  small.  However,  we  should  like  to  say  this  more  accurately 
in  an  equation  corresponding  to  (3.5).  We  can  do  this  if  df  /dx  and  df  /dy  are 
continuous  functions.  By  definition 

(3.7)  Az  =  f(x  +  Ax,y  +  Ay)  -  f(x,y). 

By  adding  and  subtracting  a  term,  we  get 

(3.8)  Az  =  f(x  +  Ax,  y)  -  f(x,  y)  +  /(x  +  A x,y  +  Ay)  -  f(x  +  Ax,  y). 

Recall  from  calculus  that  the  mean  value  theorem  (law  of  the  mean)  says  that  for 
a  differentiable  function  /(x), 

(3.9)  f(x  +  Ax)  -  f(x)  =  (Ax)/'(xi), 

where  Xi  is  between  x  and  x  +  Ax.  Geometrically  this  says  (Figure  3.3)  that  there 
is  a  tangent  line  somewhere  between  x  and  x  +  Ax  which  has  the  same  slope  as  the 


line  AB.  In  the  first  two  terms  of  the  right  side  of  (3.8),  y  is  constant,  and  we  can 
use  (3.9)  if  we  write  df  /dx  for  /'.  In  the  last  two  terms  of  (3.8),  x  is  constant  and 
we  can  use  an  equation  like  (3.9)  with  y  as  the  variable;  y\  will  mean  a  value  of  y 
between  y  and  y  +  Ay.  Then  (3.8)  becomes 


(3.10) 


Az  =  a/|llI)Aj 

OX 


df(x+  Ax,yi) 
dy 


Ay. 


If  the  partial  derivatives  of  /  are  continuous,  then  their  values  in  (3.10)  at  points 
near  (x,  y)  differ  from  their  values  at  (x,  y)  by  quantities  which  approach  zero  as 
Ax  and  Ay  approach  zero.  Let  us  call  these  quantities  t\  and  £2-  Then  we  can 
write 


(3.11) 

Az  = 


i+ei|Ax- 


Ay  =  dz  +  ei  Ax  +  £2  Ay 

(ei  and  £2  — »  0  as  Ax  and  Ay  — >  0), 


where  df  /dx  and  df  /dy  in  (3.11)  are  evaluated  at  (x,  y).  Equation  (3.11)  [like  (3.5) 
for  the  y  =  /(x)  case]  tells  us  algebraically  what  we  suspected  from  the  geometry, 
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that  (if  df  /dx  and  df/dy  are  continuous  )  dz  is  a  good  approximation  to  Az  for 
small  dx  and  dy.  The  differential  dz  is  called  the  principal  part  of  A z. 

Everything  we  have  said  about  functions  of  two  variables  works  just  as  well  for 
functions  of  any  number  of  variables,  if  u  =  f(x,  y,z,-  ■  ■),  then  by  definition 


(3.12) 


df  df  df 

du=-^-dx+-^-dy+-^-dz 
dx  dy  dz 


and  du  is  a  good  approximation  to  A u  if  the  partial  derivatives  of  /  are  continuous 
and  dx ,  dy,  dz,  etc.,  are  small. 


PROBLEMS,  SECTION  3 

1.  Consider  a  function  f(x,y)  which  can  be  expanded  in  a  two-variable  power  series, 
(2.3)  or  (2.7).  Let  x  —  a  =  h  =  Ax,  y  —  b  =  k  =  Ay,  then  x  =  a  +  Aa:,  y  =  b  +  Ay 
so  that  f{x,y)  becomes  f(a  +  A x,b  +  Ay).  The  change  A z  in  z  =  f(x,y )  when  x 
changes  from  a  to  a  +  Ax  and  y  changes  from  b  to  b  +  Ay  is  then 

A z  =  f(a  +  Ax,  b  +  Ay)  -  f(a,  b). 

Use  the  series  (2.7)  to  obtain  (3.11)  and  to  see  explicitly  what  ei  and  t2  are  and 
that  they  approach  zero  as  Aa:  and  Ay  — *  0. 


4.  APPROXIMATIONS  USING  DIFFERENTIALS 

Let’s  consider  some  examples. 


Example  1.  Find  approximately  the  value  of 

1  1 
VO. 25  -  10  -  20  V(h25’ 

If  f(x)  =  1  / yfx,  the  desired  difference  is  A /  =  /( 0.25  —  10  -  20)  —  /(0.25).  But  A / 
is  approximately  df  =  d(  1/y/x)  with  x  =  0.25  and  dx  =  — 10-20. 

d(  1/y/x)  =  (-l/2)a :~3/2dx  =  (-l/2)(0.25)-3/2(-KT20)  =  4  x  1CT20. 

Now  why  not  just  use  a  computer  or  calculator  for  a  problem  like  this?  First 
note  that  we  are  subtracting  two  numbers  which  are  almost  equal  to  each  other.  If 
your  calculator  or  computer  isn’t  carrying  enough  digits,  you  may  lose  all  accuracy 
in  the  subtraction  (see  Chapter  1,  Section  15,  Example  1).  So  it  may  take  you  more 
time  to  check  on  this  and  to  type  the  problem  into  the  computer  than  to  find  df 
which  you  can  probably  do  in  your  head!  However,  there  is  another  important  point 
here  which  is  shown  in  the  next  example.  For  theoretical  purposes,  we  may  want  a 
formula  rather  than  a  numerical  result. 
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Example  2.  Show  that  when  n  is  very  large 

1  1  ~  2 
n2  (n  +  l)2  n3 

(=  means  “approximately  equal  to”).  If  f(x)  =  1  / re2,  the  desired  difference  is 
A /  =  /(n)  —  f{n  +  1).  But  A /  is  approximately  df  =  d(  1/x2)  with  x  =  n  and 
dx  =  —1. 


(This  result  is  used  in  obtaining  the  “correspondence  principle”  in  quantum  me¬ 
chanics;  see  texts  on  quantum  physics.)  Also  see  Problem  17. 

Example  3.  The  reduced  mass  /i  of  a  system  of  two  masses  m\  and  m2  is  defined  by 
/i^1  =  m i1  +  m2x .  If  m±  is  increased  by  1%,  what  fractional  change  in  m2  leaves 
/i  unchanged?  Taking  differentials  of  the  equation  and  substituting  dm\  =  O.Olmi, 
we  find 


0  =  ~m12  dm\  —  m2  2  dm2 , 

dm2  dm  1  O.Olmi  dm2  „  , 

— 5-  = - 5-  = - 5 —  or  - =  —  O.OlnWmi. 

m2  mj  mf  m2 

For  example,  if  mi  =  m2,  m2  should  be  decreased  by  1%;  if  m2  =  3mi,  m2  should 
be  decreased  by  3%;  and  so  on. 

Example  4.  The  electrical  resistance  R  of  a  wire  is  proportional  to  its  length  and  inversely 
proportional  to  the  square  of  its  radius,  that  is,  R  =  kl/r2.  If  the  relative  error 
in  length  measurement  is  5%  and  the  relative  error  in  radius  measurement  is  10%, 
find  the  relative  error  in  R.  in  the  worst  possible  case. 

The  relative  error  in  l  means  the  actual  error  in  measuring  l  divided  by  the  length 
measured.  Since  we  might  measure  l  either  too  large  or  too  small,  the  relative  error 
dl/l  might  be  either  +0.05  or  —0.05  in  the  worst  cases.  Similarly  |dr/r|  might  be 
as  large  as  0.10.  We  want  the  largest  value  which  \dR/R\  could  have;  we  can  find 
dR/R  by  differentiating  Ini?.  From  R  =  kl/r 2  we  find 

In  R  =  In  k  +  In  l  —  2  In  r. 


Then 


dR  dl  „  dr 

~R=  T~  V' 


In  the  worst  case  (that  is,  largest  value  of  |di?/i?|),  dl/l  and  dr/r  might  have 
opposite  signs  so  the  two  terms  would  add.  Then  we  would  have: 


Largest 


dR 


R 


=  0.05 +  2(0.10)  =  0.25  or 


25%. 
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Example  5.  Estimate  the  change  in 


.  fx  sint 

fix)  =  J  —dt 

when  x  changes  from  n/2  to  (l  +  e)7r/2  where  e  <<  1/10.  Recall  from  calculus  that 
df/dx  =  (sinx)/x.  Then  we  want  df  =  (df/dx)dx  with  x  =  7t/2  and  dx  =  en/2. 
Thus 

sin7r/2.  .  . 
df=^jr(en/2)  =  e. 

Note  that  the  approximations  we  have  been  making  correspond  to  using  a  Taylor 
series  through  the  /'  term.  We  can  write  Chapter  1  equation  (12.8)  with  the 
replacements  x  — >  x  +  Ax,  a  — >  x,  x  —  a  — >  Ax,  to  get 

fix  +  Ax)  =  f(x)  +  f'(x) Ax  +  f"(x)( Axf/2!  H - . 

Dropping  the  (Ax)2  and  higher  terms  we  have  the  approximation  we  have  been 
using: 

df  =  A  f  =  f(x  +  Ax)  -  f(x)  =  f\x)  Ax  =  f(x)dx. 

PROBLEMS,  SECTION  4 

1.  Use  differentials  to  show  that,  for  very  large  n ,  - — - — — - -r  = 

J  &  (n  +  l)3  n3 

2.  Use  differentials  to  show  that,  for  large  n  and  small  a,  y/n  +  a  — 

Find  the  approximate  value  of  a/1026  +  5  —  \/l026. 

3.  The  thin  lens  formula  is 

1  1  1 
-  H —  —  7, 
i  o  f 

where  /  is  the  focal  length  of  the  lens  and  o  and  i  are  the  distances  from  the  lens 
to  the  object  and  image.  If  i  =  15  when  o  =  10,  use  differentials  to  find  i  when 
o  =  10.1. 

4.  Do  Problem  3  if  i  =  12  when  o  =  18,  to  find  i  if  o  =  17.5. 

5.  Let  R  be  the  resistance  of  Ri  =  25  ohms  and  R2  =  15  ohms  in  parallel.  (See  Chapter 

2,  Problem  16.6.)  If  Ri  is  changed  to  25.1  ohms,  find  R2  so  that  R  is  not  changed. 

6.  The  acceleration  of  gravity  can  be  found  from  the  length  l  and  period  T  of  a  pen¬ 
dulum;  the  formula  is  g  =  Att21/T2.  Find  the  relative  error  in  g  in  the  worst  case  if 
the  relative  error  in  l  is  5%,  and  the  relative  error  in  T  is  2%. 

7.  Coulomb’s  law  for  the  force  between  two  charges  qi  and  52  at  distance  r  apart  is 

F  =  kqiq2/r2.  Find  the  relative  error  in  52  in  the  worst  case  if  the  relative  error  in 

qi  is  3%;  in  r,  5%;  and  in  F,  2%. 

8.  About  how  much  (in  percent)  does  an  error  of  1%  in  a  and  b  affect  o2b3? 

9.  Show  that  the  approximate  relative  error  ( df)/ f  of  a  product  f  =  gh  is  the  sum  of 
the  approximate  relative  errors  of  the  factors. 

10.  A  force  of  500  nt  is  measured  with  a  possible  error  of  1  nt.  Its  component  in  a 
direction  60°  away  from  its  line  of  action  is  required,  where  the  angle  is  subject  to  an 
error  of  0.5°.  What  is  (approximately)  the  largest  possible  error  in  the  component? 
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11. 


12. 

13. 

14. 


15. 


16. 


Show  how  to  make  a  quick  estimate  (to  two  decimal  places)  of  (4.98)2  —  (3.03)2 
without  using  a  computer  or  a  calculator.  Hint:  Consider  f(x,y)  =  \/x2  —  y2. 

As  in  Problem  11,  estimate  \/(2.05)2  +  (1.98)2. 

Without  using  a  computer  or  a  calculator,  estimate  the  change  in  length  of  a  space 
diagonal  of  a  box  whose  dimensions  are  changed  from  200  x  200  x  100  to  201  x  202  x  99. 

Estimate  the  change  in 

f(x)  =  ^  —  dt 

Jo  t2  +  0.51 

if  x  changes  from  0.7  to  0.71. 


For  an  ideal  gas  of  N  molecules,  the  number  of  molecules  with  speeds  <  v  is  given 
by  the  formula 


i(v)  = 


4a3N 


f 

Jo 


2 

x  e 


dx , 


where  a  is  a  constant  and  N  is  the  total  number  of  molecules.  If  IV  =  1026, 
the  number  of  molecules  with  speeds  between  v  =  1/a  and  1.01/a. 


estimate 


The  operating  equation  for  a  synchrotron  in  the  relativistic  range  is 

qB  =  wm[l  —  ((ui7)2/c2]_1^2, 


where  q  and  m  are  the  charge  and  rest  mass  of  the  particle  being  accelerated,  B  is 
the  magnetic  field  strength,  R  is  the  orbit  radius,  ui  is  the  angular  frequency,  and 
c  is  the  speed  of  light.  If  t u  and  B  are  varied  (all  other  quantities  constant),  show 
that  the  relation  between  du  and  dB  can  be  written  as 


dB  (  q  \2  dui 

Ip  =  \m)  HJs’ 


dB  dui .  ,  D  ,  \2i  — 1 


17. 


Here  are  some  other  ways  of  obtaining  the  formula  in  Example  2. 

(a)  Combine  the  two  fractions  to  get  (2n+  l)/[n2(n  +  l)2].  Then  note  that  for  large 
n,  2n  +  1  =  2n  and  n  +  1  =  n. 


(b)  Factor  the  expression  as  (  — =■  |  (  1 - 77 

Wjy  (1+i)2 

mial  series  to  two  terms,  and  then  simplify. 


,  expand 


by  bino- 


5.  CHAIN  RULE  OR  DIFFERENTIATING  A  FUNCTION  OF  A  FUNCTION 

You  already  know  about  the  chain  rule  whether  you  have  called  it  that  or  not.  Look 
at  this  example. 


Examplel.  Find  dy/dx  if  y  =  lnsin2x. 

You  would  say 

.  1  •  cos  2x  ■  -7-  (2x)  =  2  cot  2x. 

sin  2x  dx 

u  =  sinu  and  v  =  2x. 

dy  du  dv 
dx  du  dv  dx 

This  is  an  example  of  the  chain  rule.  We  shall  want  a  similar  equation  for  a  function 
of  several  variables.  Consider  another  example. 


dy_ 

dx 


—  (sin  2x)  = 


sin  2x  dx 
We  could  write  this  problem  as 

y  =  \nu,  where 
Then  we  would  say 


dy 
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Example  2.  Find  dz/dt  if  2  =  2t2  sint. 

Differentiating  the  product,  we  get 

dz  .  .  „  9 

—  =  4t  sin  t  +  2 1  cos  t. 
dt 

We  could  have  written  this  problem  as 

z  =  xy1  where  x  =  2 12  and  y  =  sin  t, 

dz  dx  dy 
dt=V~dt+Xdi ' 

But  since  x  is  dz/dy  and  y  is  dz/dx ,  we  could  also  write 

,  .  dz  dz  dx  dz  dy 

dt  dx  dt  dy  dt 


We  would  like  to  be  sure  that  (5.1)  is  a  correct  formula  in  general,  when  we 
are  given  any  function  z{x,  y)  with  continuous  partial  derivatives  and  x  and  y  are 
differentiable  functions  of  t.  To  see  this,  recall  from  our  discussion  of  differentials 
that  we  had 

dz  dz 

(5.2)  Az  = —Ax  + —^Ay  +  eiAx  +  e2Ay, 

dx  dy 

where  ei  and  £2  — >  0  with  Ax  and  Ay.  Divide  this  equation  by  At  and  let  At  — >  0; 
since  Ax  and  Ay  — >  0,  £1  and  £2  — >  0  also,  and  we  get  (5.1). 

It  is  often  convenient  to  use  differentials  rather  than  derivatives  as  in  (5.1).  We 
would  like  to  be  able  to  use  (3.6),  but  in  (3.6)  x  and  y  were  independent  variables 
and  now  they  are  functions  of  t.  However,  it  is  possible  to  show  (Problem  8)  that  dz 
as  defined  in  (3.6)  is  a  good  approximation  to  A z  even  though  x  and  y  are  related. 
We  may  then  write 


dz  dz 

(5.3) 

dz  =  —dx  +  —dy 
dx  dy 

whether  or  not  x  and  y  are  independent  variables,  and  we  may  think  of  getting  (5.1) 
by  dividing  (5.3)  by  dt.  This  is  very  convenient  in  doing  problems.  Thus  we  could 
do  Example  2  in  the  following  way: 

dz  =  xdy  +  ydx  =  x  cos  tdt  +  y  -4tdt=  (2 12  cos  t  +  4t  sin  t)dt, 

dz  „  ,  ,  . 

—  =  2 1  cos  t  +  4t  sin  t. 

dt 

In  doing  problems,  we  may  then  use  either  differentials  or  derivatives.  Here  is 
another  example. 
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Example  3.  Find  dz/dt  given  z  =  xv ,  where  y  =  tan  1 1,  x  =  sinf. 

Using  differentials,  we  find 

dz  =  yxv~x  dx  +  xv  In  xdy  =  yxv~ 1  cos  t  dt  +  xv  In  x  ■  - - 

1  +  tz 

=  yxy_1  cost  +  xy  lnx  •  ^  . 

dt  1  +  tz 

You  may  wonder  in  a  problem  like  this  why  we  don’t  just  substitute  x  and  y 
as  functions  of  t  into  z  =  xv  to  get  z  as  a  function  of  t  and  then  differentiate. 
Sometimes  this  may  be  the  best  thing  to  do  but  not  always.  For  example,  the 
resulting  formula  may  be  very  complicated  and  it  may  save  a  lot  of  algebra  to  use 
(5.1)  or  (5.3).  This  is  especially  true  if  we  want  dz/dt  for  a  numerical  value  of  t. 
Then  there  are  cases  when  we  cannot  substitute;  for  example,  if  a:  as  a  function  of  t 
is  given  by  x+ex  =  t,  we  cannot  solve  for  x  as  a  function  of  t  in  terms  of  elementary 
functions.  But  we  can  find  dx/dt ,  and  so  we  can  find  dz/dt  by  (5.1).  Finding  dx/dt 
from  such  an  equation  is  called  implicit  differentiation;  we  shall  discuss  this  process 
in  the  next  section. 

Computers  can  find  derivatives,  so  why  should  we  learn  the  methods  shown 
here  and  in  the  following  sections?  Perhaps  the  most  important  reason  is  that  the 
techniques  are  needed  in  theoretical  derivations.  However,  there  is  also  a  practical 
reason:  When  a  problem  involves  a  number  of  variables,  there  may  be  many  ways 
to  express  the  answer.  (You  might  like  to  verify  that  dz/dt  =  z(ycott  +  lnxcos2  y) 
is  another  form  for  the  answer  in  Example  3  above).  Your  computer  may  not  give 
you  the  form  you  want  and  it  may  be  as  easy  to  do  the  problem  by  hand  as  to 
convert  the  computer  result.  But  in  problems  involving  a  lot  of  algebra,  a  computer 
can  save  time,  so  a  good  study  method  is  to  do  problems  both  by  hand  and  by 
computer  and  compare  results. 


PROBLEMS,  SECTION  5 

1.  Given  z  =  xe~y,  x  =  cosht,  y  =  cos  t,  find  dz/dt. 

2.  Given  w  =  \/ u2  +  v2,  u  =  cos[lntan(p  +  ^7r)],  v  =  sin[lntan(p  +  ^7r)],  find  dw/dp. 

3.  Given  r  =  e~p  ~q  ,  p  =  es ,  q  =  e~s,  find  dr/ds. 

4.  Given  x  =  ln(w2  —  v2),  u  =  t2,  v  =  cost,  find  dx/dt. 

5.  If  we  are  given  z  =  z(x,  y )  and  y  =  y{x),  show  that  the  chain  rule  (5.1)  gives 

dz  dz  dz  dy 
dx  dx  dy  dx 

6.  Given  z  =  (x  +  y)5,  y  =  sin  10a:,  find  dz/dx. 

7.  Given  c  =  sin(a  —  6),  b  =  ae2a,  find  dc/da. 

8.  Prove  the  statement  just  after  (5.2),  that  dz  given  by  (3.6)  is  a  good  approximation 
to  Az  even  though  dx  and  dy  are  not  independent.  Hint:  let  x  and  y  be  functions 
of  t;  then  (5.2)  is  correct,  but  Aa:  ^  dx  and  Ay  ^  dy  (because  x  and  y  are  not 
independent  variables).  However,  Ax/ At  is  nearly  dx/dt  for  small  dt  and  dt  =  At 
since  t  is  the  independent  variable.  You  can  then  show  that 


Ax 


dt  =  dx  +  exdt, 
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and  a  similar  formula  for  Ay  ,  and  get 


dz  dz 

A z  =  T—dx  +  —  du  +  (terms  containing  e’s)  •  dt  =  dz  +  edt, 
ox  ay 


where  e  — >  0  as  At  — >  0. 


6.  IMPLICIT  DIFFERENTIATION 


Some  examples  will  show  the  use  of  implicit  differentiation. 


Example  1.  Given  x  +  ex  =  t.  find  dx/dt  and  d2x/dt2 . 

If  we  give  values  to  x,  find  the  corresponding  t  values,  and  plot  x  against  t,  we 
have  a  graph  whose  slope  is  dx/dt.  In  other  words,  x  is  a  function  of  t  even  though 
we  cannot  solve  the  equation  for  x  in  terms  of  elementary  functions  of  t.  To  find 
dx/dt ,  we  realize  that  x  is  a  function  of  t  and  just  differentiate  each  term  of  the 
equation  with  respect  to  t  (this  is  called  implicit  differentiation) .  We  get 


(6.1) 


dx  _  dx 

- Ve  — 

dt  dt 


Solving  for  dx/dt ,  we  get 

dx  1 
dt  1  +  ex 

Alternatively,  we  could  use  differentials  here,  and  write  first  dx+exdx  =  dt;  dividing 
by  dt  then  gives  (6.1). 

We  can  also  find  higher  derivatives  by  implicit  differentiation  (but  do  not  use 
differentials  for  this  since  we  have  not  given  any  meaning  to  the  derivative  or  dif¬ 
ferential  of  a  differential).  Let  us  differentiate  each  term  of  (6.1)  with  respect  to  t: 
we  get 


(6.2) 


d2x 

~di2 


=  0. 


Solving  for  d2x/dt 2  and  substituting  the  value  already  found  for  dx/dt ,  we  get 


(6.3) 


d2x  ~e*(fr 

dt2  1  +  ex 


(1  +  ex)3 


This  problem  is  even  easier  if  we  want  only  the  numerical  values  of  the  derivatives 
at  a  point.  For  x  =  0  and  t  =  1,  (6.1)  gives 


dx  dx 

dt  dt 


or 


dx 

dt 


and  (6.2)  gives 


d2  x  d2x  ( 1'. 


or 


d2  x 
dt 2 


Implicit  differentiation  is  the  best  method  to  use  in  finding  slopes  of  curves  with 
complicated  equations. 
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Example  2.  Find  the  equation  of  the  tangent  line  to  the  curve  ar3  —  3 y3  +  xy  +  21  =  0  at 
the  point  (1, 2). 

We  differentiate  the  given  equation  implicitly  with  respect  to  x  to  get 


2  dy  ,  __  dy 


3x  —  9  y  —  +  x—  +  y  =  0. 


dx 


dx 


Substitute  x  =  1,  y  =  2: 


„  „  „  dy  dy  dy  5  1 

3  —  36  — — b-T-  +  2  =  0,  —  =  —  = 

dx  dx  dx  35  7 

Then  the  equation  of  the  tangent  line  is 

y- 2  1  -  ,  1Q  n 

- —  =  —  or  x  —  7y  13  =  0. 

x  —  1  7 

By  computer  plotting  the  curve  and  the  tangent  line  on  the  same  axes,  you  can 
check  to  be  sure  that  the  line  appears  tangent  to  the  curve. 

PROBLEMS,  SECTION  6 

1.  If  pva  =  C  (where  a  and  C  are  constants),  find  dv/dp  and  d2v/dp2 . 

2.  If  yexy  =  sin®  find  dy/dx  and  d2y/dx2  at  (0,  0). 

3.  If  xy  =  yx,  find  dy/dx  at  (2,4). 

4.  If  xev  =  yex,  find  dy/dx  and  d2y/dx 2  for  y  ^  f. 

5.  If  x y3  —  yx3  —  6  is  the  equation  of  a  curve,  find  the  slope  and  the  equation  of  the 
tangent  line  at  the  point  (1,2).  Computer  plot  the  curve  and  the  tangent  line  on 
the  same  axes. 

6.  In  Problem  5  find  d2y/dx 2  at  (1,2). 

7.  If  y3  —  x2y  =  8  is  the  equation  of  a  curve,  find  the  slope  and  the  equation  of  the 
tangent  line  at  the  point  (3,  —1).  Computer  plot  the  curve  and  the  tangent  line  on 
the  same  axes. 

8.  In  Problem  7  find  d2y/dx2  at  (3,  —1). 

9.  For  the  curve  x2//3  +  y2^3  =  4,  find  the  equations  of  the  tangent  lines  at  (2y/2,  — 2\/2), 
at  (8,0),  and  at  (0,8).  Computer  plot  the  curve  and  the  tangent  lines  on  the  same 
axes. 

10.  For  the  curve  xey  +  yex  =  0,  find  the  equation  of  the  tangent  line  at  the  origin. 
Caution:  Substitute  x  =  y  =  0  as  soon  as  you  have  differentiated.  Computer  plot 
the  curve  and  the  tangent  line  on  the  same  axes. 

11.  In  Problem  10,  find  y"  at  the  origin. 

7.  MORE  CHAIN  RULE 

Above  we  have  considered  z  =  f(x,y),  where  x  and  y  are  functions  of  t.  Now 
suppose  z  =  f(x ,  y)  as  before,  but  x  and  y  are  each  functions  of  two  variables  s  and 
t.  Then  z  is  a  function  of  s  and  t  and  we  want  to  find  dz/ds  and  dz/dt.  We  show 
by  some  examples  how  to  do  problems  like  this. 
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Example  1.  Find  dz/ds  and  dz/dt  given 

z  =  xy ,  a;  =  sin(s  +  t),  y  =  s  —  t. 

We  take  differentials  of  each  of  the  three  equations  to  get 

dz  =  ydx  +  xdy,  dx  =  cos  (s  +  t)  ( ds  +  dt) ,  dy  =  ds  —  dt. 

Substituting  dx  and  dy  into  dz,  we  get 

(7.1)  dz  =  ycos(s  +  t)(ds  +  dt)  +  x(ds  —  dt) 

=  [y  cos (s  +  t)  +  x]  ds  +  [y  cos (s  +  t)  —  a;]  dt. 

Now  if  s  is  constant,  ds  =  0,  2  is  a  function  of  one  variable  t,  and  we  can  divide 
(7.1)  by  dt  [see  (5.1)  and  the  discussion  following  it].  For  dz  +  dt  on  the  left  we  write 
dz/dt  because  that  is  the  notation  which  properly  describes  what  we  are  finding, 
namely,  the  rate  of  change  of  z  with  t  when  s  is  constant.  Thus  we  have 

dz 

=  y  cos(s  +  t)-x 

and  similarly 

dz 

—1  =  y  COs(s  +  t)  +  X. 
ds 

Notice  that  in  (7.1)  the  coefficient  of  ds  is  dz/ds  and  the  coefficient  of  dt  is  dz/dt 
[also  compare  (5.3)].  If  you  realize  this,  you  can  simply  read  off  dz/ds  and  dz/dt 
from  (7.1). 

We  can  do  problems  with  more  variables  in  the  same  way. 


Example 2.  Find  du/ds,  du/dt,  given  u  =  x2  +  2 xy  —  y\nz  and 
x  =  s  +  t2 ,  y  =  s  —  t2 ,  z  =  2t. 

We  find 


V 

du  =  2 xdx  +  2x  dy  +  2 ydx - dz  —  hizdy 

z 

=  (2x  +  2 y)  (ds  +  2 1  dt)  +  (2x  —  In  z)  (ds  —  2 1  dt)  —  —  (2  dt) 

z 

=  (4a;  +  2 y  —  In z)  ds  +  \  4yt  +  2t\nz  —  —  J  dt. 


Then 


—  =  4a;  +  2y  —  In  2, 
ds 


du 

~dt 


Ayt  +  2t  In  z 


2 y 

z 


If  we  want  just  one  derivative,  say  du/dt ,  we  can  save  some  work  by  letting  ds  =  0 
to  start  with.  To  make  it  clear  that  we  have  done  this,  we  write 


dus 


(2x  +  2y)(2t  dt)  +  (2a;  —  In  z)(—2t  dt)  —  —(2  dt) 

z 

f  Ayt  +  2t  In  z - -  J  dt. 
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The  subscript  s  indicates  that  s  is  being  held  constant.  Then  dividing  by  dt,  we 
have  du/dt  as  before.  We  could  also  use  derivatives  instead  of  differentials.  By  an 
equation  like  (5.1),  we  have 


(7.2) 


du  du  dx  du  dy  du  dz 
dt  dx  dt  +  dy  dt  ^  dz  dt  ’ 


where  we  have  written  all  the  t  derivatives  as  partials  since  it,  x ,  y ,  and  z  depend 
on  both  s  and  t.  Using  (7.2),  we  get 


—  =  ( 2x  +  2y)(2t)  +  (2x  —  lnz)(— 2t)  +  (—  —  'j  (2)  =  4 yt  +  2tlnz  —  — . 
dt  \  z)  z 


It  is  sometimes  useful  to  write  chain  rule  formulas  in  matrix  form  (for  matrix 
multiplication,  see  Chapter  3,  Section  6).  Given,  as  above,  u  =  f(x,y,z),  x(s,t), 
y(s,t),  z(s,t ),  we  can  write  equations  like  (7.2)  in  the  following  matrix  form: 


(7.3) 


/ du  du\ 
\d~s~di) 


(  du  du  du 
\<9a;  dy  dz 


/dx 

dx  \ 

~dt 

dy 

dy 

ds 

dt 

dz 

dz 

V  ds 

dt / 

[Sometimes  (7.3)  is  written  in  the  abbreviated  form 


d(u)  d(u)  d(x,y,z) 
d(s,t)  d{x,y,z)  d(s,  t) 


which  is  reminiscent  of 

dy  dy  dx 
dt  dx  dt 1 

but  be  careful  of  this  for  two  reasons:  (a)  It  may  be  helpful  in  remembering  the 
formula  but  to  use  it  you  must  understand  that  it  means  the  matrix  product  (7.3). 
(b)  The  symbol  d{u,  v) /d{x,  y)  usually  means  a  determinant  rather  than  a  matrix 
of  partial  derivatives — see  Chapter  5,  Section  4]. 

Again  in  these  problems,  you  may  say,  why  not  just  substitute?  Look  at  the 
following  problem. 


Example  3.  Find  dz/ dt  given  z  =  x  —  y  and 


2,2  ,2 

x  +y  =  t  , 

x  sin  t  =  yev . 


From  the  z  equation,  we  have 


dz  =  dx  —  dy. 

We  need  dx  and  dy ;  here  we  cannot  solve  for  x  and  y  in  terms  of  t.  But  we  can 
find  dx  and  dy  in  terms  of  dt  from  the  other  two  equations  and  this  is  all  we  need. 
Take  differentials  of  both  equations  to  get 

2x  dx  +  2 y  dy  =  2 1  dt, 
sin  tdx  +  x  cos  tdt=  ( yev  +  ev)dy. 
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Rearrange  terms: 


x  dx  +  y  dy  =  t  dt, 
sin  tdx  —  (y  +  1  )evdy  =  — x  cos  t  dt. 

Solve  for  dx  and  dy  (in  terms  of  dt)  by  determinants: 

t(y  +  l)ev  +  xy  cos  t 
-x(y  +  l)ev  —  ysint 

and  similarly  for  dy.  Substituting  dx  and  dy  into  the  formula  for  dz  and  dividing 
by  dt,  we  get  dz/dt.  A  computer  may  save  us  some  time  with  the  algebra. 

We  can  also  do  problems  like  this  when  x  and  y  are  given  implicitly  as  functions 
of  two  variables  s  and  t. 


dx  = 


tdt  y 

-xcostdt  ~{y+  l)ev 


x  y 

sinf  —  (y  +  l)ev 


Example  4.  Find  dz/ds  and  dz/dt  given 

z  =  x2  +  xy, 
x2  +  y3  =  st  +  5, 
x  — y  =  s  +t  . 

We  have  dz  =  2xdx  +  xdy  +  ydx.  To  find  dx  and  dy  from  the  other  two 
equations,  we  take  differentials  of  each  equation: 

2x  dx  +  3y2  dy  =  sdt  +  t  ds, 

(7.4)  o 

3x2  dx  —  2ydy  =  2s  ds  +  2t  dt. 


We  can  solve  these  two  equations  for  dx  and  dy  in  terms  of  ds  and  dt  to  get 


s  dt  +  tds 

2s  ds  +  2 1  dt 

3  y2 
-2  y 

(—2 ys  —  6 ty2)  dt  +  {—2yt  —  6 sy2)  ds 

2x  3  y2 
3x2  —2  y 

—4  xy  —  9  x2y2 

and  a  similar  expression  for  dy.  We  substitute  these  values  of  dx  and  dy  into  dz  and 
find  dz  in  terms  of  ds  and  dt  just  as  in  Example  1;  we  can  then  write  dz/ds  and 
dz/dt  just  as  we  did  there  (Problem  11).  Notice  that  if  we  want  only  one  derivative, 
say  dz/dt,  we  could  save  some  algebra  by  putting  ds  =  0  in  (7.4).  Also  note  that 
we  can  save  some  algebra  if  we  want  the  derivatives  only  at  one  point.  Suppose  we 
were  asked  for  dz/ds  and  dz/dt  at  x  =  3,  y  =  1,  s  =  1,  t  =  5.  We  substitute  these 
values  into  (7.4)  to  get 


6  dx  +  3  dy  =  dt  +  5  ds, 

27  dx  —  2  dy  =  10  dt  +  2  ds. 


We  solve  these  equations  for  dx  and  dy  and  substitute  into  dz  just  as  before,  but 
the  algebra  is  easier  with  the  numerical  coefficients  (Problem  11). 
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So  far,  we  have  been  assuming  that  the  independent  variables  were  “natural” 
pairs  like  x  and  y ,  or  s  and  t.  For  example,  we  wrote  dx/ds  above,  taking  it  for 
granted  that  the  variable  held  constant  was  t.  In  some  applications  (particularly 
thermodynamics),  it  is  not  at  all  clear  what  the  other  independent  variable  is  and 
we  have  to  be  more  explicit.  We  write  ( dx/ds)t ;  this  means  that  s  and  t  are  the 
two  independent  variables,  that  x  is  thought  of  as  a  function  of  them,  and  then  x 
is  differentiated  partially  with  respect  to  s.  Suppose  we  try  to  find  from  the  three 
equations  of  Example  4  a  rather  peculiar  looking  derivative. 


Example  5.  Given  the  equations  of  Example  4,  find  ( ds/dz)x . 

First,  let  us  see  that  the  question  makes  sense.  There  are  five  variables  in  the 
three  equations.  If  we  give  values  to  two  of  them,  we  can  solve  for  the  other  three; 
that  is,  there  are  two  independent  variables,  and  the  other  three  are  functions  of 
these  two.  If  z  and  x  are  the  independent  ones,  then  s,  t ,  and  y  are  functions  of 
z  and  x\  we  should  be  able  to  find  their  partial  derivatives,  for  example  ( ds/dz)x 
which  we  wanted.  To  carry  out  the  necessary  work,  we  first  rearrange  equations 
(7.4)  and  the  dz  equation  to  get 

— x  dy  =  (2x  +  y)  dx  —  dz, 
t  ds  +  s  dt  —  3 y2  dy  =  2x  dx, 

2s  ds  +  2 tdt  +  2 y  dy  =  3 x2dx. 


From  these  three  equations  we  could  solve  for  ds,  dt,  and  dy  in  terms  of  dx  and  dz 
(by  determinants  or  by  elimination — the  same  methods  you  use  to  solve  any  set  of 
linear  equations).  Then  we  could  find  any  partial  derivative  of  s(x,z),  t{x,z),  or 
y(x,  z)  with  respect  to  x  to  z.  For  example,  to  find  ( dy/dz)x ,  we  get  from  the  first 
equation 


dy  =  —  dz  ~  dx, 

X  X 


( ®1\  =  I 

\9z)x  X' 


Note  that  we  would  not  need  to  differentiate  all  three  equations  if  we  wanted  only 
this  derivative;  you  should  always  look  ahead  to  see  how  much  differentiation  is 
necessary!  To  find  ( ds/dz)x ,  we  must  solve  the  three  equations  for  ds  in  terms  of 
dx  and  dz ;  we  can  save  ourselves  some  work  [if  we  want  only  (ds/dz)x\  by  putting 
dx  =  0  to  start  with.  To  make  it  clear  that  we  have  done  this  we  write  dsx  and 
dzx.  Then  we  get 


—x 
-3  y2 
2  y 

0  0  —x 
t  s  —3  y2 
2s  2 t  2 y 


— dzx  0 

0  s 

0  2 1 


-(2  sy  +  6ty2)dzx 
—x(2 12  —  2s2) 


/  ds\  sy  +  3ty2 
\dz)x  x(t2  -  s2)' 

We  could  use  a  computer  to  save  us  some  algebra  in  this  problem. 
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Example  6.  Let  x,  y  be  rectangular  coordinates  and  r,  9  be  polar  coordinates  in  a  plane. 
Then  the  equations  relating  them  are 


(7.5) 

x  =  rcos0, 
y  =  rsin0, 

or 

(7.6) 

r  =  \J  x2  +  y- 

9  =  tan”1  — . 

X 

Suppose  we  want  to  find  dO/dx.  Remembering  that  if  y  =  f(x),  dy/dx  and  dx/dy 
are  reciprocals,  you  might  be  tempted  to  find  89/ dx  by  taking  the  reciprocal  of 
dx/89 ,  which  is  easier  to  find  than  89 /dx.  This  is  wrong.  From  (7.6)  we  get 

(7  7)  —  =  _  _y_ 

8x  l  +  (y2/x2)  r2' 


From  x  =  r  cos  9  we  get 

dx 

(7.8)  —  =  -r  sin  9  =  -y. 

These  are  not  reciprocals.  You  should  think  carefully  about  the  reason  for  this; 
89/8x  means  (89/8x)y,  whereas  dx/89  means  (8x/89)r.  In  one  case  y  is  held 
constant  and  in  the  other  case  r  is  held  constant;  this  is  why  the  two  derivatives 
are  not  reciprocals.  It  is  true  that  ( 89/dx)y  and  ( 8x/89)y  are  reciprocals.  But 
to  find  ( 8x/89)y  directly,  we  have  to  express  £  as  a  function  of  9  and  y.  We  find 
x  =  y  cot  9 ,  so  we  get 


(7.9) 


r 


2 


? 

y 


which  is  the  reciprocal  of  89 /dx  in  (7.7). 


This  is  a  general  rule:  8u/8v  and  8v/8u  are  not  usually  reciprocals;  they  are 
reciprocals  if  the  other  independent  variables  (besides  u  or  v )  are  the  same  in 
both  cases. 


You  can  see  this  clearly  from  the  equations  involving  differentials.  From  the  equa¬ 
tion  9  =  arc  tan (y/x),  we  can  find 


(7.10) 


d6  = 


xdy  —  y  dx 


1  + 


y 


xdy  —  y  dx 


From  x  =  r  cos  9 ,  we  get 

X 

(7.11)  dx  =  cos  9  dr  —  r  sin  9  d9  =  —  dr  —  y  d9. 

r 

From  (7.10),  if  y  is  constant,  dy  =  0,  and  we  can  write 

(7.12)  d9y  =  -^dxy, 
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where  the  y  subscript  indicates  that  y  is  constant.  From  (7.12),  we  then  find  either 


ae\ 

dxJn 


ddy 

dxn 


or 


fdx 

\de 


dXy 

ddv 


and  these  are  reciprocals.  From  (7.11),  however,  we  can  find  ( dx/dd)r  or  ( 89/dx)r; 
these  are  again  reciprocals  of  each  other,  but  are  different  from  the  derivatives  found 
from  (7.12). 

It  is  interesting  to  write  equations  like  (7.11)  in  matrix  notation: 

(7.13) 


fdx 

dx\ 

dr 

~d9 

dy 

dy 

V  dr 

89) 

cos  6  —rsind 
sin  9  r  cos  9 


=  A 


where  A  stands  for  the  square  matrix  in  (7.13).  Similarly,  we  can  write 


(7.14) 


/dr  dr\ 
dx  dy 

d9  d9 
\dx  dy) 


=  A 


-l 


dx 

dy )' 


we  have  written  the  square  matrix  as  A  1  since  by  (7.13), 
,  -l  (dx 


A~ 


dy 


=  A~lA 


Then,  finding  A  1  (Problem  9)  and  using  (7.14),  we  have 


(7.15) 


/ dr  dr\ 
dx  dy 

d9  d9 
\dx  dy) 


(cos  9  sin  9  N 

—  sin  9  -  cos  9. 

t  r  / 


We  can  simply  read  off  the  four  partial  derivatives  of  r,  9 ,  with  respect  to  x,  y ,  from 
equation  (7.15).  (Also  see  Problem  9.)  Also  using  (7.5),  and  specifically  noting  that 
x  and  y  are  independent  variables,  we  have: 


=  cos  9  = 


(7.16) 


dr  /  dr  \ 
dx  \dx)y 

89  _  ("\  _  1  •  V 

dx  \dx)y  r Sin  r2 


dr 

dy  \dy 


f  dr 
d9 


=  sin  9 


.7 


f  89 


dy  Vdy 


1  a  x 
=  -  cos  0  =  — . 


[In  the  notation  mentioned  just  after  (7.3),  we  could  write 


-i  d(x,  y)  d(r,6) 


AA”1  = 


d(r,9)  d(x,y) 


=  unit  matrix 


thus,  although  the  individual  pairs  of  partial  derivatives  discussed  above  are  not 
reciprocals,  the  two  matrices  of  partial  derivatives  are  inverses.] 
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PROBLEMS,  SECTION  7 

1.  If  x  =  ?/z  and  y  =  2sin(?/  +  z),  find  dx/dy  and  d2x/dy2. 

2.  If  P  =  r  cos  t  and  r  sin  t  —  2 ter  =  0,  find  dP/dt. 

3.  If  z  =  xe~y  and  x  =  cosht,  y  =  cos  s,  find  dz/ds  and  dz/dt. 

4.  If  w  =  e~r  ~s  ,  r  =  uv,  s  =  u  +  2v,  find  dw/du  and  dw/dv. 

5.  If  u  =  x2y3z  and  x  =  sin(s  +  t),  y  =  cos(s  +  t),z~  est,  find  du/ds  and  du/dt. 

6.  If  w  =  f(x,  y)  and  x  =  r  cos  6,  y  =  r  sin#,  find  formulas  for 
dw/dr,  dw/dd,  and  d2w/dr 2 . 

7.  If  x  =  r  cos  8  and  y  =  rsin#,  find  ( 8y/89)r  and  ( dy/d8)x ■  Also  find  (d8/dy)x  in 
two  ways  (by  eliminating  r  from  the  given  equations  and  then  differentiating,  or  by 
taking  differentials  in  both  equations  and  then  eliminating  dr).  When  are  dy/dd 
and  88/ dy  reciprocals? 

8.  If  xs2  +  yt2  =  1  and  x2s  +  y2t  =  xy  —  4,  find  dx/ds,  dx/dt,  dy/ds,  dy/dt,  at 

(, x,y,s,t )  =  (1,  —  3,2,  —  1).  Hint:  To  simplify  the  work,  substitute  the  numerical 

values  just  after  you  have  taken  differentials. 

9.  Verify  (7.16)  in  three  ways: 

(a)  Differentiate  equations  (7.6). 

(b)  Take  differentials  of  (7.5)  and  solve  for  dr  and  d8. 

(c)  Find  A^1  in  (7.15)  from  A  in  (7.13);  note  that  this  is  (b)  in  matrix  notation. 

10.  If  x2  +  y2  =  2 st  —  10  and  2xy  =  s2  —  t2,  find  dx/ds,  dx/dt,  dy/ds,  dy/dt  at 
( x,y,s,t )  =  (4,  2,  5,  3). 

11.  Finish  Example  4  above,  both  for  the  general  case  and  for  the  given  numerical  values. 
Substitute  the  numerical  values  into  your  general  formulas  to  check  your  answers. 

12.  If  w  =  x  +  y  with  x3  +  xy  +  y3  =  s  and  x2y  +  xy2  =  t,  find  dw/ds,  dw/dt. 

13.  If  to  =  pq  with  a  sin  p  —  p  =  q  and  6  cos  q  +  q  =  p,  find  ( dp/dq)m ,  ( dp/dq)a ,  ( dp/dq)b , 

(■ db/da)p ,  ( da/dq)m ■ 

14.  If  u  =  x2  +  y2  +  xyz  and  x4  +  y4  +  z4  =  2x2y2z 2  +  10,  find  ( du/8x)z  at  the  point 
[x,y,z)  =  (2, 1, 1). 

15.  Given  x2u  —  y2v  =  1,  and  x  +  y  =  uv.  Find  ( dx/8u)v ,  [dx/du)y. 

16.  Let  w  =  x2  +  xy  +  z2. 

(a)  If  x3  +  *  =  3t,  y4  +  y  =  4t,  z5  +  z  =  5f,  find  dw/dt. 

(b)  If  y3  +  xy  =  1  and  z3  —  xz  =  2,  find  dw/dx. 

(c)  If  x3z  +  z3y  +  y3x  =  0,  find  (dw/dx)y. 

17.  If  p3  +  sq  =  t,  and  q3  +tp  =  s,  find  ( dp/ds)t ,  ( dp/ds)q  at  (p,  q,  s,  t)  =  (—1,  2,  3,  5). 

18.  If  m  =  a  +  b  and  n  =  a2  +  b2  find  ( db/dm)n  and  ( dm/db)a ■ 

19.  If  z  =  r  +  s2 ,  x  +  y  =  s3  +  r3  —  3,  xy  =  s2  —  r2,  find  ( 8x/dz)s ,  ( dx/dz)r ,  ( dx/dz)y 

at  (r,  s,  x,  y,  z)  =  (-1,  2,  3, 1,  3). 

20.  If  u 2  +  v2  =  x3  —  y3  +  4,  u2  —  v2  =  x2y2  +  1,  find  (du/dx)y,  ( du/dx)v ,  (dx/du) y, 
( dx/du)v  at  ( x ,  y,  u ,  v)  =  (2,  —1,  3,  2). 
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21.  Given  x2  +  y2  +  z2  =  6,  and  w3  +z3  =  5 xy  + 12,  find  the  following  partial  derivatives 
at  the  point  (x,y,  z,  w)  =  (1,  —2, 1, 1). 


<9z\ 

dxL 


dw\ 

d^)z 


dx  \ 
dw)z 


22.  If  w  =  f(ax  +  by),  show  that  b  —  a  =  0. 

ox  ay 

Hint:  Let  ax  +  by  =  z. 


23.  If  u  =  f(x  —  ct)  +  g(x  +  ct),  show  that 


d2u  1  d2u 
dx 2  c2  dt2  ' 


dz  dz 

24.  If  z  =  cos(a:w),  show  that  x— -  —  y—  =  0. 

dx  dy 

25.  The  formulas  of  this  problem  are  useful  in  thermodynamics, 
(a)  Given  /( x,  y,  z)  =  0,  find  formulas  for 


dx\ 

dy)z’ 


(b)  Show  that 


and 


dy 


dy 


and 


dx)  ' 

'V 


32 1 1^1  Si 


dx 


dx 


dy 


dy 


dz 


^  ^  ^  =-l. 


dz 


dx 


(c)  If  x,  y,  z  are  each  functions  of  t,  show  that 

'  dx 


corresponding  formulas  for 


and 


dy 


26.  Given  f(x,y,z)  =  0  and  g(x,y,z)  =  0,  find  a  formula  for  dy/dx. 

27.  Given  u(x,y)  and  y(x,z),  show  that 


du\ 


dx  J  \dx  J 

/  z  \  y  i 


du\ 


—  1=1^1+1  —  11^ 

dx 


du 


dy 


dy 


0£\ 

dt)L 


)  and 


28.  Given  s(v,T)  and  v(p,T),  we  define  cp  =  T(ds/dT)p,  cv  =  T(ds/dT)v.  (The  c’s  are 
specific  heats  in  thermodynamics.)  Show  that 

ds\  fdv\ 


dvJT  \dT J  ' 


8.  APPLICATION  OF  PARTIAL  DIFFERENTIATION 
TO  MAXIMUM  AND  MINIMUM  PROBLEMS 

You  will  recall  that  derivatives  give  slopes  as  well  as  rates  and  that  you  find  max¬ 
imum  and  minimum  points  of  y  =  /( x)  by  setting  dy/dx  =  0.  Often  in  applied 
problems  we  want  to  find  maxima  or  minima  of  functions  of  more  than  one  variable. 
Think  of  z  =  f{x,y)  which  represents  a  surface.  If  there  is  a  maximum  point  on 
it  (like  the  top  of  a  hill),  then  the  curves  for  x  =  const,  and  y  =  const,  which  pass 
through  the  maximum  point  also  have  maxima  at  the  same  point.  That  is,  dz/dx 
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and  dz/dy  are  zero  at  the  maximum  point.  Recall  that  dy/dx  =  0  was  a  necessary 
condition  for  a  maximum  point  of  y  =  /(x),  but  not  sufficient;  the  point  might 
have  been  a  minimum  or  perhaps  a  point  of  inflection  with  a  horizontal  tangent. 
Something  similar  can  happen  for  z  =  /(x,y).  The  point  where  dz/dx  =  0  and 
dz/dy  =  0  may  be  a  maximum  point,  a  minimum  point,  or  neither.  (An  interesting 
example  of  neither  is  a  “saddle  point” — a  curve  from  front  to  back  on  a  saddle  has 
a  minimum;  one  from  side  to  side  has  a  maximum.  See  Figure  8.1.)  In  finding  max¬ 
ima  of  y  =  /(x),  it  is  sometimes  possible  to  tell  from 
the  geometry  or  physics  that  you  have  a  maximum. 

If  necessary  you  can  find  d2y/dx2\  if  it  is  negative, 
then  you  know  you  have  a  maximum  point.  There  is 
a  similar  (rather  complicated)  second  derivative  test 
for  functions  of  two  variables  (see  Problems  1  to  7), 
but  we  use  it  only  if  we  have  to;  usually  we  can  tell 
from  the  problem  whether  we  have  a  maximum,  a 
minimum,  or  neither.  Let  us  consider  some  examples 
of  maximum  or  minimum  problems. 


Example.  A  pup  tent  (Figure  8.2)  of  given  volume  V,  with  ends  but  no  floor,  is  to  be 
made  using  the  least  possible  material.  Find  the  proportions. 


Using  the  letters  indicated  in  the  figure,  we  find  the 
volume  V  and  the  area  A. 

1  9 

V  =  —  ■  2 w  ■  l  •  w  tan  0  =  w  l  tan  9, 

2 


V 


1 


A  =  2w2  tan  9  + 


2  Iw 


2  w 


cos  9  Figure  8.2 

Since  V  is  given,  only  two  of  the  three  variables  w,  l,  and  9  are  independent,  and 
we  must  eliminate  one  of  them  from  A  before  we  try  to  minimize  A.  Solving  the  V 
equation  for  l  and  substituting  into  A ,  we  get 


A  =  2 w2  tan  9  + 


2  w 


V 


n  2V 

=  2 w  tan  9  H - esc  9. 


cos  9  w2  tan  9  w 

We  now  have  A  as  a  function  of  two  independent  variables  w  and  9.  To  minimize 
A  we  find  dA/dw  and  dA/d9  and  set  them  equal  to  zero. 

d  A  2V  esc  9 

=  4  w  tan  9 - - —  =  0, 


dw 

dA 


2V 


„„  =  2w2  sec2  9  —  —  esc  9  cot  0  =  0. 
dO  w 

Solving  each  of  these  equations  for  w3  and  setting  the  results  equal,  we  get 
V  esc  9  V  esc  9  cot  9  cos  9  cos  9  cos2  9 


w3  = 


2  tan  9 


sec2  9 


or 


2  sin2  9 


sin2  9 


You  should  convince  yourself  that  neither  sin0  =  0  nor  cos  9  =  0  is  possible  (the 
tent  collapses  to  zero  volume  in  both  cases) .  Therefore  we  may  assume  sin  9^0  and 
cos  9  ^  0  and  cancel  these  factors,  getting  cos2  9  =  |  or  9  =  45°.  Then  tan0  =  1, 
V  =  w2 1,  and  from  the  dA/dw  equation  we  have  2w  =  l\J 2.  Then  the  height  of  the 
tent  (at  the  peak)  is  w  tan  9  =  w  =  \/\[2. 
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PROBLEMS,  SECTION  8 

1.  Use  the  Taylor  series  about  x  =  a  to  verify  the  familiar  “second  derivative  test”  for 
a  maximum  or  minimum  point.  That  is,  show  that  if  f'(a)  =  0,  then  f"(a)  >  0 
implies  a  minimum  point  at  x  —  a  and  f"  (a)  <  0  implies  a  maximum  point  at  x  =  a. 
Hint:  For  a  minimum  point,  say,  you  must  show  that  f(x)  >  f(a)  for  all  x  near 
enough  to  a. 

2.  Using  the  two-variable  Taylor  series  [say  (2.7)]  prove  the  following  “second  derivative 
tests”  for  maximum  or  minimum  points  of  functions  of  two  variables.  If  fx  =  fy  =  0 
at  (a,  b),  then 

(a,  b)  is  a  minimum  point  if  at  (a,  b),  fxx  >  0,  fvy  >  0,  and  fxxfyy  >  fxy; 

(a,  b)  is  a  maximum  point  if  at  (a,  b),  fxx  <  0,  fyy  <  0,  and  fxxfyy  >  fxy; 

( a,b )  is  neither  a  maximum  nor  a  minimum  point  if  fxxfyy  <  fxy.  (Note  that 
this  includes  fxxfyy  <  0,  that  is,  fxx  and  fyy  of  opposite  sign.) 

Hint:  Let  fxx  =  A,  fxy  =  B,  fyy  =  C\  then  the  second  derivative  terms  in  the  Taylor 
series  are  Ah 2  +  2Bhk  +  Ck2\  this  can  be  written  A(h  +  Bk/A)2  +  (C  —  B2 /A)k2 . 
Find  out  when  this  expression  is  positive  for  all  small  h,  k  [that  is,  all  (a:,  y )  near 
(a,  6)];  also  find  out  when  it  is  negative  for  all  small  h,  k,  and  when  it  has  both 
positive  and  negative  values  for  small  h,  k. 

Use  the  facts  stated  in  Problem  2  to  find  the  maximum  and  minimum  points  of  the 
functions  in  Problems  3  to  6. 


3.  x2  +  y2  +  2x  —  4y  +  10 

pr  .  .  2  1  2 

5.  4  +  a;-| -y  —  x  —  xy  —  —y 


4.  x2  —  y2  +  2x  —  4y  +  10 

6.  x3  —  y3  —  2  xy  +  2 


7.  Given  z  =  (y  —  x2)(y  —  2x2),  show  that  z  has  neither  a  maximum  nor  a  minimum 
at  (0,  0),  although  z  has  a  minimum  on  every  straight  line  through  (0,  0). 

8.  A  roof  gutter  is  to  be  made  from  a  long  strip  of  sheet  metal,  24  cm  wide,  by 
bending  up  equal  amounts  at  each  side  through  equal  angles.  Find  the  angle  and 
the  dimensions  that  will  make  the  carrying  capacity  of  the  gutter  as  large  as  possible. 


9.  An  aquarium  with  rectangular  sides  and  bottom  (and  no  top)  is  to  hold  5  gal.  Find 
its  proportions  so  that  it  will  use  the  least  amount  of  material. 

10.  Repeat  Problem  9  if  the  bottom  is  to  be  three  times  as  thick  as  the  sides. 

11.  Find  the  most  economical  proportions  for  a  tent  as  in  the  figure,  with  no  floor. 


12. 


Find  the  shortest  distance  from  the  origin  to  the  surface  z  =  xy  +  5. 
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13.  Given  particles  of  masses  m,  2m,  and  3m  at  the  points  (0, 1),  (1,0),  and  (2,3),  find 
the  point  P  about  which  their  total  moment  of  inertia  will  be  least.  (Recall  that 
to  find  the  moment  of  inertia  of  m  about  P,  you  multiply  m  by  the  square  of  its 
distance  from  P.) 

14.  Repeat  Problem  13  for  masses  mi,  m2,  m3  at  (*1,3/1),  (*2,3/2),  (*3,3/3).  Show  that 
the  point  you  find  is  the  center  of  mass. 

15.  Find  the  point  on  the  line  through  (1,0,0)  and  (0,1,0)  that  is  closest  to  the  line 
*  =  y  =  z.  Also  find  the  point  on  the  line  *  =  y  —  z  that  is  closest  to  the  line 
through  (1,0,0)  and  (0,1,0). 

16.  To  find  the  best  straight  line  fit  to  a  set  of  data  points  (*„,  yn)  in  the  “least  squares” 
sense  means  the  following:  Assume  that  the  equation  of  the  line  is  3/  =  m*  +  b  and 
verify  that  the  vertical  deviation  of  the  line  from  the  point  (x„,  yn)  is  yn  —  ( mx„  +  b ). 
Write  S  =  sum  of  the  squares  of  the  deviations,  substitute  the  given  values  of  *„,  yn 
to  give  S'  as  a  function  of  m  and  b,  and  then  find  m  and  b  to  minimize  S. 

Carry  through  this  routine  for  the  set  of  points:  (-1,-2),  (0,0),  (1,3).  Check 
your  results  by  computer,  and  also  computer  plot  (on  the  same  axes)  the  given 
points  and  the  approximating  line. 

17.  Repeat  Problem  16  for  each  of  the  following  sets  of  data  points. 

(a)  (1,0),  (2,-1),  (3,-8) 

(b)  (-2,  -6),  (-1,  -3),  (0,  0),  (1,  9/2),  (2,  7) 

(c)  (-2,  4),  (-1, 0),  (0,  -1),  (1,  -8),  (2,  -10) 

9.  MAXIMUM  AND  MINIMUM  PROBLEMS  WITH 
CONSTRAINTS;  LAGRANGE  MULTIPLIERS 

Some  examples  will  illustrate  these  methods. 

Example  1.  A  wire  is  bent  to  fit  the  curve  y  =  1  —  x2  (Figure  9.1).  A  string  is  stretched 
from  the  origin  to  a  point  (*,  y)  on  the  curve.  Find  (*,  y)  to  minimize  the  length  of 
the  string. 


We  want  to  minimize  the  distance  d  =  -\/x2  +  y2  from  the  origin  to  the  point 
(*,  y)\  this  is  equivalent  to  minimizing  /  =  d?  =  x2  +  y2 .  But  x  and  y  are  not  inde¬ 
pendent;  they  are  related  by  the  equation  of  the  curve.  This  extra  relation  between 
the  variables  is  what  we  mean  by  a  constraint.  Problems  involving  constraints  occur 
frequently  in  applications. 

There  are  several  ways  to  do  a  problem  like  this.  We  shall  discuss  the  following 
methods:  (a)  elimination,  (b)  implicit  differentiation,  (c)  Lagrange  multipliers. 
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(a)  Elimination  The  most  obvious  method  is  to  eliminate  y.  Then  we  want  to 
minimize 

/  =  x2  +  (1  -  x2)2  =  x2  +  1  -  2x2  +  x4  =  x4  -  x2  +  1. 

This  is  just  an  ordinary  calculus  problem: 

=  4a;3  —  2x  =  0,  x  =  0,  or  x=±\~. 
dx  V  2 

It  is  not  immediately  obvious  which  of  these  points  is  a  maximum  and  which  is  a 
minimum,  so  in  this  simple  problem  it  is  worth  while  to  find  the  second  derivative: 


dx2 


=  12x 2  -  2  = 


—2  at  x  =  0  (relative  maximum), 

4  at  x  =  dh  1/2  (  minimum) . 


The  minimum  we  wanted  then  occurs  at  x  =  iW  1/2,  y  =  1/2. 

(b)  Implicit  Differentiation  Suppose  it  had  not  been  possible  to  solve  for  y 
and  substitute;  we  could  still  do  the  problem.  From  /  =  x2  +  y2 ,  we  find 


(9.1) 


df  =  2x  dx  +  2y  dy  or  =  2x  +  2y  — — . 

dx  dx 


From  an  equation  like  y  =  1  —  x2  relating  x  and  y ,  we  could  find  dy  in  terms  of  dx 
even  if  the  equation  were  not  solvable  for  y.  Here  we  get 

dy  =  —2a;  dx. 


Eliminating  dy  from  df ,  we  have 


df 


df  =  ( 2x  —  4a :y)dx  or  -f-  =  2x  —  Axy. 

dx 

To  minimize  /,  we  set  df  / dx  =  0  (or  in  the  differential  notation  we  set  df  =  0  for 
arbitrary  dx).  This  gives 

2a;  —  Axy  =  0. 

This  equation  must  now  be  solved  simultaneously  with  the  equation  of  the  curve 
y  =  1  —  x2.  We  get  2x  —  4a;(l  —  x2)  =  0,  x  =  0  or  AzyJ  1/2  as  before. 

To  test  for  maxima  or  minima  we  need  d2f/dx2.  Differentiating  df/dx  in  (9.1) 
with  respect  to  x ,  we  get 


d2f  ( dy 

dx 2=  +2\fa 


o  fy 

2ydx2' 


At  x  =  0,  we  find  y  =  1,  dy/dx  =  0,  d2y/dx2  =  —2,  so 

d2f 


dx2 


=  2-4=  -2; 


this  is  a  maximum  point.  At  x  =  ±  ^1/2,  we  find 

1  dy  r-  d2y 

2/=o>  =  Tv2,  —  =  -2, 


dx 


dx2 
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d2  f 

—  =  2  +  4-  2  =  4; 

this  point  is  the  required  minimum.  Notice  particularly  here  that  you  could  do 
every  step  of  (b)  even  if  the  equation  of  the  curve  could  not  be  solved  for  y. 

We  can  do  problems  with  several  independent  variables  by  methods  similar  to 
those  we  have  just  used  in  Example  1.  Consider  this  problem. 

Example  2.  Find  the  shortest  distance  from  the  origin  to  the  plane  x  —  2y  —  2z  =  3. 

We  want  to  minimize  the  distance  d  =  \J  x2  +  y2  +  z2  from  the  origin  to  a  point 
(x,  y ,  z)  on  the  plane.  This  is  equivalent  to  minimizing  /  =  d2  =  x2  +  y2  +  z2  if 
x  —  2y  —  2z  =  3.  We  can  eliminate  one  variable,  say  x ,  from  /  using  the  equation 
of  the  plane.  Then  we  have 

/  =  (3  +  2  y  +  2  z)2  +  y2  +  z2 . 

Here  /  is  a  function  of  the  two  independent  variables  y  and  z,  so  to  minimize  /  we 
set  df/dy  =  0,  df/dz  =  0. 

df 

■7T-  =  2(3  +  2y  +  2z)  ■  2  +  2y  =  0, 
dy 

df 

-J-  =  2(3  +  2y  +  2z)  ■  2  +  2z  =  0. 
oz 

Solving  these  equations  for  y  and  z1  we  get  y  =  z  =  —  2/3,  so  from  the  equation  of 
the  plane  we  get  x  =  1/3.  Then 


It  is  clear  from  the  geometry  that  there  is  a  minimum  distance  from  the  origin  to 
a  plane;  therefore  this  is  it  without  a  second-derivative  test.  (Also  see  Chapter  3, 
Section  5  for  another  way  to  do  this  problem.) 

Problems  with  any  number  of  variables  can  be  done  this  way,  or  by  method  (b) 
if  the  equations  are  implicit. 

(c)  Lagrange  Multipliers  However,  methods  (a)  and  (b)  can  involve  an  enor¬ 
mous  amount  of  algebra.  We  can  shortcut  this  algebra  by  a  process  known  as  the 
method  of  Lagrange  multipliers  or  undetermined  multipliers.  We  want  to  consider 
a  problem  like  the  one  we  discussed  in  (a)  or  (b).  In  general,  we  want  to  find  the 
maximum  or  minimum  of  a  function  f(x,y),  where  x  and  y  are  related  by  an  equa¬ 
tion  y)  =  const.  Then  /  is  really  a  function  of  one  variable  (say  a;).  To  find  the 
maximum  or  minimum  points  of  /,  we  set  df/dx  =  0  or  df  =  0  as  in  (9.1).  Since 
<j>  =  const.,  we  get  d(f>  =  0. 
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In  method  (b)  we  solved  the  d<f>  equation  for  dy  in  terms  of  dx  and  substituted  it  into 
d/;  this  often  involves  messy  algebra.  Instead,  we  shall  multiply  the  dcf)  equation  by 
A  (this  is  the  undetermined  multiplier — we  shall  find  its  value  later)  and  add  it  to 
the  df  equation;  then  we  have 


(9.3) 


df  ,  ,d(j> 


dx  dx 


df  ,  , dd 


AT  +  )  dx+  \  TT.+  )  dV  ~  °- 


dy  dy 


We  now  pick  A  so  that 
(9.4) 


df  +  x^_ 


[That  is,  we  pick  A  =  —(df/dy)/(d<p/dy),  but  it  isn’t  necessary  to  write  it  in  this 
complicated  form!  In  fact,  this  is  exactly  the  point  of  the  Lagrange  multiplier  A; 
by  using  the  abbreviation  A  for  a  complicated  expression,  we  avoid  some  algebra.] 
Then  from  (9.3)  and  (9.4)  we  have 


(9.5) 


^  +  A^  =  0. 
dx  dx 


Equations  (9.4),  (9.5),  and  cj>(x,y)  =  const,  can  now  be  solved  for  the  three  un¬ 
knowns  x,  y ,  A.  We  don’t  actually  want  the  value  of  A,  but  often  the  algebra  is 
simpler  if  we  do  find  it  and  use  it  in  finding  x  and  y  which  we  do  want.  Note 
that  equations  (9.4)  and  (9.5)  are  exactly  the  equations  we  would  write  if  we  had  a 
function 


(9.6)  F{x,y)  =  f(x,y)  +  \(j>{x,y) 

of  two  independent  variables  x  and  y  and  we  wanted  to  find  its  maximum  and 
minimum  values.  Actually,  of  course,  x  and  y  are  not  independent;  they  are  related 
by  the  </>  equation.  However,  (9.6)  gives  us  a  simple  way  of  stating  and  remembering 
how  to  get  equations  (9.4)  and  (9.5).  Thus  we  can  state  the  method  of  Lagrange 
multipliers  in  the  following  way: 


To  find  the  maximum  or  minimum  values  of  f(x,y)  when  x  and  y 
are  related  by  the  equation  (j){x,y )  =  const.,  form  the  function 
(9.7)  F(x,y)  as  in  (9.6)  and  set  the  two  partial  derivatives  of  F  equal  to 
zero  [equations  (9.4)  and  (9.5)].  Then  solve  these  two  equations  and 
the  equation  (f>(x,y )  =  const,  for  the  three  unknowns  x,  y1  and  A. 


As  a  simple  illustration  of  the  method  we  shall  do  the  problem  of  Example  1  by 
Lagrange  multipliers.  Here 

f(x,y)  =  x2  +  y2,  <t>(x,y)  =  y  +  x2  =  1, 

and  we  write  the  equations  to  minimize 

F{x,  y)  =  f  +  \<p  =  x2  +  y2  +  A (y  +  x2), 
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namely 


(9.8) 


dF 

dx 

dF 

dy 


2x  +  A  •  2x  =  0, 
2  y  +  A  =  0. 


We  solve  these  simultaneously  with  the  <f>  equation  y  +  x2  =  1.  From  the  first 
equation  in  (9.8),  either  x  =  0  or  A  =  —1.  If  x  =  0,  y  =  1  from  the  equation  (and 
A  =  —  2).  If  A  =  —  1,  the  second  equation  gives  y  =  and  then  the  cj>  equation 
gives  x2  =  i.  These  are  the  same  values  we  had  before.  The  method  offers  nothing 
new  in  testing  whether  we  have  found  a  maximum  or  a  minimum,  so  we  shall  not 
repeat  that  work;  if  it  is  possible  to  see  from  the  geometry  or  the  physics  what  we 
have  found,  we  don’t  bother  to  test. 

Lagrange  multipliers  simplify  the  work  enormously  in  more  complicated  prob¬ 
lems.  Consider  this  problem. 


Example  3.  Find  the  volume  of  the  largest  rectangular  parallelepiped  (that  is,  box),  with 
edges  parallel  to  the  axes,  inscribed  in  the  ellipsoid 


=  1. 


Let  the  point  (x,  y ,  z)  be  the  corner  in  the  first  octant  where  the  box  touches  the 
ellipsoid.  Then  ( x,y,z )  satisfies  the  ellipsoid  equation  and  the  volume  of  the  box 
is  8 xyz  (since  there  are  8  octants).  Our  problem  is  to  maximize  f(x:y,z)  =  8xyz , 
where  x,  y,  z  are  related  by  the  ellipsoid  equation 


y2  , 


Hx,  V,z)  =  -2  ,  2  '  2 

az  tr  cz 

By  the  method  of  Lagrange  multipliers  we  write 


=  1. 


F(x,  y,  z)  =  f  +  \4>  =  8 xyz  +  A  (  —  +  —  + 


2  2 
y  * 


and  set  the  three  partial  derivatives  of  F  equal  to  0: 


dF  o  2x 

—  =  8yz  +  A  •  =  0, 

ox  az 

dF  o  2y  „ 

—  =  8 XZ  +  A  •  -pr  =  0, 

ay  bz 

dF  2z  „ 

— —  =  8xy  +  A  •  —  =  0. 
oz  & 


We  solve  these  three  equations  and  the  equation  (f>  =  1  simultaneously  for  x,  y , 
z,  and  A.  (Although  we  don’t  have  to  find  A,  it  may  be  simpler  to  find  it  first.) 
Multiply  the  first  equation  by  x,  the  second  by  y,  and  the  third  by  z,  and  add  to 
get 

fx2  v2  z2\ 

3  •  8 xyz  +  2A  ^  ^  J  =  0- 
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Using  the  equation  of  the  ellipsoid,  we  can  simplify  this  to 


24  xyz  +  2A  =  0  or  A  =  —12  xyz. 


Substituting  A  into  the  dF/dx  equation,  we  find  that 


2x 

8 yz  —  12 xyz  ■  — ~  =  0. 

az 


From  the  geometry  it  is  clear  that  the  corner  of  the  box  should  not  be  where  y  or 
z  is  equal  to  zero,  so  we  divide  by  yz  and  solve  for  x,  getting 


The  other  two  equations  could  be  solved  in  the  same  way.  However,  it  is  pretty 
clear  from  symmetry  that  the  solutions  will  be  y2  =  |b2  and  z2  =  |c2.  Then  the 
maximum  volume  is 

8  abc 

Sxvz  =  57T 

You  might  contrast  this  fairly  simple  algebra  with  what  would  be  involved  in 
method  (a).  There  you  would  have  to  solve  the  ellipsoid  equation  for,  say,  z,  substi¬ 
tute  this  into  the  volume  formula,  and  then  differentiate  the  square  root.  Even  by 
method  (b)  you  would  have  to  find  dz/dx  or  similar  expressions  from  the  ellipsoid 
equation. 

We  should  show  that  the  Lagrange  multiplier  method  is  justified  for  problems 
involving  several  independent  variables.  We  want  to  find  maximum  or  minimum 
values  of  f(x,  y,  z )  if  p(x,  y,  z)  =  const.  (You  might  note  at  each  step  that  the  proof 
could  easily  be  extended  to  more  variables.)  We  take  differentials  of  both  the  /  and 
the  p  equations.  Since  p  =  const.,  we  have  dp  =  0.  We  put  df  =  0  because  we  want 
maximum  and  minimum  values  of  /.  Thus  we  write 


(9.9) 


,,  df  df  df 

df  =  —  dx  +  —dy  +  —dz  =  0, 
ox  ay  oz 

, ,  dp  dp  dcj) 

d<p  =  -z~dx  +  -z~dy  +  — dz  =  0. 

dx  dy  dz 


We  could  find  dz  from  the  dip  equation  and  substitute  it  into  the  df  equation;  this 
corresponds  to  method  (b)  and  may  involve  complicated  algebra.  Instead,  we  form 
the  sum  F  =  /  +  \<p  and  find,  using  (9.9), 


(9.10)  dF  =  df  +  X  dp 

df  ,  dp 


dx  - 


df  dp 
w — P  A— - 
dy  dy 


dy  ■ 


df  .  dp 
-d~z+XJ~z 


dz. 


There  are  two  independent  variables  in  this  problem  (since  x,  y,  and  z  are  related 
by  p  =  const.).  Suppose  x  and  y  are  the  independent  ones;  then  z  is  determined 
from  the  p  equation.  Similarly,  dx  and  dy  may  have  any  values  we  choose,  and  dz 
is  determined.  Let  us  select  A  so  that 
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Then  from  (9.10),  for  dy  =  0,  we  get 


(9.12) 

and  for  dx  =  0  we  get 

(9.13) 


=  0 

OX  ox 


^l  +  x^  —  o 
d,  +  xa,-° ' 


We  can  state  a  rule  similar  to  (9.7)  for  obtaining  equations  (9.11),  (9.12),  and 

(9.13). 


To  find  the  maximum  and  minimum  values  of  f(x ,  y,  z )  if 
<p(a o,y,z)  =  const.,  we  form  the  function  F  =  f  +  \(j)  and  set  the 
three  partial  derivatives  of  F  equal  to  zero.  We  solve  these  equa¬ 
tions  and  the  equation  </>  =  const,  for  x,  y ,  z,  and  A.  (For  a  problem 
with  still  more  variables  there  are  more  equations,  but  no  change 
in  method.) 


It  is  interesting  to  consider  the  geometric  meaning  of  equations  (9.9)  to  (9.13). 
Recall  that  x,  y,  z  are  related  by  the  equation  (j>{x,y,z)  =  const.  We  might, 
for  example,  think  of  solving  the  (j>  equation  for  z  =  z(x,y).  Then  x  and  y  are 
independent  variables,  and  *  is  a  function  of  them.  Geometrically,  2  =  z{x,y)  is  a 
surface  as  in  Figure  3.2.  If  we  start  at  the  point  P  of  this  surface  (see  Figure  3.2)  and 
increase  x  by  dx,  y  by  dy,  and  z  by  dz  as  given  in  equation  (3.6),  we  are  at  a  point 
on  the  plane  tangent  to  the  surface  at  P.  That  is,  the  vector  dr  =  i dx  +  j dy  +  k dz 
(. PB  in  Figure  3.2)  lies  in  the  tangent  plane  of  the  surface.  Now  the  second  equation 
in  (9.9)  is  a  dot  product  [see  Chapter  3,  equation  (4.10)]  of  dr  with  the  vector 

.d<j)  ,d(j)  dcj) 

'dx  +J^  +  k9l 

(called  the  gradient  of  <f>  and  written  grad  </>;  see  Chapter  6,  Section  6ff.).  We  could 
write  the  second  equation  in  (9.9)  as  d<p  =  (grad  (f>)  •  dr  =  0.  Recall  [Chapter  3, 
equation  (4.12)]  that  if  the  dot  product  of  two  vectors  is  zero,  the  vectors  are 
perpendicular.  Thus  since  dr  lies  anywhere  in  the  plane  tangent  to  the  surface  <j>  = 
const.,  (9.9)  says  that  grad  <f)  is  perpendicular  to  this  plane,  or  perpendicular  to 
the  surface  <f>  =  const,  at  P.  The  first  of  equations  (9.9)  says  that  grad  /  is  also 
perpendicular  to  this  plane.  Thus  grad  <j>  and  grad  /  are  in  the  same  direction,  so 
their  components  are  proportional;  this  is  what  equations  (9.11),  (9.12),  and  (9.13) 
say.  We  can  also  say  that  the  surfaces  (j)  =  const,  and  /  =  const,  are  tangent  to  each 
other  at  P;  that  is,  they  have  the  same  tangent  plane  and  their  normals,  grad  <j> 
and  grad  /,  are  in  the  same  direction. 

We  can  also  use  the  method  of  Lagrange  multipliers  if  there  are  several  conditions 
(4>  equations).  Suppose  we  want  to  find  the  maximum  or  minimum  of  f(x,y,z,w) 
if  <f>i(x,y,  z,w)  =  const,  and  </>2 (x,y,z,w)  =  const.  There  are  two  independent 
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variables,  say  x  and  y.  We  write 


(9.15) 


,,  df  df  df  df 
df  =  —  dx  +  —  dy  +  —  dz  +  —  dw  =  0, 
ox  oy  oz  ow 

, ,  dpi  ,  ,  dpi  ,  dpi  ,  dpi 

dp  t  =  -5—  ax  +  — —  dy  +  — —  dz  +  — —  dw  =  0, 

ox  oy  oz  ow 

dh=?*idx+?tidy+?*id,+?*idw=l>, 

ox  oy  oz  Ow 


Again  we  could  use  the  dp  1  and  dtp 2  equations  to  eliminate  dz  and  dw  from  df 
(method  b),  but  the  algebra  is  forbidding!  Instead,  by  the  Lagrange  multiplier 
method  we  form  the  function  F  =  f  +  Xipi  +  X2P2  and  write,  using  (9.15), 


(9.16)  dF  =  df  +  Ai  dpi  +  X2  dp 2 

'df  dpx  dp2\ 

t: - r  M  — - h  A2  -7; — 

.  ox  ox  ox  J 


+  a2 


,  .'df  ,  .  dp2\ 

+  (  -7 - h  Ai  — - h  A2  -7 —  I  dz  ■ 

oz  oz  oz 


d£ 

dy  1  dy 

df  dpi 

o - ^  - ^ 

dw  dw 


dp2 

dy 

dp2\ 

dw  ) 


dy 


dw. 


We  determine  Ai  and  A2  from  the  two  equations 
(9.17) 


df  .  \  dpi  dp2  _ 

■5 - r  A 1  — - h  A2  —  —  U, 

dz  dz  dz 


Then  for  dy  =  0,  we  have 

(9.18) 

and  for  dx  =  0,  we  have 

(9.19) 


df  ,  A  dpi  dp2  _ 

■7; - r  M  — - h  A2  -7; —  —  U. 

ow  ow  ow 


df  ,  A  ,  A  d<f 2  _  n 

■7; r  Ai  — - h  A2  -7; —  —  U 

ox  ox  ox 


?t+\.^+\M=o. 

dy  dy  dy 


As  before,  we  can  remember  the  method  of  finding  (9.17),  (9.18),  and  (9.19)  by 
thinking: 


To  find  the  maximum  or  minimum  of  /  subject  to  the  conditions 
pi  =  const,  and  p2  =  const.,  define  F  =  f  +  Xipi  +  X2P2  and 
set  each  of  the  partial  derivatives  of  F  equal  to  zero.  Solve  these 
equations  and  the  p  equations  for  the  variables  and  the  A’s. 


Example  4.  Find  the  minimum  distance  from  the  origin  to  the  intersection  of  xy  =  6 
with  7x  +  24 2  =  0. 

We  are  to  minimize  x2  +  y2  +  z 2  subject  to  the  two  conditions  xy  =  6 
and  7x  +  24 z  =  0.  By  the  Lagrange  multiplier  method,  we  find  the 
three  partial  derivatives  of 

F  =  x2  +  y2  +  z2  +  Xi(7x  +  24  z)  +  X2  xy 
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and  set  each  of  them  equal  to  zero.  We  get 

2x  +  7Ai  +  A2  y  =  0, 

(9.21)  2y  +  X2X  =  0, 

2z  +  24Ai  =  0. 

These  equations  can  be  solved  with  xy  =  6  and  lx  +  242  =  0  to  get  (Problem  10) 
x  =  ±12/5,  y  =  ±5/2,  ^  =  =f7/10. 

Then  the  required  minimum  distance  is  (Problem  10) 

d  =  y/x1  +  y2  +  z 2  =  5/-\/2  =  3.54. 


PROBLEMS,  SECTION  9 


1.  What  proportions  will  maximize  the  area  shown  in  the  figure  (rectangle  with  isosceles 
triangles  at  its  ends)  if  the  perimeter  is  given? 


2. 


3. 

4. 


5. 


6. 

7. 

8. 

9. 


10. 


11. 


12. 


What  proportions  will  maximize  the  volume  of  a  projectile  in  the  form  of  a  circular 
cylinder  with  one  conical  end  and  one  flat  end,  if  the  surface  area  is  given? 


Find  the  largest  rectangular  parallelepiped  (box)  that  can  be  shipped  by  parcel  post 
(length  plus  girth  =  108  in). 


Find  the  largest  box  (with  faces  parallel  to  the  coordinate  axes)  that  can  be  inscribed 
in 
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Find  the  point  on  2x  +  3y  +  z  —  11  =  0  for  which  Ax2  +  y2  +  z2  is  a  minimum. 


A  box  has  three  of  its  faces  in  the  coordinate  planes  and  one  vertex  on  the  plane 
2x  +  3y  +  Az  =  6.  Find  the  maximum  volume  for  the  box. 


Repeat  Problem  6  if  the  plane  is  ax  +  by  +  cz  =  d. 

A  point  moves  in  the  ( x ,  y)  plane  on  the  line  2x  +  3y  —  A  =  0.  Where  will  it  be  when 
the  sum  of  the  squares  of  its  distances  from  (1,  0)  and  (—1,0)  is  smallest? 

Find  the  largest  triangle  that  can  be  inscribed  in  the  ellipse  (x2 /a2)  +  (y2 /b2)  =  1 
(assume  the  triangle  symmetric  about  one  axis  of  the  ellipse  with  one  side  perpen¬ 
dicular  to  this  axis). 


Complete  Example  4  above. 


Find  the  shortest  distance  from  the  origin  to  the  line  of  intersection  of  the  planes 
2x  +  y  —  z  =  1  and  x  —  y  +  z  =  2. 

Find  the  right  triangular  prism  of  given  volume  and  least  area  if  the  base  is  required 
to  be  a  right  triangle. 
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10.  ENDPOINT  OR  BOUNDARY  POINT  PROBLEMS 


So  far  we  have  been  assuming  that  if  there  is  a  maximum  or  minimum  point,  calculus 
will  find  it.  Some  simple  examples  (see  Figures  10.1  to  10.4)  show  that  this  may 
not  be  true.  Suppose,  in  a  given  problem,  x  can  have  values  only  between  0  and 
1;  this  sort  of  restriction  occurs  frequently  in  applications.  For  example  the  graph 
of  f(x)  =  2  —  x2  exists  for  all  real  x,  but  if  x  =  |cos0|,  0  real,  the  graph  has  no 
meaning  except  for  0  <  x  <  1.  As  another  example,  suppose  x  is  the  length  of 
a  rectangle  whose  perimeter  is  2;  then  x  <  0  is  meaningless  in  this  problem  since 
£  is  a  length,  and  x  >  1  is  impossible  because  the  perimeter  is  2.  Let  us  ask 
for  the  largest  and  smallest  values  of  each  of  the  functions  in  Figures  10.1  to  10.4 
for  0  <  x  <  1.  In  Figure  10.1,  calculus  will  give  us  the  minimum  point,  but  the 
maximum  of  f(x)  for  x  between  0  and  1  occurs  at  x  =  1  and  cannot  be  obtained 
by  calculus,  since  f'[x)  ^  0  there.  In  Figure  10.2,  both  the  maximum  and  the 
minimum  of  f{x)  are  at  endpoints,  the  maximum  at  x  =  0  and  the  minimum  at 
x  =  1.  In  Figure  10.3  a  relative  maximum  at  P  and  a  relative  minimum  at  Q  are 
given  by  calculus,  but  the  absolute  minimum  between  0  and  1  occurs  at  x  =  0, 
and  the  absolute  maximum  at  x  =  1.  Here  is  a  practical  example  of  this  sort  of 
function.  It  is  said  that  geographers  used  to  give  as  the  highest  point  in  Florida  the 
top  of  the  highest  hill;  then  it  was  found  that  the  highest  point  is  on  the  Alabama 
border!  [See  H.  A.  Thurston,  American  Mathematical  Monthly ,  vol.  68  (1961),  pp. 
650-652.  A  later  paper,  same  journal,  vol.  98,  (1991),  pp.  752-3,  reports  that  the 
high  point  is  actually  just  south  of  the  Alabama  border,  but  gives  another  example 
of  a  geographic  boundary  point  maximum.]  Figure  10.4  illustrates  another  way  in 
which  calculus  may  fail  to  give  us  a  desired  maximum  or  minimum  point;  here  the 
derivative  is  discontinuous  at  the  maximum  point. 


Figure  10.1  Figure  10.2 


These  are  difficulties  we  must  watch  out  for  whenever  there  is  any  restriction 
on  the  values  any  of  the  variables  may  take  (or  any  discontinuity  in  the  functions 
or  their  derivatives).  These  restrictions  are  not  usually  stated  in  so  many  words; 
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you  have  to  see  them  for  yourself.  For  example,  if  x2  +  y2  =  25,  x  and  y  are  both 
between  —5  and  +5.  If  y2  =  x2  —  1,  then  |x|  must  be  greater  than  or  equal  to 
1.  If  x  =  esc  9,  where  6  is  a  first-quadrant  angle,  then  x  >  1.  If  y  =  y/x,  y'  is 
discontinuous  at  the  origin. 

Example  1 .  A  piece  of  wire  40  cm  long  is  to  be  used  to  form  the  perimeters  of  a  square  and 
a  circle  in  such  a  way  as  to  make  the  total  area  (of  square  and  circle)  a  maximum. 
Call  the  radius  of  the  circle  r;  then  the  circumference  of  the  circle  is  2nr.  A  length 
40  —  2?rr  is  left  for  the  four  sides  of  the  square,  so  one  side  is  10  —  r.  The  total 

A  =  nr2  +  (10  —  b7Tr)2  ■ 


area  is 

Then 

dA 

dr 

If  dA/dr  =  0, 

we  get 

—j—  =  2nr  +  2(10  —  -nr) (—  ^n)  =  2nr  ^1  +  —  j  —  107T. 


r  =  2.6 


A  =  56 


Now  we  might  think  that  this  is  the  maximum  area.  But  let  us  apply  the  second 
derivative  test  to  see  whether  we  have  a  maximum.  We  find 


d2A 

dr2 


=  27T 


>0; 


we  have  found  the  minimum  area!  The  problem  asks  for  a  maximum.  One  way  to 
find  it  would  be  to  sketch  A  as  a  function  of  r  and  look  at  the  graph  to  see  where 
A  has  its  largest  value.  A  simpler  way  is  this.  A  is  a  continuous  function  of  r  with 
a  continuous  derivative.  If  there  were  an  interior  maximum  (that  is,  one  between 
r  =  0  and  2nr  =  40),  calculus  would  find  it.  Therefore  the  maximum  must  be  at 
one  end  or  the  other. 


At  r  =  0,  A  =  100. 

At  2nr  =  40,  r  =  20/-7T, 


A  =  400/tt=  127 +. 


We  see  that  A  takes  its  largest  value  at  r  =  20/7 r;  A  =  400/7T  =  127+  is  then  the 
desired  maximum.  It  corresponds  to  using  all  the  wire  to  make  a  circle;  the  side  of 
the  square  is  zero. 

A  similar  difficulty  can  arise  in  problems  with  more  variables. 

Example  2.  The  temperature  in  a  rectangular  plate  bounded  by  the  lines  x  =  0,  y  =  0, 
x  =  3,  and  y  =  5  is 

T  =  xy2  —  x2y  +  100. 

Find  the  hottest  and  coldest  points  of  the  plate. 

We  first  set  the  partial  derivatives  of  T  equal  to  zero  to  find  any  interior  maxima 
and  minima.  We  get 

dT 


dx=y  ~  2xy  =  0, 

dT  n  o  n 
—  =  2 xy  -x2  =  0. 
dy 


Section  10 


Endpoint  or  Boundary  Point  Problems  225 


The  only  solution  of  these  equations  is  x  =  y  =  0,  for  which  T  =  100. 

We  must  next  ask  whether  there  are  points  around  the  boundary  of  the  plate 
where  T  has  a  value  larger  or  smaller  than  100.  To  see  that  this  might  happen, 
think  of  a  graph  of  T  plotted  as  a  function  of  x  and  y;  this  is  a  surface  above  the 
(x,y)  plane.  The  mathematical  surface  does  not  have  to  stop  at  x  =  3  and  y  =  5, 
but  it  has  no  meaning  for  our  problem  beyond  these  values.  Just  as  for  the  curves  in 
Figures  10.1  to  10.4,  the  graph  of  the  temperature  may  be  increasing  or  decreasing 
as  we  cross  a  boundary;  calculus  will  not  then  give  us  a  zero  derivative  even  though 
the  temperature  at  the  boundary  may  be  larger  (or  smaller)  than  at  other  points 
of  the  plate.  Thus  we  must  consider  the  complete  boundary  of  the  plate  ( not  just 
the  corners!).  The  lines  x  =  0,  y  =  0,  x  =  3,  and  y  =  5  are  the  boundaries;  we 
consider  each  of  them  in  turn.  On  x  =  0  and  y  =  0  the  temperature  is  100.  On  the 
line  x  =  3,  we  have 

T  =  3y2  —  9y  +  100. 

We  can  use  calculus  to  see  whether  T  has  maxima  or  minima  as  a  function  of  y 
along  this  line.  We  have 


dT 

=  6y  -  9  =  0, 

dy 


y=  f> 


T  =  93±. 


Similarly,  along  the  line  y  =  5,  we  find 


T  =  25x~  5x2  +  100, 
dT 


dx 


=  25  -  lOx  =  0, 


x  =  - 
u-  2 , 


T  =  13l|. 


Finally,  we  must  find  T  at  the  corners. 


At  (0,0),  (0,5),  and  (3,0),  T  =  100. 

At  (3,5),  T  =  130. 

Putting  all  our  results  together,  we  see  that  the  hottest  point  is  (§,5)  with  T  = 
13l|,  and  the  coldest  point  is  (3,  |)  with  T  =  93^. 


Example  3.  Find  the  point  or  points  closest  to  the  origin  on  the  surfaces 

(10.1a)  x2  —  4yz  =  8, 

(10.1b)  22-a;2  =  l. 

We  want  to  minimize  /  =  x2  +  y2  +  z2  subject  to  a  condition  [(a)  or  (b)].  If  we 
eliminate  x2  in  each  case,  we  have 

(10.2a)  /  =  8  +  Ayz  +  y2  +  z27 

(10.2b)  f  =  z2 -1  +  y2  +  z2  =  2z2 +  y2 -1. 

In  both  problems  (a)  and  (b)  the  mathematical  function  f{y ,  z)  is  defined  for  all  y 
and  z.  For  our  problems ,  however,  this  is  not  true.  In  (a),  since  x2  >  0,  we  have 
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x2  =  8  +  4 yz  >  0  so  we  are  interested  in  minimum  values  of  f(y ,  z)  in  (a)  only  in 
the  region  yz  >  —2.  [Compare  Example  2  where  T{x,y)  was  of  interest  only  inside 
a  rectangle.]  Thus  we  look  for  “interior”  minima  in  (a)  satisfying  yz  >  —2;  then 
we  substitute  z  =  —2 /y  into  (10.2a)  and  find  any  minima  on  the  boundary  of  the 
region  of  interest.  In  (b),  since  x2  =  z2  —  1  >  0,  we  must  have  z2  >  1.  Again  we 
try  to  find  “interior”  minima  satisfying  z2  >  1;  then  we  set  z2  =  1  and  look  for 
boundary  minima.  We  now  carry  out  these  steps. 

From  (10.2a),  we  find 


(10.3a) 


df 

dy 

d£ 

dz 


z  +  2y  =  Q, 

> 


y  =  z  =  o. 


y  +  2z  =  0, 


These  values  satisfy  the  condition  yz  >  —2  and  so  give  points  inside  the  region  of 
interest.  We  find  from  (10.1a),  x2  =  8,  x  =  ±2-\/2;  the  points  are  (±2a/2, 0,0)  at 
distance  2\/2  from  the  origin.  Next  we  consider  the  boundary  x  =  0,  z  =  —2 /y; 
from  (10.2a), 


9  4 

/  =  0  +  y  H — 2 , 

y 2 

yA  =  4,  y  =  ±V2, 


df_ 

dy 


~2/y  =  tV2. 


Remembering  that  x  =  0,  we  have  the  points  (0,  y/2,  —V2)  and  (0,  —  \/2,  \/2)  at 
distance  2  from  the  origin.  Since  2  <  2y/2,  these  boundary  points  are  closest  to  the 
origin. 

(10.4a)  Answer  to  (a):  (0,  V2, -y/2),  (0,  -y/2,  y/2). 


From  (10.2b)  we  find 


(10.3b) 


d£ 

dy 


2y  =  0, 


ir  =  42  =  0' 

oz 


y  =  z  =  o. 


Since  z  =  0  does  not  satisfy  z2  >  1,  there  is  no  minimum  point  inside  the  region  of 
interest ,  so  we  look  at  the  boundary  z2  =  1.  From  (10.1b),  x  —  0,  and  from  (10.2b) 


f  =  y2  + 1,  y-  =  2y  =  0, 

dy 


y  =  o. 


Thus  we  find  the  points  (0,  0,  ±1)  at  distance  1  from  the  origin.  Since  the  geometry 
tells  us  that  there  must  be  a  point  or  points  closest  to  the  origin,  and  calculus 
tells  us  that  these  are  the  only  possible  minimum  points,  these  must  be  the  desired 
points. 


(10.4b)  Answer  to  (b):  (0,0,  ±1). 

In  both  these  problems,  we  could  have  avoided  having  to  consider  the  boundary 
of  the  region  of  interest  by  eliminating  2  to  obtain  /  as  a  function  of  x  and  y.  Since 
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x  and  y  are  allowed  by  (10.1a)  or  (10.1b)  to  take  any  values,  there  are  no  boundaries 
to  the  region  of  interest.  In  (b)  this  is  a  satisfactory  method;  in  (a)  the  algebra  is 
complicated.  In  both  problems,  Lagrange  multipliers  offer  a  more  routine  method. 
For  example,  in  (a)  we  write 

F  =  x2  +  y2  +  z2  +  X(x2  —  4  yz)\ 

x  =  0  or  A  =  — 1; 

if  A  =  —1,  y  =  z  =  0,  x2  =  8; 

if  x  =  0,  A  =  —  =  — ,  y2  =  z2  —  2. 

2z  2y  y 

We  obtain  the  same  results  as  above,  namely,  the  points  (±2\/2,  0,  0),  (0,  ±y/2,  =p \/2); 
the  points  (0,  y/2,  —y/2),  (0,  —  y/2,  \/2)  are  closer  to  the  origin  by  inspection.  Part  (b) 
can  be  done  similarly  (Problem  14). 

We  see  that  using  Lagrange  multipliers  may  simplify  maximum  and  minimum 
problems.  However,  the  Lagrange  multiplier  method  still  relies  on  calculus;  conse¬ 
quently,  it  can  work  only  if  the  maximum  and  minimum  can  be  found  by  calculus 
using  some  set  of  variables  (x  and  y,  not  y  and  z,  in  Example  3).  For  example, 
a  problem  in  which  the  maximum  or  minimum  occurs  at  endpoints  in  all  variables 
cannot  be  done  by  any  method  that  depends  on  setting  derivatives  equal  to  zero. 

Example  4.  Find  the  maximum  value  of  y  —  x  for  nonnegative  x  and  y  if  x2  +  y2  =  1. 

Here  we  must  have  both  x  and  y  between  0  and  1.  Then  the  values  y  =  1  and 
x  =  0  give  y  —  x  its  largest  value;  these  are  both  endpoint  values  which  cannot  be 
found  by  calculus. 

PROBLEMS,  SECTION  10 

1.  Find  the  shortest  distance  from  the  origin  to  x2  —  y2  =  1. 

2.  Find  the  largest  and  smallest  distances  from  the  origin  to  the  conic  whose  equation 
is  5x2  —  6 xy  +  5j/2  —  32  =  0  and  hence  determine  the  lengths  of  the  semiaxes  of  this 
conic. 

3.  Repeat  Problem  2  for  the  conic  6a;2  +  4 xy  +  3 y2  =  28. 

Find  the  shortest  distance  from  the  origin  to  each  of  the  following  quadric  surfaces.  Hint: 
See  Example  3  above. 

4.  3x2  +  y2  —  4xz  =  4. 

5.  2a2  +  6  xy  =  3. 

6.  4y2  +  2  z2  +  3a :y  =  18. 

7.  Find  the  largest  2  for  which  2a;  +  4y  =  5  and  a;2  +  z2  =  2y. 

8.  If  the  temperature  at  the  point  (a:,  y,  z)  is  T  =  xyz,  find  the  hottest  point  (or  points) 
on  the  surface  of  the  sphere  x2  +  y2  +  z2  =  12,  and  find  the  temperature  there. 

9.  The  temperature  T  of  the  disk  x2  +  y2  <  1  is  given  by  T  =  2x2  —  3y2  —  2x.  Find 
the  hottest  and  coldest  points  of  the  disk. 


—  -  2x(l  +  A)  -  0, 

TT-  =  2 y-  4Xz  =  0; 
dy 


OF 

dz 


=  2  z  —  4A  y  =  0; 
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10.  The  temperature  at  a  point  ( x ,  y ,  z)  in  the  ball  x2+y2+z2  <  1  is  given  by  T  =  y2+xz. 
Find  the  largest  and  smallest  values  which  T  takes 

(a)  on  the  circle  y  =  0,  x2  +  z2  =  1, 

(b)  on  the  surface  x2  +  y2  +  z2  =  1, 

(c)  in  the  whole  ball. 

11.  The  temperature  of  a  rectangular  plate  bounded  by  the  lines  x  =  ±1,  y  =  ±1,  is 
given  by  T  =  2x2  —  3y2  —  2x  +  10.  Find  the  hottest  and  coldest  points  of  the  plate. 

12.  Find  the  largest  and  smallest  values  of  the  sum  of  the  acute  angles  that  a  line 
through  the  origin  makes  with  the  three  coordinate  axes. 

13.  Find  the  largest  and  smallest  values  of  the  sum  of  the  acute  angles  that  a  line 
through  the  origin  makes  with  the  three  coordinate  planes. 

14.  Do  Example  3b  using  Lagrange  multipliers. 

11.  CHANGE  OF  VARIABLES 

One  important  use  of  partial  differentiation  is  in  making  changes  of  variables  (for 
example,  from  rectangular  to  polar  coordinates) .  This  may  give  a  simpler  expression 
or  a  simpler  differential  equation  or  one  more  suited  to  the  physical  problem  one  is 
doing.  For  example,  if  you  are  working  with  the  vibration  of  a  circular  membrane, 
or  the  flow  of  heat  in  a  circular  cylinder,  polar  coordinates  are  better;  for  a  problem 
about  sound  waves  in  a  room,  rectangular  coordinates  are  better.  Consider  the 
following  problems. 


Example  1.  Make  the  change  of  variables  r  =  x  +  vt,  s  =  x  —  vt  in  the  wave  equation 

d  2F  1  d2F 


(11.1) 


=  0, 


dx 2  v2  dt2 

and  solve  the  equation.  (Also  see  Chapter  13,  Sections  1,  4,  and  6.) 
We  use  the  equations 


(11.2) 


r  =  x  +  vt, 
s  =  x  —  vt, 


and  equations  like  (7.2)  to  find 

dF  dr  dF  ds  _dF  dF  _  /  d  d\ 

dr  dx  +  ds  dx  dr  +  ds  \<9r  ds)  ’ 

dF  dr  dFds  _  dF  dF  _  /  d  d\ 

dr  dt  +  ds  dt  V  dr  V  ds  V  \  dr  ds  J 

It  is  helpful  to  say  in  words  what  we  have  written  in  (11.3):  To  find  the  partial  of  a 
function  with  respect  to  x,  we  find  its  partial  with  respect  to  r  plus  its  partial  with 
respect  to  s;  to  find  the  partial  with  respect  to  t ,  we  find  the  partial  with  respect 
to  r  minus  the  partial  with  respect  to  s  and  multiply  by  the  constant  v.  It  is  useful 
to  write  this  in  operator  notation  (see  Chapter  3,  Section  7): 

d__  d_  d_  d_  _  (d_  d_  \ 

~  +  0?  dt^V[d^~d^)' 


(11.3) 


dF 

dx 

dF 

~dt 


(11.4) 
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Then  from  (11.3)  and  (11.4)  we  find 
(11.5) 

d2F  _  d  (dF\  _  /  d  d  \  / dF  dF\  _  d2F  d2F  d2F 

dx2  dx  \dx  J  \  dr  ds  J  \  dr  ds  )  dr 2  drds  ds2 

d2F  _  d_  (dF\  _  f  d_  d  \  f  dF  dF\  _  2  f  cpF_  d2F  d2F\ 

dt 2  dt  \  dt  J  V  \dr  ds )  \  dr  V  ds  )  V  \  dr2  drds  ds 2  ) 

Substitute  (11.5)  into  (11.1)  to  get 

nifii  d2p  1  d2p  -  i  d2p  -  n 

1  j  dx2  v 2  dt2  drds 

We  can  easily  solve  (11.6).  We  have 

d2F  _  d  (dF\  _ 
drds  dr  \  ds  J 

that  is,  the  r  derivative  of  dF/ds  is  zero.  Then  dF/ds  must  be  independent  of 
r,  so  dF/ds  =  some  function  of  s  alone.  We  integrate  with  respect  to  s  to  find 
F  =  f(s)+“ const.”;  the  “constant”  is  a  constant  as  far  as  s  is  concerned,  but  it 
may  be  any  function  of  r,  say  g(r),  since  ( d/ds)g(r )  =  0.  Thus  we  find  that  the 
solution  of  (11.6)  is 

(11.7)  F  =  f(s)  +  g(r). 

Then,  using  (11.2),  we  find  the  solution  of  (11.1): 

(11.8)  F  =  /( x  —  vt)  +  g(x  +  vt ), 

where  /  and  g  are  arbitrary  functions.  This  is  known  as  d’Alembert’s  solution  of 
the  wave  equation.  Also  see  Problem  7.23  and  Chapter  13,  Problem  1.2. 


Example  2.  Write  the  Laplace  equation 


(11.9) 


d2F  d2F 


dx2  dy2 
in  terms  of  polar  coordinates  r,  9 ,  where 

x  =  rcos0, 
y  =  rsinfl. 


=  0 


(11.10) 


Note  that  equations  (11.10)  give  the  old  variables  x  and  y  in  terms  of  the  new  ones, 
r  and  9,  whereas  (11-2)  gave  the  new  variables  r  and  s  in  terms  of  the  old  ones. 
In  this  situation,  there  are  several  ways  to  get  equations  like  (11.3).  One  way  is  to 
write 


dF 

dr 

dF 

~d9 


F  dx  dF  dy 


.  dF 


,  dF 


rT-  +  =  cos6»  —  +  sin  9  — — , 

>x  dr  dy  dr  dx  dy 


F  dx  dF  dy  .  dF 

h  d9  dy  d9  dx 


dF 

-  r  cos  9  — — , 
dy 


(11.11) 
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and  then  solve  (11.11)  for  dF/dx  and  dF/dy  (Problem  5).  Another  way  is  to  find 
the  needed  partial  derivatives  of  r  and  #  with  respect  to  x  and  y  [for  methods  and 
results,  see  Section  7,  Example  6,  equation  (7.16)  and  Problem  7.9]  and  then  write 
as  in  (11.3),  using  (7.16), 

dF  89  8F  sin  9  8F 

8F  89  „  8F 

=  sin  9  ■ 


(11.12) 


8F 

8x 

8F 

8y 


8F  dr 
dr  8x 
8F  dr 
dr  8y 


r  89  ' 
cos  9  8F 


89  dy 


dr 


89 


In  finding  the  second  derivatives,  it  will  be  convenient  to  use  the  abbreviations 
G  =  dF/dx  and  H  =  dF/dy.  Thus, 


(11.13) 


Then 


(11.14) 


G  = 


H  = 


dF 

dx 

dF 

dy 


d2F 

dx2 


8G 
dx ’ 


d2F 

dy2 


=  COS  I 


=  sin  9 


8H 

W 


8F 

dr 

dF 

dr 


sin  9  dF 
~r~~89’ 
cos  9  dF 
r  ~89' 


so 


d  2F  d2F 
dx2  dy2 


dG 

dx 


dH 
dy  ’ 


Now  equations  (11.12)  are  correct  for  any  function  F;  in  particular  they  are  correct 
if  we  replace  F  by  G  or  by  F[.  Let  us  replace  F  by  G  in  the  first  equation  (11.12) 
and  replace  F  by  FI  in  the  second  equation.  Then  we  have 


(11.15) 


8G  dG  sin  9  dG 

—  =  cos  9  — - 

dx  dr 


dH  .  fJH 
— —  =  sin  9  — — 

dy  dr 


r  89  ’ 
cos  9  dH 


r 


89 


Substituting  (11.15)  into  (11.14),  we  get 


(11.16) 


d2F  82F 
dx2  dy 2 


=  COS  I 


dG 

dr 


-  sin  i 


dH 

dr 


H —  cos 


dH 

~89 


—  smi 


dG 

89 


We  find  the  four  partial  derivatives  of  G  and  H  which  we  need  in  (11.16),  by 
differentiating  the  right-hand  sides  of  equations  (11.13). 


(11.17) 


dG 

dr 

dH 

dr 

dH 

~8Q 

8G 

~89 


=  cost 


=  sin  9 


d2F  sin#  82F  ^  sin  9  dF 
dr2 
d2F 


=  sin  ( 


=  cost 


dr2 
d2F 
89dr 
,  82F 


r  8rd9 
cos  9  d2F 
r 


cos  9  dF 


cos  9 


drd9 

r2 

89  ’ 

dF 

1  _J_ 

cos  9 

82F 

sin# 

8F 

dr 

r 

89 2 

r 

89  ’ 

dF 

sin# 

d2F 

cos  # 

8F 

dr 

r 

892 

r 

89  ' 

89dr 

We  combine  these  to  obtain  the  expressions  needed  in  (11.16): 

dG  „  dH  d2F 

—  +  sm  — —  =  - 

dr  dr 


cos  9 


(11.18) 


1 


dH 


-  cos  9  —  sin  9  ——  =  -  — — | — 


89 


dr2  ’ 

8G\_l(dF  1  d2F\ 
89  )  r  \  dr  r  892  ) 
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Finally,  substituting  (11.18)  into  (11.16)  gives 


d2F  d2F  d2F 


(11.19) 


dx2  dy2 


dr 2 
ld_ 
r  dr 


1  dF 
r  dr 
dF 
~dr 


1  d2F 
r2  d92 
1  d2F 
r2  d92 


We  next  discuss  a  simple  kind  of  change  of  variables  which  is  very  useful  in 
thermodynamics  and  mechanics.  This  process  is  sometimes  known  as  a  Legendre 
transformation.  Suppose  we  are  given  a  function  f(x,y);  then  we  can  write 


,,  df  df 
df  =  —dx  +  -—dy. 
dx  dy 


(11.20) 

Let  us  call  df/dx  =  p,  and  df  /dy  =  q\  then  we  have 
(11.21)  df  =  pdx  +  qdy. 

If  we  now  subtract  from  df  the  quantity  d(qy),  we  have 


(11.22) 

If  we  define  the  function  g  by 


df  ~  d(qy )  =  pdx  +  qdy  -  qdy  -  ydq 
d(f  ~  qy )  =  pdx -ydq. 


or 


(ii-23)  g  =  f-qy, 

then  by  (11.22) 

(11.24)  dg  =  pdx  —  ydq. 


Because  dx  and  dq  appear  in  (11.24),  it  is  convenient  to  think  of  g  as  a  function  of 
x  and  q.  The  partial  derivatives  of  g  are  then  of  simple  form,  namely, 


(11.25) 


dg 

dq 


=  -y- 


Similarly,  we  could  replace  the  p  dx  term  in  df  by  —  x  dp  by  considering  the  function 
/  —  xp.  This  sort  of  change  of  independent  variables  is  called  a  Legendre  transfor¬ 
mation.  (For  applications,  see  Problems  10  to  13.)  For  a  discussion  of  Legendre 
transformations,  see  Callen,  Chapter  5. 

From  the  equations  above,  we  can  find  useful  relations  between  partial  deriva¬ 
tives.  For  example,  from  equations  (11.24)  and  (11.25)  we  can  write 


(11.26) 


d2g  =  ( dp\  .  ,  d2g 

dqdx  \dq)x  “  dxdq 


Assuming 


d2g 

dqdx 


d2g 

dxdq 


(reciprocity  relations,  see  end  of  Section  1),  then  we  have 


(11.27) 


(dp\  =_(dy\ 

\dqjx  \dx)q 


Many  equations  like  these  appear  in  thermodynamics  (see  Problems  12  and  13). 
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PROBLEMS,  SECTION  11 


1.  In  the  partial  differential  equation 

d2z  d2z  d2z 

dx 2  dxdy  cty2 

put  s  =  y  +  2x,  t  =  y  +  3x  and  show  that  the  equation  becomes  d2z/dsdt  =  0. 
Following  the  method  of  solving  (11.6),  solve  the  equation. 

2.  As  in  Problem  1,  solve 

d2z  d2z  d2z 

dx2  dxdy  dy2 

by  making  the  change  of  variables  u  =  5x  —  2y,  v  =  2x  +  y. 

3.  Suppose  that  w  =  f(x,y)  satisfies 

d2w  d2w  _ 
dx2  dy2 

Put  x  =  u  +  v,  y  =  u  —  v,  and  show  that  w  satisfies  d2w/dudv  =  1.  Hence  solve  the 
equation. 

4.  Verify  the  chain  rule  formulas 


dF  _  (FFdr_  dF  dd 
dx  dr  dx  +  dd  dx' 

and  similar  formulas  for 

dF  dF  dF 

dy  ’  dr  ’  dd  ’ 

using  differentials.  For  example,  write 


dF 


dF  ,  dF 

frdr+^dd9 


and  substitute  for  dr  and  d6\ 


dr  dr 

dr  =  — dx  +  —dy 
dx  dy 


(and  similarly  dd). 


Collect  coefficients  of  dx  and  dy,  these  are  the  values  of  dF/dx  and  dF/dy. 


5. 

6. 


Solve  equations  (11.11)  to  get  equations  (11.12). 


Reduce  the  equation 


5y  =  0 


to  a  differential  equation  with  constant  coefficients  in  d2y/dz2,  dy/dz,  and  y  by  the 
change  of  variable  x  =  ez .  (See  Chapter  8,  Section  7d.) 


7.  Change  the  independent  variable  from  x  to  d  by  x  =  cos#  and  show  that  the 
Legendre  equation 

becomes 

d2V  ,  n  dV 
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8.  Change  the  independent  variable  from  xto«=  2 yfx  in  the  Bessel  equation 


>.d2y 


dy 


X  d^+Xd^-{1~X)y  =  0 


and  show  that  the  equation  becomes 

d2y  dy 


U~d^+U^  +  (U  ~^y  =  °- 


9.  If  x  =  es  cost,  y  =  es  sint,  show  that 

d2u  c^u  _  —2s 
dx2  dy 2 


d2u  d2u 
+  ~dF 


10. 


11. 


12. 


13. 


Given  du  =  T  ds  —  pdv,  find  a  Legendre  transformation  giving 

(a)  a  function  f(T,v); 

(b)  a  function  h(s,p); 

(c)  a  function  g(T,p). 

Hint  for  (c):  Perform  a  Legendre  transformation  on  both  terms  in  du. 

Given  L(q,q)  such  that  dL  =  pdq  +  pdq,  find  H(p,q)  so  that  dH  =  q  dp  —  pdq. 
Comments:  L  and  H  are  functions  used  in  mechanics  called  the  Lagrangian  and 
the  Hamiltonian.  The  quantities  q  and  p  are  actually  time  derivatives  of  p  and  q, 
but  you  make  no  use  of  the  fact  in  this  problem.  Treat  p  and  q  as  if  they  were 
two  more  variables  having  nothing  to  do  with  p  and  q.  Hint:  Use  a  Legendre 
transformation.  On  your  first  try  you  will  probably  get  —H.  Look  at  the  text 
discussion  of  Legendre  transformations  and  satisfy  yourself  that  g  =  qy  —  f  would 
have  been  just  as  satisfactory  as  g  =  f  —  qy  in  (11.23). 

Using  du  in  Problem  10,  and  the  text  method  of  obtaining  (11.27),  show  that 
=  _  (^0  (This  is  one  of  the  Maxwell  relations  in  thermodynamics.) 

As  in  Problem  12,  find  three  more  Maxwell  relations  by  using  your  results  in  Problem 
10,  parts  (a),  (b),  (c). 


12.  DIFFERENTIATION  OF  INTEGRALS;  LEIBNIZ'  RULE 

According  to  the  definition  of  an  integral  as  an  antiderivative,  if 

dF{x) 


(12.1) 

then 

(12.2) 


fix)  = 


fit)  dt  =  F(t) 


dx 


=  F(x)-F(a), 


where  a  is  a  constant.  If  we  differentiate  (12.2)  with  respect  to  x ,  we  have 

(12.3)  fj-  f  m  dt  =  ff-[F(x)  F(a )]  =  ^  =  f(x) 

ax  J  a  dx  dx 

by  (12.1).  Similarly, 


so 

(12.4) 


f(t)dt  =  F(a)  -  F(x), 


d  fa  ,,  ,  ,  dF(x) 

tel  '(‘’‘“—ST  =  -'<*>■ 
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Example  1.  Find 


dx 


sin  t  dt. 


7t/4 


By  (12.3),  we  find  immediately  that  the  answer  is  sin  a;.  We  can  check  this  by 
finding  the  integral  and  then  differentiating.  We  get 


r. 

J  7T /4 


sin  tdt  =  —  cos  t 


r/4 


=  —  COS  X  +  i  V2 


and  the  derivative  of  this  is  sin  a;  as  before. 

By  replacing  x  in  (12.3)  by  v,  and  replacing  x  in  (12.4)  by  u,  we  can  then  write 

d_ 

dv  . 

and 
(12.6) 


(12.5) 


f(t)  dt  =  f(v) 


y-  /  f(t)  dt  =  —/(«)• 
du  Ju 

Suppose  u  and  v  are  functions  of  x  and  we  want  dl / dx  where 

1=  f  m  dt. 

J  u 

When  the  integral  is  evaluated,  the  answer  depends  on  the  limits  u  and  v.  Finding 
dl / dx  is  then  a  partial  differentiation  problem;  /  is  a  function  of  u  and  v,  which 
are  functions  of  x.  We  can  write 


(12.7) 


dl  dl  du  dl  dv 
dx  du  dx  dv  dx 


But  dl/dv  means  to  differentiate  I  with  respect  to  v  when  u  is  a  constant;  this  is 
just  (12.5),  so  dl /dv  =  f(v).  Similarly,  dl /du  means  that  v  is  constant  and  we  can 
use  (12.6)  to  get  dl /du  =  — f(u ).  Then  we  have 


*1/3 


Example  2.  Find  dl/dx  if  I  =  /  t2  dt. 

Jo 

By  (12.8)  we  get 

§  =  <*'/3>l<*1/3>=*2/ 3-k2/34 

We  could  also  integrate  first  and  then  differentiate  with  respect  to  x: 


rl/3 


1  = 


t2dt  = 


rV3 


X 

3’ 


dJ 

dx 


This  last  method  seems  so  simple  you  may  wonder  why  we  need  (12.8).  Look 
at  another  example. 
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Example3.  Find  d//dx  if 


1  = 


sin  t 


dt. 


Here  the  indefinite  integral  cannot  be  evaluated  in  terms  of  elementary  functions; 
however,  we  can  find  dl/dx  by  using  (12.8).  We  get 

dl  sin(sin_1  x)  1  sinx2 

~  —  '  23? 

dx  sin  x  vl  —  x2  x 

x  2.2 

=  .  - — : - sin  x  . 

Vl  —  x2  sin  x  x 


Finally,  we  may  want  to  find  dl/dx  when  I  =  jb  f(x,t)  dt,  where  a  and  b  are 
constants.  Under  not  too  restrictive  conditions, 


(12.9) 


d_ 

dx 


f(x,t)  dt 


fb  df(x,  t) 

Ja  9X 


that  is,  we  can  differentiate  under  the  integral  sign.  [A  set  of  sufficient  conditions 
for  this  to  be  correct  would  be  that  fa  f(x,t)dt  exists,  df/dx  is  continuous  and 

\df(x,t)/dx\  <  g(t):  where  Jb  g{t)  dt  exists.  For  most  practical  purposes  this  means 
that  if  both  integrals  in  (12.9)  exist,  then  (12.9)  is  correct.]  Equation  (12.9)  is  often 
useful  in  evaluating  definite  integrals. 


Example  4.  Find  /0°°  tne~kt  dt  for  odd  n,  k  >  0. 

First  we  evaluate  the  integral 

poo  1 

1=  te~kt2dt=-—e 

,/n  2  k 


1 

2k' 


r°°  9  1 

+2.-kt2  1 


Now  we  calculate  successive  derivatives  of  I  with  respect  to  k. 

) 

t3e~kt~dt  = 

Repeating  the  differentiation  with  respect  to  k ,  we  get 


dl 

dk 


—tte  dt  =  — r  or 
2  k2 


00  9  1 

.3 „-kt2  -■  L 


2k2' 


1 

k3 

3 

fc4  ’ 


Continuing  in  this  way  (Problem  17),  we  can  find  the  integral  of  any  odd  power  of 
t  times  e~kt  : 


t2t3e~ 

~kt2dt  = 

2 

or 

pOO 

/  t5e~kt2 

"2P 

Jo 

t2t5e~ 

~kt2dt  = 

3 

or 

/»oo 

[  t7e~kt2 

fc4 

Jo 

(12.10) 


t 


2n+l  —  kt* 


dt  = 


nl 


2 kn+1 ' 


Your  computer  may  give  you  this  result  in  terms  of  the  gamma  function  (see  Chapter 
11,  Sections  1  to  5).  The  relation  is  n!  =  T(n  +  1). 
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Example  5.  Evaluate 


(12.11) 


r  f1  t°-l  , 


a  >  —1. 


First  we  differentiate  /  with  respect  to  a,  and  evaluate  the  resulting  integral. 


dl  f1  talnt  f1  n  ,  ta+1 

dt=  t  dt  = 


da  J  q  In  t 


a  1 


a  1 


Now  we  integrate  dl / da  with  respect  to  a  to  get  I  back  again  (plus  an  integration 
constant): 


(12.12) 


1  = 


da 

a  -\- 1 


=  ln(a  +  1)  +  C. 


If  a  =  0,  (12.11)  gives  1  =  0  and  (12.12)  gives  I  =  C,  so  C  =  0  and  we  have  from 
(12.12),  1  =  ln(a+  1). 

It  is  convenient  to  collect  formulas  (12.8)  and  (12.9)  into  one  formula  known  as 
Leibniz’  rule: 


(12.13) 


d_ 

dx 


(IV 

f(x,  t)  dt  =  f(x,v)  — 
ax 


dx 


Example  6.  Find  dl/dx  if 


By  (12.13)  we  get 


r2x  Pxt 

/=,  —dt. 


dJ_ 

dx 


2x 


■2- 


1  + 


r2x  text 


■  dt 


X 


2x 


l(e2x2  -ex2)  + 

X 

i(e2x2 -ex2 +e2x2 -ex2)  =  -(e2x2 
x  x 


Although  you  can  do  problems  like  this  by  computer,  in  many  cases  you  can  just 
write  down  the  answer  using  (12.13)  in  less  time  than  it  takes  to  type  the  problem 
into  the  computer. 


PROBLEMS,  SECTION  12 

f'y/x 


2.  If  s  = 

zero. 


/■ 

</o 

nv 

J  u 


1.  If  y  =  I  sin  t  at,  find  dy/dx. 
1  -e* 


dt,  find  ds/dv  and  ds/du  and  also  their  limits  as  u  and  v  tend  to 
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3. 

4. 


5. 


6. 

7. 

8. 

9. 


10. 


11. 


12. 

13. 

14. 


15. 


16. 


17. 


If  z  = 


r 

J  sir 


sint 


dz 


dt,  find  ,  . 
t  dx 


Use  L’Hopital’s  rule  to  evaluate  lim 

x^2  x  — 


Ifu  = 


L 


i  r  si 

-2  J2 

d 

r\  5  r\  5  ““U 

Ox  oy  ox 


sint 


dt. 


y~X  dt ,  find  du  and  = 


Hint:  Use  differentials. 
If  w  = 


I 

J  X 


2x  3y  du  dw  dw  ,dy 

,  find  — — :  and  —  at  x  =  3,  y  =  1. 


In  u 


dx’  dy 


rv 

L 


If  I  e  dt  =  x  and  uv  =  y,  find 
If  /  e~s  ds  =  u,  find 


dx 

du\  f  du 


dx 


dy 


and 


at  u  =  2,  v  =  0. 


L 

/*7T 

If  V=  sin  xtdt,  find  dy/dx  (a)  by  evaluating  the  integral  and  then  differenti- 

J  o 

ating,  (b)  by  differentiating  hrst  and  then  evaluating  the  integral. 


Find  dy/dx  explicitly  if  y  = 


/’ 

Jo 


exu  _  1 


du. 


Find 


Find 


Find 


( x  —  t)  dt  by  evaluating  the  integral  first,  and  by  differentiating  first. 


du 


d_  r 
dx  J3_ 

d_  r 

dx  Jx  ln(a:  +  u) ' 

d  f2^x  sin  it  , 
dx  Jl/x  t 


Given  that 


/" 

Jo 


dx  =  —}  differentiate  with  respect  to  y  and  so  evaluate 


y2  +  x2  2y 


f 

Jo 


dx 


(■ y 2  +  x2)2 


Given  that 


f 

Jo 


k 


a2  +  fc2  ’ 


differentiate  with  respect  to  a  to  show  that 


/" 


—ax  •  7  7 

xe  sin  kx  dx  = 


2  ka 


(a2  +  k2)2 


and  differentiate  with  respect  to  k  to  show  that 


f 

Jo 


—  ax  i  7 

xe  cos  kx  dx  = 


2  7  2 

a  —  k 
(a2  +  k2)2 ' 


In  kinetic  theory  we  have  to  evaluate  integrals  of  the  form  I  =  I  tne  at  dt.  Given 


that 


/” 


f 

Jo 


2  2 

e~at  dt  =  —  \J 7r/a  ,  evaluate  /  for  n  =  2,  4,  6,  •  •  • ,  2m. 


Complete  Example  4  to  obtain  (12.10). 
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18.  Show  that  u 


(*,!/)  =  £  f 

TT  i-c 

/' 

Jo 


f(t)  dt 


satisfies  uxx  +  uvv  =  0. 


l-oo  (x-t)2  +y2 
19.  Show  that  y  =  I  /( u)  sin(a;  —  u)  du  satisfies  y"  +  y  =  f(x). 


PX 

20.  (a)  Show  that  y=  f(x  —  t)  dt  satisfies  ( dy/dx )  =  f(x).  (Hint:  It  is  helpful  to 

Jo 

make  the  change  of  variable  x  —  t  =  u  in  the  integral.) 

PX 

(b)  Show  that  y=  (x  —  u)f(u)  du  satisfies  y"  =  f(x). 

Jo 

1  rx 

(c)  Show  that  y  =  - jyy  J  (x  —  u)71-1  / (u)  du  satisfies  y ^  =  /(*)• 

13.  MISCELLANEOUS  PROBLEMS 

1.  A  function  f(x,  y,  z )  is  called  homogeneous  of  degree  n  if  f(tx,  ty,  tz)  =  tnf(x,  y,  z). 
For  example,  z2\n(x/y)  is  homogeneous  of  degree  2  since 

(tz)2  In  —  =  t2  (z2  In-)  . 


ty 


y 


Euler’s  theorem  on  homogeneous  functions  says  that  if  /  is  homogeneous 
of  degree  n,  then 

df  df  df 

+  yir  +  =  nf- 

ox  ay  oz 

Prove  this  theorem.  Hints:  Differentiate  f(tx,tyHz )  =  tnf(x,y,z)  with  respect  to 
t,  and  then  let  t  =  1.  It  is  convenient  to  call  df/d(tx)  =  /i  (that  is,  the  partial 
derivative  of  /  with  respect  to  its  first  variable),  fa  =  df  /d(ty),  and  so  on.  Or,  you 
can  at  first  call  tx  =  u,  ty  =  v,  tz  =  w.  (Both  the  definition  and  the  theorem  can 
be  extended  to  any  number  of  variables.) 

2.  (a)  Given  the  point  (2, 1)  in  the  (x,y)  plane  and  the  line  3x  +  2y  =  4,  find  the 

distance  from  the  point  to  the  line  by  using  the  method  of  Chapter  3,  Section  5. 

(b)  Solve  part  (a)  by  writing  a  formula  for  the  distance  from  (2, 1)  to  ( x ,  y)  and 
minimizing  the  distance  (use  Lagrange  multipliers). 

(c)  Derive  the  formula 

axo  +  byo  —  c 


D  = 


\/a2  +  b2 

for  the  distance  from  (xo,  yo)  to  ax  +  by  =  c  by  the  methods  suggested  in  parts 
(a)  and  (b). 

In  Problems  3  to  6,  assume  that  x,  y  and  r,  6  are  rectangular  and  polar  coordinates. 


3.  Find 


d2y 
dxdd ' 


4.  Find 


d2r 

dddy' 


5.  Given  z  =  y  -  2x  ,  find  (—  ,  \  ttt 


dz 


dx 


dz 


d6 


d2z 
dxdd ' 


6.  If  z  =  r2  —  x2,  find 


d2z  (dz 
dr d6 ’  I  dx 


7.  About  how  much  (in  percent)  does  an  error  of  1%  in  x  and  y  affect  x3y2l 


Section  13 


Miscellaneous  problems  239 


8. 


9. 

10. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 


19. 

20. 
21. 

22. 

23. 

25. 

27. 

28. 


Assume  that  the  earth  is  a  perfect  sphere.  Suppose  that  a  rope  lies  along  the  equator 
with  its  ends  fastened  so  that  it  fits  exactly.  Now  let  the  rope  be  made  2  ft  longer, 
and  let  it  be  held  up  the  same  distance  above  the  surface  of  the  Earth  at  all  points 
of  the  equator.  About  how  high  up  is  it?  (For  example,  could  you  crawl  under? 
Could  a  fly?)  Answer  the  same  questions  for  the  moon. 


If  z  =  xy  and  <  2  2  ’  find  dz/dt. 


2x3  +  2  y3  =  3 t2 
3x2  +  3y2  =  Qt, 

If  w  =  (r  cos  6)r  sln  e ,  find  dw/d8. 

y2  dy  d2y 

If  —  +  —  =  1,  find  —  and  — —  by  implicit  differentiation. 
a 1  bz  dx  dx 2 

Given  z  =  r2  +  s2  +  rst,  r4  +  s4  +  t4  =  2 r2s2t2  +  10,  find  ( dz/dr)t  when  r  =  2, 
s  =  t  =  1. 


I  2 t  +  ex  =  s  —  cos  y  —  2, 
Given  < 

I  2s  —  t  =  sin  y  +  x  —  1, 


find 


at  ( x,y,s,t ) 


(0,  7t/2,  —1,  —2). 


If  w  =  f(x,s,t),  s  =  2x  +  y,  t  —  2x  —  y,  find  ( dw/dx)y  in  terms  of  /  and  its 
derivatives. 


If  w  =  f(x,x 2  +  y2,2xy),  find  ( dw/dx)y  (compare  Problem  14). 

-rr  1  „  ( y\  .  dz  dz 

ft  z  =  —  f  [  —  ) ,  prove  that  x— — b  y— — z  =  0. 
x  \  x  /  ox  ay 

Find  the  shortest  distance  from  the  origin  to  the  surface  x  =  yz  +  10. 

Find  the  shortest  distance  from  the  origin  to  the  line  of  intersection  of  the  planes 


2x  —  3y  +  z  =  5, 
3x  —  y  —  2z  =  11, 


(a)  using  vector  methods  (see  Chapter  3,  Section  5); 

(b)  using  Lagrange  multipliers. 

Find  by  the  Lagrange  multiplier  method  the  largest  value  of  the  product  of  three 
positive  numbers  if  their  sum  is  1. 

Find  the  largest  and  smallest  values  of  y  =  4x3  +  9x2  —  12x  +  3  if  x  =  cosd. 

Find  the  hottest  and  coldest  points  on  a  bar  of  length  5  if  T  =  4x  —  x2 ,  where  x  is 
the  distance  measured  from  the  left  end. 

Find  the  hottest  and  coldest  points  of  the  region  y2  <  x  <  5  if  T  =  x2  —  y2  —  3x. 


Find 


d_  r 

dt  Jq 


■  dx. 


24. 


Find 

dx 


cosh  xt 
t 


dt. 


Find  4- 
dx 


26. 


Find  — 
dx 


sin  xt 
t 


dt. 


j  nsinx 

Show  that  —  /  \/l  —  £2  dt  =  1. 

dx  J  cos  x 


In  discussing  the  velocity  distribution  of  molecules  of  an  ideal  gas,  a  function 
F(x,y,  z)  =  f(x)f(y)f(z)  is  needed  such  that  d(lnF’)  =  0  when  4>  =  x2  +  y2  +  z2  — 
const.  Then  by  the  Lagrange  multiplier  method  d(\nF  +  A (f>)  =  0.  Use  this  to  show 


that 


F(x,y,z)  =  Ae 


(\/2)(x2  +  y2+z2) 
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29.  The  time  dependent  temperature  at  a  point  of  a  long  bar  is  given  by 


T{t)  =  100° 


When  t  =  64,  T  =  15.73°.  Use  differentials  to  estimate  how  long  it  will  be  until 
T  =  17°. 

d2  r  rx 

30.  Evaluate  — /  /  f(s,  t )  ds  dt. 

dx  J o  J o 
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Multiple  Integrals; 
Applications  of  Integration 


1.  INTRODUCTION 

In  calculus  and  elementary  physics,  you  have  seen  a  number  of  uses  for  integration 
such  as  finding  area,  volume,  mass,  moment  of  inertia,  and  so  on.  In  this  chapter  we 
want  to  consider  these  and  other  applications  of  both  single  and  multiple  integrals. 
We  shall  discuss  both  how  to  set  up  integrals  to  represent  physical  quantities  and 
methods  of  evaluating  them.  In  later  chapters  we  will  need  to  use  both  single  and 
multiple  integrals. 

Computers  and  integral  tables  are  very  useful  in  evaluating  integrals.  But  to  use 
these  tools  efficiently,  you  need  to  understand  the  notation  and  meaning  of  integrals 
which  we  will  discuss  in  this  chapter.  There  is  another  important  point  here.  A 
computer  will  give  you  an  answer  for  a  definite  integral,  but  an  indefinite  integral 
has  many  possible  answers  (differing  from  each  other  by  a  constant  of  integration), 
and  your  computer  or  integral  tables  may  not  give  you  the  form  you  need.  (See 
problems  below.)  If  this  happens,  here  are  some  ideas  you  can  try: 

(a)  Look  in  other  integral  tables,  or  try  to  induce  your  computer  to  change  the 
form. 

(b)  See  if  some  algebra  will  give  the  form  you  want  (see  Problem  1  below;  also  see 
Chapter  2,  Section  15,  Example  2). 

(c)  A  simple  substitution  may  give  the  desired  result  (see  Problem  2  below). 

(d)  To  check  a  claimed  answer,  differentiate  it  (by  hand  or  computer)  to  see  whether 
you  get  the  integrand. 

PROBLEMS,  SECTION  1 


Verify  each  of  the  following  answers  for  an  indefinite  integral  by  one  or  more  of  the  methods 
suggested  above. 
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1. 


2. 


3. 


4. 


5. 


J  2  sin  9  cos  9  dd  —  sin2  9  or  —  cos2  9  or  —  icos26l.  Hint :  Use  trig  identities. 


dx 


\/x2  +  ( 


■  sinh  or  In  (^x  +  \J  x2  +  a2  ^  .  ffinf:  To  find  the  sinh  1  form, 

make  the  substitution  a;  =  a  sinh  u.  Or  see  Chapter  2,  Sections  15  and  17. 

'  ^  —  cosh-1  —  or  In  (y  +  \/ y2  —  a2  ^  .  ffint:  See  Problem  2  hints. 


J  ^ /y 2  -  a2 

^  \/l  +  a2x 2  dx  —  ^  \/ 1  +  a2x 2  +  sinh-1 


ax  or 


\/l  +  a2*2  +  In  ^ax  +  \/ 1  +  a2x2  j . 


Kdr 


=  sin  1  Kr  or  —  cos  1  Kr  or  tan  1 


Kr 


VI  -  K2r2  ~  ”*  "  VI  -  K2r2' 

Hints'.  Sketch  a  right  triangle  with  acute  angles  u  and  v  and  label  the  sides  so  that 
sinu  =  Kr.  Also  note  that  u  +  v  =  7t/2;  then  if  u  is  an  indefinite  integral,  so  is  —  v 
since  they  differ  by  a  constant  of  integration. 


6. 


/ 


Kdr  _!  K  _!  r  .  -i  K  -i  K 

—  ■  =  cos  —  or  sec  —  or  —  sin  —  or  —  tan  — . 

rVr2  -  K2  r  K  r  Vr2  -  K2 


2.  DOUBLE  AND  TRIPLE  INTEGRALS 

Recall  from  calculus  that  J^ydx  =  J ^  f(x)  dx 
gives  the  area  “under  the  curve”  in  Figure  2.1. 

Recall  also  the  definition  of  the  integral  as  the 
limit  of  a  sum:  We  approximate  the  area  by 
a  sum  of  rectangles  as  in  Figure  2.1;  a  rep¬ 
resentative  rectangle  (shaded)  has  width  Ax. 

The  geometry  indicates  that  if  we  increase  the 
number  of  rectangles  and  let  all  the  widths 
Ax  0,  the  sum  of  the  areas  of  the  rectan¬ 
gles  will  tend  to  the  area  under  the  curve.  We  define  J h  f(x)  dx  as  the  limit  of  the 
sum  of  the  areas  of  the  rectangles;  then  we  evaluate  the  integral  as  an  antiderivative, 
and  use  J  /(x)  dx  to  calculate  the  area  under  the  curve. 

We  are  going  to  do  something  very  similar 
in  order  to  find  the  volume  of  the  cylinder  in 
Figure  2.2  under  the  surface  2:  =  f(x,y).  We 
cut  the  (x,  y)  plane  into  little  rectangles  of  area 
A  A  =  (Ax)  (Ay)  as  shown  in  Figure  2.2;  above 
each  Ax  Ay  is  a  tall  slender  box  reaching  up  to 
the  surface.  We  can  approximate  the  desired 
volume  by  a  sum  of  these  boxes  just  as  we  ap¬ 
proximated  the  area  in  Figure  2.1  by  a  sum  of 
rectangles.  As  the  number  of  boxes  increases 
and  all  Ax  and  Ay  — >  0,  the  geometry  indi¬ 
cates  that  the  sum  of  the  volumes  of  the  boxes 
will  tend  to  the  desired  volume.  We  define  the 
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double  integral  of  f(x:y)  over  the  area  A  in  the  ( x,y )  plane  (Figure  2.2)  as  the 
limit  of  this  sum,  and  we  write  it  as  f  fA  /(x,  y)  dx  dy.  Before  we  can  use  the  double 
integral  to  compute  volumes,  however,  we  need  to  see  how  double  integrals  are  eval¬ 
uated.  Even  though  we  may  use  a  computer  to  do  the  work,  we  need  to  understand 
the  process  in  order  to  set  up  integrals  correctly  and  find  and  correct  errors.  Doing 
some  hand  evaluation  is  a  good  way  to  learn  this. 


Iterated  Integrals  We  now  show  by  some  examples  the  details  of  evaluating 
double  integrals. 


Example  1.  Find  the  volume  of  the  solid  (Figure  2.3)  below  the  plane  *  =  1  +  y,  bounded 
by  the  coordinate  planes  and  the  vertical  plane  2 x  +  y  =  2.  From  our  discussion 
above,  this  is  f  f  A  z  dx  dy  =  ffA(l  +  y)  dx  dy ,  where  A  is  the  shaded  triangle  in  the 
(x,y)  plane  [A  is  shown  also  in  Figure  2.4  (a  and  b)].  We  are  going  to  consider 
two  ways  of  evaluating  this  double  integral.  We  think  of  the  triangle  A  cut  up  into 
little  rectangles  A  A  =  Ax  Ay  (Figure  2.4)  and  the  whole  solid  cut  into  vertical 
columns  of  height  z  and  base  A  A  (Figure  2.3).  We  want  the  (limit  of  the)  sum 
of  the  volumes  of  these  columns.  First  add  up  the  columns  (Figure  2.4a)  for  a 
fixed  value  of  x  producing  the  volume  of  a  slab  (Figure  2.3)  of  thickness  Ax.  This 
corresponds  to  integrating  with  respect  to  y  (holding  x  constant,  Figure  2.4a)  from 
y  =  0  to  y  on  the  line  2x  +  y  =  2,  that  is  y  =  2  —  2x;  we  find 


(2.1) 


zdy  = 


>y= o 


'2/= o 


(1  +  y)dy=  (y  + 


2—2x 


=  (2  -  2x)  +  (2  -  2xf/2  =  4  -  6x  +  2x2. 


(What  we  have  found  is  the  area  of  the  slab  in  Figure  2.3;  its  volume  is  the  area 
times  Ax.)  Now  we  add  up  the  volumes  of  the  slabs;  this  corresponds  to  integrating 
(2.1)  with  respect  to  x  from  x  =  0  to  x  =  1: 


(4  —  6x  +  2x2)  dx 


5 

3‘ 


(2.2) 


x=0 
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We  could  summarize  (2.1)  and  (2.2)  by  writing 


pi  /  p 2 — 2x  \  pi  p2 — 2x 

(2.3)  /  (  /  (1  +  y)  dy )  dx  or  /  /  (1  +  y)  dy  dx 

</ \J y= 0  /  J x—0  J  y—0 

pi  p2 — 2x 

or  /  dx  dy(l  +  y). 

J  x=0  J  ii— 0 


We  call  (2.3)  an  iterated  (repeated)  integral.  Multiple  integrals  are  usually  evaluated 
by  using  iterated  integrals.  Note  that  the  large  parentheses  in  (2.3)  are  not  really 
necessary  if  we  are  always  careful  to  state  the  variable  in  giving  the  limits  on  an 
integral;  that  is,  always  write  J)1  (| ,  not  just J0' . 

Now  we  could  also  add  up  the  volume  z(AA)  by  first  integrating  with  respect 
to  x  (for  fixed  y ,  Figure  2.4b)  from  x  =  0tox  =  l—  yj  2  giving  the  volume  of  a 
slab  perpendicular  to  the  y  axis  in  Figure  2.3,  and  then  add  up  the  volumes  of  the 
slabs  by  integrating  with  respect  to  y  from  y  =  0  to  y  =  2  (Figure  2.4b).  We  write 


(2.4) 


rl-y/2 


(1  +  y)  dx)  dy 


'  x—0 


[  (l  +  y)x 
Jv= 0 


'  y—0 
(•2 


i-y/2 


dy 


x=0 


[  (l  +  y)(l  -y/2)dy 

Jy=0 

[  (i  +  y/ 2  -  y2/2)  dy 

Jy= 0 


5 

3' 


As  the  geometry  would  indicate,  the  results  in  (2.2)  and  (2.4)  are  the  same;  we  have 
two  methods  of  evaluating  the  double  integral  by  using  iterated  integrals. 

Often  one  of  these  two  methods  is  more  convenient  than  the  other;  we  choose 
whichever  method  is  easier.  To  see  how  to  decide,  study  the  following  sketches  of 
areas  A  over  which  we  want  to  find  ffA  f(x,y)  dx  dy.  In  each  case  we  think  of 
combining  little  rectangles  dx  dy  to  form  strips  (as  shown)  and  then  combining  the 
strips  to  cover  the  whole  area. 

Areas  shown  in  Figure  2.5:  Integrate  with  respect  to  y  first.  Note  that  the  top 
and  bottom  of  area  A  are  curves  whose  equations  we  know;  the  boundaries  at  x  =  a 
and  x  =  b  are  either  vertical  straight  lines  or  else  points. 


We  find 


(2.5) 


A 


V=Vi(x) 
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Areas  shown  in  Figure  2.6:  Integrate  with  respect  to  x  first.  Note  that  the  sides 
of  area  A  are  curves  whose  equations  we  know;  the  boundaries  at  y  =  c  and  y  =  d 
are  either  horizontal  straight  lines  or  else  points. 


d  — 


S2(y! 

Figure  2.6 


*26) 


Areas  shown  in  Figure  2.7:  Integrate  in  either  order.  Note  that  these  areas  all 
satisfy  the  requirements  for  both  (2.5)  and  (2.6). 


We  find 
(2.7) 


r  r  rb  rvAx) 

f(x,y)dxdy=  /  f(x,y)dydx 

J  J  Jx-aJy-yi(x) 

f{x,  y)  dx  dy. 


r=a  Jy=yi(x) 
rd  r-x2{y) 


Jy=c  Jx=xi{y) 

An  important  special  case  is  a  double  integral  over  a  rectangle  (both  x  and  y 
limits  are  constants)  when  f{x,y)  is  a  product,  f(x,y)  =  g{x)h{y).  Then 


b  rd 


(2.8)  f  f  f(x,y)dxdy  =  f  j 

J  J  J  x=a  J  y 


g{x)h(y)  dydx 
g{x)  dxj  [j  h(y)  dy 


x—a  J  y—c 

f 

/  a 


When  areas  are  more  complicated  than  those  shown,  we  may  break  them  into 
two  or  more  simpler  areas  (Problems  9  and  10). 

We  have  seen  how  to  set  up  and  evaluate  double  integrals  to  find  areas  and 
volumes.  Recall,  however,  that  we  use  single  integrals  for  other  purposes  than 
finding  areas.  Similarly,  now  that  we  know  how  to  evaluate  a  double  integral,  we 
can  use  it  to  find  other  quantities  besides  areas  and  volumes. 
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Example  2.  Find  the  mass  of  a  rectangular  plate  bounded  by  x  =  0,  x  =  2,  y  =  0,  y  =  1, 
if  its  density  (mass  per  unit  area)  is  f(x,y)  =  xy.  The  mass  of  a  tiny  rectangle 
A  A  =  Ax  Ay  is  approximately  f(x,y)  Ax  Ay,  where  f(x,y)  is  evaluated  at  some 
point  in  A  A.  We  want  to  add  up  the  masses  of  all  the  Ad’s;  this  is  what  we  find  by 
evaluating  the  double  integral  of  dM  =  xy  dx  dy.  We  call  dM  an  element  of  mass 
and  think  of  adding  up  all  the  dM’s  to  get  M. 


(2.9) 


M  =  /  /  xy  dxdy  = 


xy  dx  dy 


'  x—0 J y= 0 


=  l. 


A  triple  integral  of  f(x,y,z)  over  a  volume  V,  written  J  f  fv  f(x,  y,  z)  dx  dy  dz, 
is  also  defined  as  the  limit  of  a  sum  and  is  evaluated  by  an  iterated  integral.  If  the 
integral  is  over  a  box,  that  is,  all  limits  are  constants,  then  we  can  do  the  x,  y,  z 
integrations  in  any  order.  If  the  volume  is  complicated,  then  we  have  to  consider 
the  geometry  as  we  did  for  double  integrals  to  decide  on  the  best  order  and  find  the 
limits.  This  process  can  best  be  learned  from  examples  (below  and  Section  3)  and 
practice  (see  problems). 


Example  3.  Find  the  volume  of  the  solid  in  Figure  2.3  by  using  a  triple  integral.  Here 
we  imagine  the  whole  solid  cut  into  tiny  boxes  of  volume  Ax  Ay  Az]  an  element  of 
volume  is  dx  dy  dz .  We  first  add  up  the  volumes  of  the  tiny  boxes  to  get  the  volume 
of  a  column;  this  means  integrating  with  respect  to  z  from  0  to  1  +  y  with  x  and  y 
constant.  Then  we  add  up  the  columns  to  get  a  slab  and  the  slabs  to  get  the  whole 
volume  just  as  we  did  in  Example  1.  Thus: 


(2.10) 


V  = 


dx  dy  dz 


r*l  />2—2x  /  /»1 +y 


lx—0Jy—0  \J  z— 0 

r»l  n2—2x 


dz)dydx  or 

/*!  pZ~ZX  £ 

/  /  (1  +  y)dydx=  -, 

J  x—0  J  v—0 


/y— o 


dz  dy  dx 


as  in  (2.1)  and  (2.2).  Or,  we  could  have  used  (2.4). 


Example  4.  Find  the  mass  of  the  solid  in  Figure  2.3  if  the  density  (mass  per  unit  volume) 
is  x  +  z.  An  element  of  mass  is  dM  =  (x  +  z)  dx  dy  dz.  We  add  up  elements  of 
mass  just  as  we  add  up  elements  of  volume;  that  is,  the  limits  are  the  same  as  in 
Example  3. 


pi  p2—2x  rl+y 

(2.11)  M=  /  /  (x  +  z)  dz  dy  dx  =  2 

J  x=0  J  y— 0  J  z— 0 

where  we  evaluate  the  integrals  as  we  did  (2.1)  to  (2.4).  (Check  the  result  by  hand 
and  by  computer.) 
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PROBLEMS,  SECTION  2 

In  the  problems  of  this  section,  set  up  and  evaluate  the  integrals  by  hand  and  check  your 


results  by  computer. 

1.  3xdydx 

J  x=0  J  y= 2 

2.  /  /  8xydxdy 

[2  ,4 

3.  dxdy 

J  y= 0  J  x=2y 

Jy=- 2 Jx= 1 

4.  /  /  ydydx 

J  x=0 J y= 0 

5.  T  r  ydydx 

J  x=0  J  y=x 

j'2  ry2 

6.  /  /  xdxdy 

■'  y= 1  Jx  =  ^y 

In  Problems  7  to  18  evaluate  the  double  integrals  over  the  areas  described.  To  find  the 
limits,  sketch  the  area  and  compare  Figures  2.5  to  2.7. 

7.  ffA( 2x  —  3 y)  dxdy,  where  A  is  the  triangle  with  vertices  (0,  0),  (2, 1),  (2,  0). 

8.  ffA  6  y2  cos  x  dxdy,  where  A  is  the  area  inclosed  by  the  curves  y  =  sin  a;,  the  x  axis, 
and  the  line  x  =  tv /2. 


9.  ffA  sin  x  dx  dy  where  A  is  the  area  shown  in  Figure  2.8. 

10.  ffAydxdy  where  A  is  the  area  in  Figure  2.8. 

(tt,  4) 

0 

1  - 

Figure  2.8 


11-  SSa  xdxdy,  where  A  is  the  area  between  the  parabola  y  =  x2  and  the  straight  line 
2x  —  y  +  8  =  0. 

12-  ff  ydxdy  over  the  triangle  with  vertices  (—1,0),  (0,2),  and  (2,0). 

13.  ff  2 xydxdy  over  the  triangle  with  vertices  (0,0),  (2,1),  (3,0). 

14.  ff  x2ex2y  dxdy  over  the  area  bounded  by  y  =  a;-1,  y  =  x~2,  and  x  =  ln4. 

15-  ff  dx  dy  over  the  area  bounded  by  y  =  lna:,  y  =  e  +  1  —  x,  and  the  x  axis. 

16.  ff  ( 9  +  2j/2)-1  dxdy  over  the  quadrilateral  with  vertices  (1,3),  (3,3),  (2,6),  (6,6). 

17.  ff (x/y) dx  dy  over  the  triangle  with  vertices  (0,  0),  (1, 1),  (1,  2). 

18.  f f  y ~1,/2  dx  dy  over  the  area  bounded  by  y  =  x2,  x  +  y  =  2,  and  the  y  axis. 

In  Problems  19  to  24,  use  double  integrals  to  find  the  indicated  volumes. 

19.  Above  the  square  with  vertices  at  (0,  0),  (2,  0),  (0,  2),  and  (2,  2),  and  under  the  plane 
z  =  8  —  x  +  y. 

20.  Above  the  rectangle  with  vertices  (0,0),  (0,1),  (2,0),  and  (2,1),  and  below  the 
surface  z2  =  36a;2  (4  —  x2). 

21.  Above  the  triangle  with  vertices  (0,0),  (2,0),  and  (2, 1),  and  below  the  paraboloid 
z  =  24  —  x2  —  y2. 

22.  Above  the  triangle  with  vertices  (0,2),  (1,1),  and  (2,2),  and  under  the  surface 
z  =  xy. 

23.  Under  the  surface  z  =  y(x  +  2),  and  over  the  area  bounded  by  x  +  y  =  0,  y  =  1, 
V  =  \fx. 

24.  Under  the  surface  z  =  l/(y  +  2),  and  over  the  area  bounded  by  y  =  x  and  y2  +x  =  2. 
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In  Problems  25  to  28,  sketch  the  area  of  integration,  observe  that  it  is  like  the  areas 
in  Figure  2.7,  and  so  write  an  equivalent  integral  with  the  integration  in  the  opposite 
order.  Check  your  work  by  evaluating  the  double  integral  both  ways.  Also  check  that  your 
computer  gives  the  same  answer  for  both  orders  of  integration. 


r  1  r3—3x 

25.  /  /  dydx 

J  x=0  J  y= 0 

26. 

/  /  {x  +  y)dx  dy 

J  y= 0  J  x=y  /  2 

/»4  rVx 

27.  /  /  y\fxdydx 

J  x=0  J  y= 0 

28. 

nl  r\/l—y2 

/  /  ydxdy 

J y= 0 J X=0 

In  Problems  29  to  32,  observe  that  the  inside  integral  cannot  be  expressed  in  terms  of 
elementary  functions.  As  in  Problems  25  to  28,  change  the  order  of  integration  and  so 
evaluate  the  double  integral.  Also  try  using  your  computer  to  evaluate  these  for  both 
orders  of  integration. 

29.  f  [  SmX  dxdy 

J y= 0 J x=y  ^ 

30. 

f  f  e  v  !2  dy  dx 

J  x=0  J  y=x 

3i  finw  r  dydx 

Jx= 0  Jy=e*/2  lnf/ 

32. 

[  [  — =dxdy 

J y= 0  J X=y2  Vx 

33.  A  lamina  covering  the  quarter  disk  x2  +  y2  <  4,  x  >  0,  y  >  0,  has  (area)  density 
x  +  y.  Find  the  mass  of  the  lamina. 


34.  A  dielectric  lamina  with  charge  density  proportional  to  y  covers  the  area  between 
the  parabola  y  =  16  —  x2  and  the  x  axis.  Find  the  total  charge. 


35.  A  triangular  lamina  is  bounded  by  the  coordinate  axes  and  the  line  x  +  y  =  6.  Find 
its  mass  if  its  density  at  each  point  P  is  proportional  to  the  square  of  the  distance 
from  the  origin  to  P. 

36.  A  partially  silvered  mirror  covers  the  square  area  with  vertices  at  (±1,±1).  The 
fraction  of  incident  light  which  it  reflects  at  (x,y)  is  (x  —  y)2/ 4.  Assuming  a  uniform 
intensity  of  incident  light,  find  the  fraction  reflected. 


In  Problems  37  to  40,  evaluate  the  triple  integrals. 


r2  p2x  ny  —  x 

/  /  /  dzdydx 

J  x=l  J  y=x  J  Z—O 

38.  f  f  j  dydxdz 

J  2=0  J  x=z  J  y=8x 

p3  r2  r2y+z 

/  /  /  6y  dxdz  dy 

J  y=  —  2  J  z=l  J  x=y-\-z 

r2  r2x  r^-/z 

40.  /  /  /  zdydzdx 

J  x=\  J  z=x  J  y= 0 

41.  Find  the  volume  between  the  planes  z  =  2x  +  3y  +  6  and  z  =  2x  +  7y  +  8,  and  over 
the  triangle  with  vertices  (0,  0),  (3,  0),  and  (2, 1). 

42.  Find  the  volume  between  the  planes  2  =  2x  +  3y  +  6  and  z  =  2x  +  7y  +  8,  and  over 
the  square  in  the  ( x,y )  plane  with  vertices  (0,0),  (1,0),  (0, 1),  (1, 1). 

43.  Find  the  volume  between  the  surfaces  z  =  2x2  +  y2  +  12  and  z  =  x2  +  y2  +  8,  and 

over  the  triangle  with  vertices  (0,0),  (1,0),  and  (1,2). 

44.  Find  the  mass  of  the  solid  in  Problem  42  if  the  density  is  proportional  to  y. 

45.  Find  the  mass  of  the  solid  in  Problem  43  if  the  density  is  proportional  to  x. 

46.  Find  the  mass  of  a  cube  of  side  2  if  the  density  is  proportional  to  the  square  of  the 

distance  from  the  center  of  the  cube. 
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47.  Find  the  volume  in  the  first  octant  bounded  by  the  coordinate  planes  and  the  plane 
x  +  2y  +  z  =  4. 

48.  Find  the  volume  in  the  first  octant  bounded  by  the  cone  z2  =  x2  —  y 2  and  the  plane 
x  =  4. 

49.  Find  the  volume  in  the  first  octant  bounded  by  the  paraboloid  z  =  1  —  x2  —  y2,  the 
plane  x  +  y  =  1,  and  all  three  coordinate  planes. 

50.  Find  the  mass  of  the  solid  in  Problem  48  if  the  density  is  z. 

3.  APPLICATIONS  OF  INTEGRATION; 

SINGLE  AND  MULTIPLE  INTEGRALS 

Many  different  physical  quantities  are  given  by  integrals;  let  us  do  some  problems 
to  illustrate  setting  up  and  evaluating  these  integrals.  The  basic  idea  which  we  use 
in  setting  up  the  integrals  in  these  problems  is  that  an  integral  is  the  “limit  of  a 
sum.”  Thus  we  imagine  the  physical  object  (whose  volume,  moment  of  inertia,  etc., 
we  are  trying  to  find)  cut  into  a  large  number  of  small  pieces  called  elements.  We 
write  an  approximate  formula  for  the  volume,  moment  of  inertia,  etc.,  of  an  element 
and  then  sum  over  all  elements  of  the  object.  The  limit  of  this  sum  (as  the  number 
of  elements  tends  to  infinity  and  the  size  of  each  element  tends  to  zero)  is  what  we 
find  by  integration  and  is  what  we  want  in  the  physical  problem. 

Using  a  computer  to  evaluate  the  integrals  saves  time  and  we  will  concentrate 
mainly  on  setting  up  integrals.  However,  in  order  to  do  a  skillful  job  of  finding 
limits,  deciding  order  of  integration,  detecting  and  correcting  errors,  making  useful 
changes  of  variables,  and  understanding  the  meaning  of  the  symbols  used,  it  is 
important  to  learn  to  evaluate  multiple  integrals  by  hand.  So  a  good  study  method 
is  to  do  some  integrals  both  by  hand  and  by  computer.  A  computer  is  also  very 
useful  to  plot  graphs  of  curves  and  surfaces  to  help  you  find  the  limits  in  a  multiple 
integral. 

Example  1.  Given  the  curve  y  =  x2  from  x  =  0  to  x  =  1,  find 

(a)  the  area  under  the  curve  (that  is,  the  area  bounded  by  the  curve,  the  x  axis, 
and  the  line  x  =  1;  see  Figure  3.1); 

(b)  the  mass  of  a  plane  sheet  of  material  cut  in  the  shape  of  this  area  if  its  density 
(mass  per  unit  area)  is  xy\ 

(c)  the  arc  length  of  the  curve; 

(d)  the  centroid  of  the  area; 

(e)  the  centroid  of  the  arc; 

(f)  the  moments  of  inertia  about  the  x,  y,  and  z  axes  of  the  lamina  in  (b). 

(a)  The  area  is 


A  = 


i 

o 


1 

3' 
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We  could  also  find  the  area  as  a  double  integral  of  dA  =  dydx  (see  Figure  3.1). 
We  have  then 

ri 


A  = 


f  f 

J  x—0  J  v- 


dy  dx  =  /  x  dx 

/  x—0  J  y—0  j  0 


as  before.  Although  the  double  integral  is  entirely  unnecessary  in  finding  the  area 
in  this  problem,  we  shall  need  to  use  a  double  integral  to  find  the  mass  in  part  (b) . 


(b)  The  element  of  area,  as  in  the  double  integral  method  in  (a) ,  is  dA  =  dy  dx. 
Since  the  density  is  p  =  xy ,  the  element  of  mass  is  dM  =  xy  dy  dx ,  and  the  total 
mass  is 


,t  r2 

M=  /  xydydx 
J  x=0  J  y—0 


i 

12' 


Observe  that  we  could  not  do  this  problem  as  a  single  integral  because  the  density 
depends  on  both  x  and  y. 


(c)  The  element  of  arc  length  ds  is  defined  as  indicated  in  Figures  3.1  and  3.2. 
Thus  we  have 


ds 2  =  dx2  +  dy2, 

ds  =  dx2  +  dy2  =  y/l  +  ( dy/dx )2  dx  =  i/ ( dx/dy )2  +  1  dy. 


If  y  =  f(x)  has  a  continuous  first  derivative  dy/  dx  (except  possibly  at  a  finite  num¬ 
ber  of  points),  we  can  find  the  arc  length  of  the  curve  y  =  f(x)  between  a  and  b  by 
calculating  fa  ds.  For  our  example,  we  have 


(3.2) 


ds  =  s/l  +  Ax2  dx, 

2V5  +  ln(2  +  y/5) 
4 


(see  Problem  32). 
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(d)  Recall  from  elementary  physics  that: 


The  center  of  mass  of  a  body  has  coordinates  x,  y,  z  given  by  the  equations 


(3.3) 


x  dM  =  x  dM. 


ydM  =  /  ydM , 


zdM  =  /  zdM, 


where  dM  is  an  element  of  mass  and  the  integrals  are  over  the  whole  body. 


Although  we  have  written  single  integrals  in  (3.3),  they  may  be  single,  double,  or 
triple  integrals  depending  on  the  problem  and  the  method  of  evaluation.  Since  x, 
y,  and  z  are  constants,  we  can  take  them  outside  the  integrals  in  (3.3)  and  solve 
for  them.  However,  you  may  find  it  easier  to  remember  the  definitions  in  the  form 
(3.3).  For  the  example  we  are  doing,  z  =  0  since  the  body  is  a  sheet  of  material 
in  the  ( x,y )  plane.  The  element  of  mass  is  dM  =  pdA  =  pdxdy ,  where  p  is  the 
density  (mass  per  unit  area  in  this  problem).  For  a  variable  density  as  in  (b),  we 
would  substitute  the  value  of  p  into  (3.3)  and  integrate  both  sides  of  each  equation 
to  find  the  coordinates  of  the  center  of  mass.  However,  let  us  suppose  the  density 
is  a  constant.  Then  the  first  integral  in  (3.3)  is 


(3.4) 


J  xp  dA  or 


/ 


x  dA  = 


/ 


x  dA. 


Similarly,  a  constant  density  p  can  be  canceled  from  all  the  equations  in  (3.3).  The 
quantities  x ,  y ,  z,  are  then  called  the  coordinates  of  the  centroid  of  the  area  (or 
volume  or  arc). 


The  centroid  of  a  body  is  the  center  of  mass  when  we  assume  constant  density. 


In  our  example,  we  have 


(3.5) 


rl  rx2  nl  nx 

/  /  x  dy  dx  =  /  / 

J x—0 J v— 0  J x—0 J v- 


xdy  dx  =  I  I  xdy  dx 
/  x—0  ^  y—0  J  rc=0  J  y—0 

/  /  ydydx=  /  y  dy  dx 

/  x—0  J  y—0  J  x=0  J  y—0 


or  xA  =  — 
4 


or  yA  = 


o 

5  1 


10 


1 

4’ 

1 

To' 


(Double  integrals  are  not  really  necessary  for  any  of  these  but  the  last.)  Using  the 
value  of  A  from  part  (a),  we  find  x  =  |,  y  = 


(e)  The  center  of  mass  (x,y)  of  a  wire  bent  in  the  shape  of  the  curve  y  =  f(x) 
is  given  by 


(3.6) 


where  p  is  the  density  (mass  per  unit  length) ,  and  the  integrals  are  single  integrals 
with  ds  given  by  (3.1).  If  p  is  constant,  (3.6)  defines  the  coordinates  of  the  centroid. 
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In  our  example  we  have 


(3.7) 


/  x\J  1  +  4x2  dx  =  /  xy/ 1  +  4x2  dx, 

Jo  Jo 

/  yy/l  +  4x2  dx  =  /  y\/ 1  +  4x2  dx  =  /  a;2\/l  +  4a:2  dx. 
Jo  Jo  Jo 


Note  carefully  here  that  it  is  correct  to  put  y  =  x2  in  the  last  integral  of  (3.7),  but 
it  would  not  have  been  correct  to  do  this  in  the  last  integral  of  (3.5);  the  reason  is 
that  over  the  area ,  y  could  take  values  from  zero  to  x2 ,  but  on  the  arc,  y  takes  only 
the  value  x2.  By  calculating  the  integrals  in  (3.7)  we  can  find  x  and  y. 


(f)  We  need  the  following  definition: 


The  moment  of  inertia  I  of  a  point  mass  m  about  an  axis  is  by  definition  the 
product  ml 2  of  to  times  the  square  of  the  distance  l  from  to  to  the  axis.  For  an 
extended  object  we  must  integrate  PdM  over  the  whole  object,  where  l  is  the 
distance  from  dM  to  the  axis. 


In  our  example  with  variable  density  p  =  xy,  we  have 
dM  =  xy  dy  dx.  The  distance  from  dM  to  the  x  axis  is 
y  (Figure  3.3);  similarly,  the  distance  from  dM  to  the 
y  axis  is  x.  The  distance  from  dM  to  the  z  axis  (the 
z  axis  is  perpendicular  to  the  paper  in  Figure  3.3)  is 
\J x1  +  y2.  Then  the  three  moments  of  inertia  about 
the  three  coordinate  axes  are: 


4 


4 


y2xy  dy  dx 


4=o 


x2xy  dy  dx 


4=o 


1 

40’ 

1 

16’ 


4=o 


(x2  +  y2)xy  dy  dx  =  Ix  +  Iy  = 


7 

80' 


The  fact  that  Ix  +  Iy  =  Iz  for  a  plane  lamina  in  the  ( x ,  y)  plane  is  known  as  the 
perpendicular  axis  theorem. 


It  is  customary  to  write  moments  of  inertia  as  multiples  of  the  mass;  using  M  = 
from  (b),  we  write 


i  o  Q 

4  =  —  M  =  —M, 
40  10 


1  O  Q 

I  =  —M  =  -M, 
v  16  4  ’ 


7-12  21 

4  = - M  =  —M. 

80  20 


Example  2.  Rotate  the  area  of  Example  1  about  the  x  axis  to  form  a  volume  and  surface 
of  revolution,  and  find 

(a)  the  volume; 
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(b)  the  moment  of  inertia  about  the  x  axis  of  a  solid  of  constant  density  occupying 
the  given  volume; 

(c)  the  area  of  the  curved  surface; 

(d)  the  centroid  of  the  curved  surface. 

(a)  We  want  to  find  the  given  volume. 


The  easiest  way  to  find  a  volume  of  revolution  is  to  take  as  volume  element  a 
thin  slab  of  the  solid  as  shown  in  Figure  3.4.  The  slab  has  circular  cross  section 
of  radius  y  and  thickness  dx ;  thus  the  volume  element  is  iry2  dx. 


Then  the  volume  in  our  example  is 

(3.8)  V  =  f  ny2 dx  =  f  tt x4dx=^-. 

Jo  Jo  5 

We  have  really  avoided  part  of  the  integration  here  because  we  knew  the  formula  for 
the  area  of  a  circle.  In  finding  volumes  of  solids  which  are  not  solids  of  revolution, 
we  may  have  to  use  double  or  triple  integrals.  Even  for  a  solid  of  revolution  we 
might  need  multiple  integrals  to  find  the  mass  if  the  density  is  variable. 

To  illustrate  setting  up  such  integrals,  let  us  do  the  above  problem  using  triple 
integrals.  For  this  we  need  the  equation  of  the  surface  which  is  (see  Problem  16) 

(3.9)  V2  +  z2  =  a:4,  x  >  0. 


To  set  up  a  multiple  integral  for  the  volume  of  a  solid,  we  cut  the  solid  into  slabs 
as  in  Figure  3.4  (not  necessarily  circular  slabs,  although  they  are  in  our  example) 
and  then  as  in  Figure  3.5  we  cut  each  slab  into  strips  and  each  strip  into  tiny  boxes 
of  volume  dx  dy  dz.  The  volume  is 


V  = 


dx  dy  dz\ 


the  only  problem  is  to  find  the  limits!  To  do  this,  we  start  by  adding  up  tiny  boxes 
to  get  a  strip;  as  we  have  drawn  Figure  3.5,  this  means  to  integrate  with  respect  to 
y  from  one  side  of  the  circle  y2  +  z2  =  xA  to  the  other,  that  is,  from 


=  —\fxA  — 


to  y  =  +\/xA  —  z2. 
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Next  we  add  all  the  strips  in  a  slab.  This  means  that,  in  Figure  3.5,  we  integrate 
with  respect  to  z  from  the  bottom  to  the  top  of  the  circle  y2  +  z2  =  x4;  thus  the  2 
limits  are  z  =  ±  radius  of  circle  =  ±x2.  And  finally  we  add  all  the  slabs  to  obtain 
the  solid.  This  means  to  integrate  in  Figure  3.4  from  x  =  0  to  x  =  1;  this  is  just 
what  we  did  in  our  first  simple  method.  The  final  integral  is  then 

1  j>X 2  r-y/x4  —  Z2 

(3.10)  V  =  /  /  /  dydzdx. 

J x—0  J  z=—x2  J  y=—  \/x4  —  z2 

(See  Problem  33) . 

Although  the  triple  integral  is  an  unnecessarily  complicated  way  of  finding  a 
volume  of  revolution,  this  simple  problem  illustrates  the  general  method  of  setting 
up  an  integral  for  any  kind  of  volume.  Once  we  have  the  volume  as  a  triple  integral, 
it  is  easy  to  write  the  integrals  for  the  mass  with  a  given  variable  density,  for  the 
coordinates  of  the  centroid,  for  the  moments  of  inertia,  and  so  on.  The  limits  of 
integration  are  the  same  as  for  the  volume;  we  need  only  insert  the  proper  expres¬ 
sions  (density,  etc.)  in  the  integrand  to  get  the  mass,  centroid,  and  so  on. 


(b)  To  find  the  moment  of  inertia  of  the  solid  about  the  x  axis,  we  must  integrate 
the  quantity  l2dM ,  where  l  is  the  distance  from  dM  to  the  x  axis;  from  Figure  3.5, 
since  the  x  axis  is  perpendicular  to  the  paper,  l 2  =  y2  +  z2.  The  limits  on  the 
integrals  are  the  same  as  in  (3.10).  We  are  assuming  constant  density,  so  the  factor 
p  can  be  written  outside  the  integrals.  Then  we  have 

pi  nx2  p^/x4—z2 

Ix  =  P  /  /  _ (y2  +  z2)  dydzdx  =  —p. 

J x—0  J z—  —  x2  J y—  —  y/x4  —  z2 

Since  from  (3.8)  the  mass  of  the  solid  is 

M  =  pV  =  —p 
5 

we  can  write  Ix  (as  is  customary)  as  a  multiple  of  M: 


Ix  =  —  -M  =  —  M. 

x  18  7T  18 


(c)  We  find  the  area  of  the  surface  of  revolution  by  using  as  element  the  curved 
surface  of  a  thin  slab  as  in  Figure  3.6.  This  is  a  strip  of  circumference  2ny  and 
width  ds.  To  see  this  clearly  and  to  understand  why  we  use  ds  here  but  dx  in  the 
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volume  element  in  (3.8),  think  of  the  slab  as  a  thin  section  of  a  cone  (Figure  3.7) 
between  planes  perpendicular  to  the  axis  of  the  cone.  If  you  wanted  to  find  the 
total  volume  V  =  i nr2h  of  the  cone,  you  would  use  the  height  h  perpendicular  to 
the  base,  but  in  finding  the  total  curved  surface  area  S  =  ^  •  2nr  ■  s,  you  would  use 
the  slant  height  s.  The  same  ideas  hold  in  finding  the  volume  and  surface  elements. 
The  approximate  volume  of  the  thin  slab  is  the  area  of  a  face  of  the  slab  times  its 
thickness  ( dh  in  Figure  3.7,  dx  in  Figure  3.4).  But  if  you  think  of  a  narrow  strip 
of  paper  just  covering  the  curved  surface  of  the  thin  slab,  the  width  of  the  strip  of 
paper  is  ds7  and  its  length  is  the  circumference  of  the  thin  slab. 


The  element  of  surface  area  (in  Figure  3.6)  is  then 
(3.11)  dA  =  2i:yds. 


The  total  area  is  [using  ds  from  (3.2)] 

A=  f  2i ry  ds  =  f  2nx2  \J\  +  4x2  dx. 

Jx- 0  Jo 

(For  more  general  surfaces,  there  is  a  way  to  calculate  areas  by  double  integration; 
we  shall  take  this  up  in  Section  5.) 

(d)  The  y  and  z  coordinates  of  the  centroid  of  the  surface  area  are  zero  by 
symmetry.  For  the  x  coordinate,  we  have  by  (3.4) 

J  x  dA  =  j  x  dA, 

or,  using  dA  =  2i ry  ds  and  the  total  area  A  from  (c),  we  have 

xA  =  /  x-2nyds=  /  x  ■  2ttx2\/1  +  Ax2  dx. 

Jx= o  Jo 

PROBLEMS,  SECTION  3 

The  following  notation  is  used  in  the  problems: 

M  =  mass, 

x,  y,  z  =  coordinates  of  center  of  mass  (or  centroid  if  the  density  is  constant), 

I  =  moment  of  inertia  (about  axis  stated), 

Ix,  Iy,  Iz  =  moments  of  inertia  about  x,  y,  z  axes, 

=  moment  of  inertia  (about  axis  stated)  through  the  center  of  mass. 

Note:  It  is  customary  to  give  answers  for  /,  7m,  Ix,  etc.,  as  multiples  of  M  (for 
example,  I  =  | Ml 2). 

1.  Prove  the  “parallel  axis  theorem”:  The  moment  of  inertia  I  of  a  body  about  a  given 
axis  is  /  =  Im  +  Md2,  where  M  is  the  mass  of  the  body,  Im  is  the  moment  of  inertia 
of  the  body  about  an  axis  through  the  center  of  mass  and  parallel  to  the  given  axis, 
and  d  is  the  distance  between  the  two  axes. 
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2.  For  a  thin  rod  of  length  l  and  uniform  density  p  find 

(a)  M, 

(b)  Im  about  an  axis  perpendicular  to  the  rod, 

(c)  I  about  an  axis  perpendicular  to  the  rod  and  passing  through  one  end  (see 
Problem  1). 

3.  A  thin  rod  10  ft  long  has  a  density  which  varies  uniformly  from  4  to  24  lb/ft.  Find 

(a)  M, 

(b)  x, 

(c)  Im  about  an  axis  perpendicular  to  the  rod, 

(d)  I  about  an  axis  perpendicular  to  the  rod  passing  through  the  heavy  end. 

4.  Repeat  Problem  3  for  a  rod  of  length  l  with  density  varying  uniformly  from  2  to  1. 

5.  For  a  square  lamina  of  uniform  density,  find  I  about 

(a)  a  side, 

(b)  a  diagonal, 

(c)  an  axis  through  a  corner  and  perpendicular  to  the  plane  of  the  lamina.  Hint: 
See  the  perpendicular  axis  theorem,  Example  If. 

6.  A  triangular  lamina  has  vertices  (0,0),  (0,6)  and  (6,0),  and  uniform  density.  Find: 

(a)  x,  y, 

(b)  4, 

(c)  Im  about  an  axis  parallel  to  the  x  axis.  Hint:  Use  Problem  1  carefully. 

7.  A  rectangular  lamina  has  vertices  (0,0),  (0,2),  (3,0),  (3,2)  and  density  xy.  Find 

(a)  M, 

0)  X,  y , 

(c)  Ix ,  Iy , 

(d)  Im  about  an  axis  parallel  to  the  z  axis.  Hint:  Use  the  parallel  axis  theorem 
and  the  perpendicular  axis  theorem. 

8.  For  a  uniform  cube,  find  I  about  one  edge. 

9.  For  the  pyramid  inclosed  by  the  coordinate  planes  and  the  plane  x  +  y  +  z  —  1: 

(a)  Find  its  volume. 

(b)  Find  the  coordinates  of  its  centroid. 

(c)  If  the  density  is  2,  find  M  and  z. 

10.  A  uniform  chain  hangs  in  the  shape  of  the  catenary  y  =  cosh  a;  between  x  =  —  1  and 
x  =  1.  Find  (a)  its  length,  (b)  y. 

11.  A  chain  in  the  shape  y  =  x2  between  x  =  —1  and  x  =  1  has  density  |a;|.  Find 
(a)  M,  (b)  x,  y. 

Prove  the  following  two  theorems  of  Pappus: 

12.  The  area  A  inside  a  closed  curve  in  the  (x,y)  plane,  y  >  0,  is  revolved  about  the  x 
axis.  The  volume  of  the  solid  generated  is  equal  to  A  times  the  circumference  of  the 
circle  traced  by  the  centroid  of  A.  Hint:  Write  the  integrals  for  the  volume  and  for 
the  centroid. 
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13.  An  arc  in  the  ( x,y )  plane,  y  >  0,  is  revolved  about  the  x  axis.  The  surface  area 
generated  is  equal  to  the  length  of  the  arc  times  the  circumference  of  the  circle 
traced  by  the  centroid  of  the  arc. 

14.  Use  Problems  12  and  13  to  find  the  volume  and  surface  area  of  a  torus  (doughnut). 

15.  Use  Problems  12  and  13  to  find  the  centroids  of  a  semicircular  area  and  of  a  semi¬ 

circular  arc.  Hint:  Assume  the  formulas  A  =  4nr2,  V  =  §7rr3  for  a  sphere. 

16.  Let  a  curve  y  =  f(x )  be  revolved  about  the  x  axis,  thus  forming  a  surface  of 

revolution.  Show  that  the  cross  sections  of  this  surface  in  any  plane  x  =  const, 
[that  is,  parallel  to  the  ( y,z )  plane]  are  circles  of  radius  f(x).  Thus  write  the 
general  equation  of  a  surface  of  revolution  and  verify  the  special  case  f(x)  =  x2  in 
(3.9). 

In  Problems  17  to  30,  for  the  curve  y  =  y/x,  between  x  =  0  and  x  =  2,  find: 

17.  The  area  under  the  curve. 

18.  The  arc  length. 

19.  The  volume  of  the  solid  generated  when  the  area  is  revolved  about  the  x  axis. 

20.  The  curved  area  of  this  solid. 

21.  22,  23.  The  centroids  of  the  arc,  the  volume,  and  the  surface  area. 

24,  25,  26,  27.  The  moments  of  inertia  about  the  x  axis  of  a  lamina  in  the  shape  of 

the  plane  area  under  the  curve;  of  a  wire  bent  along  the  arc  of  the  curve;  of  the  solid  of 

revolution;  and  of  a  thin  shell  whose  shape  is  the  curved  surface  of  the  solid  (assuming 

constant  density  for  all  these  problems). 

28.  The  mass  of  a  wire  bent  in  the  shape  of  the  arc  if  its  density  (mass  per  unit  length) 
is  y/x. 

29.  The  mass  of  the  solid  of  revolution  if  the  density  (mass  per  unit  volume)  is  \xyz\. 

30.  The  moment  of  inertia  about  the  y  axis  of  the  solid  of  revolution  if  the  density  is 

\xyz\. 

31.  (a)  Revolve  the  curve  y  =  x  1 .  from  x  =  1  to  x  =  oo,  about  the  x  axis  to  create 

a  surface  and  a  volume.  Write  integrals  for  the  surface  area  and  the  volume. 
Find  the  volume,  and  show  that  the  surface  area  is  infinite.  Hint:  The  surface 
area  integral  is  not  easy  to  evaluate,  but  you  can  easily  show  that  it  is  greater 
than  Jj00  aU1  dx  which  you  can  evaluate. 

(b)  The  following  question  is  a  challenge  to  your  ability  to  fit  together  your  math¬ 
ematical  calculations  and  physical  facts:  In  (a)  you  found  a  finite  volume  and 
an  infinite  area.  Suppose  you  fill  the  finite  volume  with  a  finite  amount  of  paint 
and  then  pour  off  the  excess  leaving  what  sticks  to  the  surface.  Apparently  you 
have  painted  an  infinite  area  with  a  finite  amount  of  paint!  What  is  wrong? 
(Compare  Problem  15.31c  of  Chapter  1.) 

32.  Use  a  computer  or  tables  to  evaluate  the  integral  in  (3.2)  and  verify  that  the  answer 
is  equivalent  to  the  text  answer.  Hint:  See  Problem  1.4  and  also  Chapter  2,  Sections 
15  and  17. 

33.  Verify  that  (3.10)  gives  the  same  result  as  (3.8). 
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4.  CHANGE  OF  VARIABLES  IN  INTEGRALS;  JACOBIANS 

In  many  applied  problems,  it  is  more  convenient  to  use  other  coordinate  systems 
instead  of  the  rectangular  coordinates  we  have  been  using.  For  example,  in  the  plane 
we  often  use  polar  coordinates,  and  in  three  dimensions  we  often  use  cylindrical 
coordinates  or  spherical  coordinates.  It  is  important  to  know  how  to  set  up  multiple 
integrals  directly  in  these  coordinate  systems  which  occur  so  frequently  in  practice. 
That  is,  we  need  to  know  what  the  area,  volume,  and  arc  length  elements  are, 
what  the  variables  r,  0 ,  etc.,  mean  geometrically,  and  how  they  are  related  to  the 
rectangular  coordinates.  We  are  going  to  discuss  finding  elements  of  area,  etc., 
geometrically  for  several  important  coordinate  systems.  However,  if  we  are  given 
equations  like  x  =  r  cos  0,  y  =  r  sin  0,  relating  new  variables  to  the  rectangular  ones, 
it  is  useful  to  know  how  to  find  the  elements  of  area,  etc.,  algebraically ,  without 
having  to  rely  on  the  geometry.  We  are  going  to  discuss  this  and  illustrate  it 
by  verifying  the  results  which  we  can  get  geometrically  for  several  of  the  familiar 
coordinate  systems. 

In  the  plane,  the  polar  coordinates  r,  0  are  related  to  the  rectangular  coordinates 
x,  y  by  the  equations 


(4.1) 


x  =  rcos0, 
y  =  rsin0. 


Recall  that  we  found  the  area  element  dydx  by  drawing  a  grid  of  lines  x  =  const., 
y  =  const.,  which  cut  the  plane  into  little  rectangles  dx  by  dy\  the  area  of  one 
rectangle  was  then  dy  dx.  We  can  make  a  similar  construction  for  polar  coordinates 
by  drawing  lines  0  =  const,  and  circles  r  =  const.;  we  then  obtain  the  grid  shown 
in  Figure  4.1.  Observe  that  the  sides  of  the  area  element  are  not  dr  and  d0,  but  dr 
and  r  d0,  and  its  area  is  then 


(4.2) 


cL4  =  dr  •  r  dO  =  r  dr  d8. 
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Similarly,  we  can  see  from  Figure  4.2  that  the  arc  length  element  ds  is  given  by 


(4.3) 


ds2  =  dr2  +  r 2  dd2, 

*=V',(s)2+r2<ie=^1+’'2(S)  *■ 


Example  1.  Given  a  semicircular  sheet  of  material  of  radius  a  and  constant  density  p, 
find 


(a)  the  centroid  of  the  semicircular  area; 

(b)  the  moment  of  inertia  of  the  sheet  of  material  about  the  diameter  forming  the 
straight  side  of  the  semicircle. 

y 

(a)  In  Figure  4.3,  we  see  by  symmetry  that  y  =  0.  We  want 
to  find  x.  By  (3.4),  we  have 


xr  dr  dd. 

Changing  the  x  to  polar  coordinates  and  putting  in  the  limits, 
we  get 

ra  r- tt/2  r-a  /.tt/2  Figure  4.3 

x  j  /  r  drdd  =  /  /  r  cos  6  r  dr  dd. 

Jr=0  J0--TV/2  Jr= 0  J6=-n/2 

We  calculate  the  integrals  and  find  x: 


xr  dr  dd  = 


_  a2  a3  .  n 

x  —  7r  —  —  sin  d 
2  3 


tt/2 

-7T/2 


X  = 


4a 

37T 


(b)  We  want  the  moment  of  inertia  about  the  y  axis  in  Figure  4.3;  by  definition 
this  is  /  x2dM.  In  polar  coordinates,  dM  =  pdA  =  prdrdd.  We  are  given  that 
the  density  p  is  constant.  Then  we  have 


tt/2 


r2  cos2  dr  dr  dd  =  p - . 


Iy=p  x2r  dr  dd  =  p 


r— 0 


0  —  —  7T  /  2 
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The  mass  of  the  semicircular  object  is 


M  =  p 


r  dr  dd 


p  f  (  1  rdrd9  =  p—. 
Jr= 0  J9=-n/ 2  2 


We  write  Iy  in  terms  of  M  to  get 

2 M  7T a4  Ma2 

y  =  =  I-' 

Spherical  and  Cylindrical  Coordinates  The  two  most  important  coordinate 
systems  (besides  rectangular)  in  three  dimensions  are  the  spherical  and  the  cylin¬ 
drical  coordinate  systems.  Figures  4.4  and  4.5  and  equations  (4.4)  and  (4.5)  show 
the  geometrical  meaning  of  the  variables,  their  algebraic  relation  to  x,  y ,  z,  the 
appearance  of  the  volume  elements,  and  the  formulas  for  the  volume,  arc  length, 
and  surface  area  elements. 

Cylindrical  coordinates  are  just  polar  coordinates  in  the  (x,  y)  plane  with  2  for 
the  third  variable.  Note  that  the  spherical  coordinates  r  and  9  in  Figure  4.5  are 
different  from  the  cylindrical  or  polar  coordinates  r  and  9  in  Figures  4.4  and  4.1. 
Since  we  seldom  use  both  systems  in  the  same  problem,  this  should  cause  no  confu¬ 
sion.  (If  necessary,  use  p  or  R  for  one  of  the  r’s  and  use  (f>  instead  of  9  in  cylindrical 
coordinates.)  Watch  out,  however,  for  the  discrepancy  in  notation  for  spherical 
coordinates  in  various  texts.  Most  calculus  books  interchange  9  and  <f>.  This  can  be 
confusing  later  since  the  notation  of  Figure  4.5  is  almost  universal  in  applications 
to  the  physical  sciences,  and  is  often  used  in  advanced  mathematics  (partial  differ¬ 
ential  equations,  special  functions),  in  computer  programs,  and  in  reference  books 
of  formulas  and  tables.  You  will  need  to  learn  a  number  of  useful  formulas  involving 
spherical  coordinates  [for  example,  (4.7),  (4.19),  and  (4.20)  below;  also  see  Chapter 
10,  Section  9  and  Chapter  13,  Section  7].  It  is  best  to  learn  these  formulas  in  the 
notation  that  you  will  use  in  applications. 


(4.4)  Cylindrical  coordinates: 
x  =  r  cos  9 
y  =  r  sin  9 
z  =  z 

dV  =  r  dr  d9  dz 

ds 2  =  dr2  +  r2  d92  +  dz 2 

dA  =  ad9  dz 


X 


Figure  4.4 
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We  will  need  the  volume  and  surface  area  elements  in  these  two  systems  [and 
also  the  arc  length  elements — see  equations  (4.18)  and  (4.19)].  To  find  the  polar  co¬ 
ordinate  area  element  in  Figure  4.1,  we  drew  a  grid  of  curves  r  =  const.,  9  =  const. 
In  three  dimensions  we  want  to  draw  a  grid  of  surfaces.  In  cylindrical  coordinates 
these  surfaces  are  the  cylinders  r  =  const.,  the  half-planes  9  =  const,  (through  the 
z  axis),  and  the  planes  z  =  const,  [parallel  to  the  (a;,  y )  plane].  One  of  the  elements 
formed  by  this  grid  of  surfaces  is  sketched  in  Figure  4.4.  From  the  geometry  we  see 
that  three  edges  of  the  element  are  dr,  r  dd,  and  dz,  giving  the  volume  element 


(4.6)  dV  =  rdrdOdz  (cylindrical  coordinates) . 


If  r  is  constant,  then  the  surface  area  element  on  the  cylinder  r  =  a  has  edges  add 
and  dz,  so  dA  =  adddz.  Similarly  for  the  spherical  coordinate  case,  we  draw  the 
spheres  r  =  const.,  the  cones  9  =  const.,  and  the  half-planes  <j>  =  const.  The  volume 
elements  formed  by  this  grid  (Figure  4.5)  have  edges  dr,  rdd,  and  r  sin  dO  dtp;  thus 
we  get 


(4.7)  dV  =  r2  sin  9  dr  dO  d<f>  (spherical  coordinates) . 


If  r  is  constant,  then  the  surface  area  element  on  the  sphere  r  =  a  has  edges  add 
and  a  sin  9  d(f>,  so  dA  =  a2  sin  9  d9  d<f>. 

Jacobians.  For  polar,  cylindrical,  and  spherical  coordinates,  we  have  seen  how  to 
find  area  and  volume  elements  from  the  geometry.  However,  it  is  convenient  to  know 
an  algebraic  way  of  finding  them  which  we  can  use  for  unfamiliar  coordinate  systems 
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(Problems  16  and  17)  or  for  any  change  of  variables  in  a  multiple  integral  (Problems 
19  and  20).  Here  we  state  without  proof  (see  Chapter  6,  Section  3,  Example  2)  some 
theorems  which  tell  us  how  to  do  this.  First,  in  two  dimensions,  suppose  x  and  y 
are  given  as  functions  of  two  new  variables  s  and  t.  The  Jacobian  of  x,  y ,  with 
respect  to  s,  t,  is  the  determinant  in  (4.8)  below;  we  also  show  abbreviations  used 
for  it. 


(4.9)  dA=\J\dsdt 


where  |  J|  is  the  absolute  value  of  the  Jacobian  in  (4.8). 

Let  us  find  the  Jacobian  of  x,  y  with  respect  to  the  polar  coordinates  r,  9,  and 
thus  verify  that  (4.8)  and  our  geometric  method  give  the  same  result  (4.2)  for  the 
polar  coordinate  area  element.  We  have 


dx  dx 

cos  9  —  rsinf? 

d(x7  y) 

dr 

d9 

9(r,  9) 

dy 

dr 

dy 

d9 

sin  9  r  cos  9 

Thus  by  (4.9)  the  area  element  is  rdrdd  as  in  (4.2). 

The  use  of  Jacobians  extends  to  more  variables.  Also,  it  is  not  necessary  to  start 
with  rectangular  coordinates;  let  us  state  a  more  general  theorem. 


Suppose  we  have  a  triple  integral 


(4.11) 


/(u,  v,  w)  du  dv  dw 


in  some  set  of  variables  u7  v,  w.  Let  r,  s,  t  be  another  set  of  variables,  related  to 
it,  v,  w  by  given  equations 

u=u(r,s,t)7  v  —  v(r,s,t),  w  =  w(r,s,t). 
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Then  if  the  determinant 

du 

du 

du 

dr 

ds 

~dt 

(4.i2)  = 

d(r,s,  t) 

dv 

dr 

dv 

ds 

dv 

~dt 

dw 

dw 

dw 

dr 

ds 

~dt 

is  the  Jacobian  of  u,  v,  w  with  respect  to  r, 

s,  t,  then  the  triple  integral  in  the 

new  variables  is 

(413) 

■  dr  ds  dt, 

where,  of  course,  /  and  J  must  both  be  expressed 

in  terms  of  r,  s,  t,  and  the 

limits  must  be  properly  adjusted  to  correspond  to  the  new  variables. 

We  can  use  (4.12)  to  verify  the  volume  element  (4.6)  for  cylindrical  coordinates 
(Problem  15)  and  the  volume  element  (4.7)  for  spherical  coordinates.  Let  us  do  the 
calculation  for  spherical  coordinates.  From  (4.5),  we  have 


(4.14) 


d{x,y,z) 
d(r,  9, 4>) 


dx 

dx 

dx 

dr 

~d9 

d(j) 

sin  9  cos 

4> 

r  cos  9  cos  <fi 

—r  sin  9  sin  (f) 

dy 

dr 

dy 

d9 

dy 

dcj) 

= 

sin  (9  sin 

<S> 

r  cos  9  sin  (f> 

r  sin  9  cos 

dz 

dz 

dz 

COS0 

— r  sin  9 

0 

dr 

d9 

d(j> 

r2  sin  6\—  sin2  </>(—  sin2  9  —  cos2  9)  —  cos2  </>(—  sin2  9  —  cos2  9)) 
r2  sin  9. 


Thus  the  spherical  coordinate  volume  element  is  dV  =  r 2  sin  9  dr  d9  dcj)  as  in  (4.7). 


Example  2.  Find  the  z  coordinate  of  the  centroid  of  a  uniform  solid  cone  (part  of  one 
nappe)  of  height  h  equal  to  the  radius  of  the  base  r.  Also  find  the  moment  of  inertia 
of  the  solid  about  its  axis. 
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If  we  take  the  cone  as  shown  in  Figure  4.6,  its 
equation  in  cylindrical  coordinates  is  r  =  z,  since 
at  any  height  z,  the  cross  section  is  a  circle  of 
radius  equal  to  the  height.  To  find  the  mass  we 
must  integrate  dM  =  prdrdddz,  where  p  is  the 
constant  density.  The  limits  of  integration  are 


6  :  0  to  27t, 


Then  we  have 


r  :  0  to  z, 


0  to  h. 


m=  f pdv = p  fh  r  j 

J  J  z— 0  J  r=0  J  0 


z=0  Jr= 0  J  6=0 


:dV  = 


zdV  = 

ph 


zr  dr  dd  dz 


(4.15) 


=  27 r 


/  o 


Jz= 0  Jr= 0  J0= 0 

1  ,  7r/l4 

z  ■  —z  dz  —  — — , 

2  4  ’ 


7 rh3 


irhA 


z=  jh. 

4 

For  the  moment  of  inertia  about  the  z  axis  we  have 

rh  rz  r2TT  rh  Trh^ 

I  =  p  /  /  /  r2r  dr  dd  dz  —  p  ■  2n  /  —  dz  =  p—ppp- 

I  z= 0  J  r— 0  J  0= 


9=0 


/0 


10 


Using  the  value  of  M  from  (4.15),  we  write  I  in  the  usual  form  as  a  multiple  of  M : 

T  3Mnh5  3 

/  =  — r- - =  — Mh. 

7 rh3  10  10 

In  the  following  examples  and  problems,  note  that  we  use  sphere  (r  =  a)  to 
mean  surface  area,  and  ball  (r  <  a)  to  mean  volume  (just  as  we  use  circle  to  mean 
circumference  and  disk  to  mean  area). 


Example  3.  Find  the  moment  of  inertia  of  a  solid  ball  of  radius  a  about  a  diameter.  In 
spherical  coordinates,  the  equation  of  the  ball  is  r  <  a.  Then  the  mass  is 

//•  27T  /'•7T  fid 

dV  =  P  /  /  f2  sin  d  dr  dd  d(f> 

Jc/>=o  Je= 0  7r= o 

a3  4  3 

=  P y  '  2  •  2tt  =  -7TO  p. 

(to  no  one’s  surprise!).  The  moment  of  inertia  about  the  z  axis  is 

p2tt  p  7T 


n  (‘2'k  /*7T  na 

1=  (x2  +  y2)dM  =  p  /  / 


(r2  sin2  d)  r2  sin  d  dr  dd  d4> 
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or,  using  the  value  of  M,  we  get 


(4.17) 


Example  4.  Find  the  moment  of  inertia  about  the  z  axis  of  the  solid  ellipsoid  inside 


2  2  2 

£  r  £  =  1 

a2  b2  c2 

We  want  to  evaluate 

M  =  p  JJJ  dx  dy  dz  and  I  =  p  JJJ ( x 2  +  y 2)  dx  dy  dz 

where  the  triple  integrals  are  over  the  volume  of  the  ellipsoid.  Make  the  change 
of  variables  x  =  ax',  y  =  by' ,  z  =  cz'\  then  x'2  +  y'2  +  z'2  =  1,  so  in  the  primed 
variables  we  integrate  over  the  volume  of  a  ball  of  radius  1.  Then 

M  =  p  abc  JJJ  dx'  dy'  dz'  =  p  abc  ■  volume  of  ball  of  radius  1. 

Using  (4.16),  we  have 

4,4 

M  =  p  abc  ■  — 7T  •  1 J  =  —irp  abc. 

3  3 


I  =  pabc  III  (a2x'2  +  b2y'2)  dV' 


Similarly,  we  find 


where  the  triple  integral  is  over  the  volume  of  a  ball  of  radius  1.  Now,  by  symmetry, 

1 


x  dV  =  y'  dV’  =  z'  dV'  = 


r" dV' 


where  r'2  =  x'2  +  y'2  +  z'2 ,  and  we  are  integrating  over  the  volume  inside  the  sphere 
r'  =  1.  Let  us  use  spherical  coordinates  in  the  primed  system.  Then 

/27T  /*7T  pi 

/  /  r,2(r'2  sin  9'  dr1  dd'  dtp1) 

,=o  J  e=o  Jr= o 


,  f  /4  ,  /  4-7T 

=  47 t  r  dr  =  — . 

,/n  5 


Thus, 


/  =  pabc 


x'  dV'  +  b2 


y'  dV' 


=  p abc{a2  +  62)i  • 

3  5 


I  =  M(a2  +  b2). 
5 


or,  in  terms  of  M 
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In  order  to  find  arc  lengths  using  spherical  or  cylindrical  coordinates,  we  need 
the  arc  length  element  ds.  Recall  that  we  found  the  polar  coordinate  arc  length 
element  ds  (Figure  4.2)  as  the  hypotenuse  of  the  right  triangle  with  sides  dr  and 
rdd.  From  Figure  4.1  you  can  see  that  ds  can  also  be  thought  of  as  a  diagonal  of 
the  area  element.  Similarly,  in  cylindrical  and  spherical  coordinates  (see  Figures 
4.4  and  4.5),  the  arc  length  element  ds  is  a  space  diagonal  of  the  volume  element. 
In  cylindrical  coordinates  (4.4),  the  sides  of  the  volume  element  are  dr,  rdd,  dz ,  so 
the  arc  length  element  is  given  by 


(4.18)  ds2  =  dr2  +  r2  dd2  +  dz2  (cylindrical  coordinates) . 


In  spherical  coordinates  (4.5),  the  sides  of  the  volume  element  are  dr,  r  dd,  r  sin  d  d(f>, 
so  the  arc  length  element  is  given  by 


(4.19)  ds2  =  dr2  +  r2  dd2  +  r2  sin2  d  dcj>2  (spherical  coordinates). 


It  is  also  convenient  to  be  able  to  find  arc  lengths  algebraically.  Let  us  do  this 
for  polar  coordinates;  the  same  method  can  be  used  in  three  dimensions.  From  (4.1) 
we  have 


dx  =  cos  d  dr  —  r  sin  6  dd, 
dy  =  sin  d  dr  +  r  cos  d  dd. 

Squaring  and  adding  these  two  equations,  we  get 

ds2  =  dx2  +  dy 2 

=  (cos2  d  +  sin2  d)  dr2  +  0  ■  dr  dd  +  r2  (sin2  d  +  cos2  d)  dd2 
=  dr2  +  r2  dd2 

as  in  (4.3).  Using  the  same  method  for  cylindrical  and  spherical  coordinates  (Prob¬ 
lem  21)  you  can  verify  equations  (4.18)  and  (4.19). 

Example  5.  Express  the  velocity  of  a  moving  particle  in  spherical  coordinates. 

If  s  represents  the  distance  the  particle  has  moved  along  some  path,  then  ds/dt 
is  the  velocity  of  the  particle.  Dividing  (4.19)  by  dt2,  we  find  for  the  square  of  the 
velocity 


(4.20) 

1,2  =(f)  =  (f)  +,'2(f)  +’'2sin2"(^)  (“P*1™1  ®ordi„ates). 


We  have  just  seen  how  to  find  the  arc  length  element  ds  in  polar  coordinates 
(or  other  systems)  by  calculating  yj dx2  +  dy2.  You  might  be  tempted  to  try  to 


Section  4 


Change  of  Variables  in  Integrals;  Jacobians  267 


find  the  area  element  by  computing  dx  dy ,  but  you  would  discover  that  this  does 
not  work  —we  must  use  the  Jacobian  [or  geometry  as  in  (4.2)]  to  get  volume  or 
area  elements.  You  can  see  why  by  looking  at  Figure  4.1.  The  element  of  area 
r  dr  dQ  at  the  point  ( x ,  y )  is  not  the  same  as  the  element  of  area  dx  dy  at  that  point. 
Then  consider  Figure  4.2;  the  element  of  arc  ds  is  the  hypotenuse  of  the  triangle 
with  legs  dr  and  rdd ,  and  it  is  also  the  hypotenuse  of  the  triangle  with  legs  dx  and 
dy.  Thus  ds  is  the  same  element  for  both  it,  y  and  r,  9  and  this  is  why  we  can 
compute  ds  in  polar  coordinates  by  calculating  \J dx2  +  dy2 .  These  comments  hold 
for  other  coordinate  systems,  too.  We  can  always  find  ds  by  computing  \J dx2  +  dy2 
or  \J dx 2  +  dy2  +  dz2,  but  we  cannot  compute  area  or  volume  elements  directly  from 
the  rectangular  ones — we  must  use  the  Jacobian  or  else  geometrical  methods. 

PROBLEMS,  SECTION  4 

As  needed,  use  a  computer  to  plot  graphs  of  figures  and  to  check  values  of  integrals. 

1.  For  the  disk  r  <  a,  find  by  integration  using  polar  coordinates: 

(a)  the  area  of  the  disk; 

(b)  the  centroid  of  one  quadrant  of  the  disk; 

(c)  the  moment  of  inertia  of  the  disk  about  a  diameter; 

(d)  the  circumference  of  the  circle  r  =  a; 

(e)  the  centroid  of  a  quarter  circle  arc. 

2.  Using  polar  coordinates: 

(a)  Show  that  the  equation  of  the  circle  sketched  is 
r  =  2 a  cos  9.  Hint:  Use  the  right  triangle  OPQ. 

(b)  By  integration,  find  the  area  of  the  disk 
r  <2  a  cos  9. 

(c)  Find  the  centroid  of  the  area  of  the  first  quadrant 
half  disk. 

(d)  Find  the  moments  of  inertia  of  the  disk  about  each  of  the  three  coordinate 
axes,  assuming  constant  area  density. 

(e)  Find  the  length  and  the  centroid  of  the  semicircular  arc  in  the  first  quadrant. 

(f)  Find  the  center  of  mass  and  the  moments  of  inertia  of  the  disk  if  the  density 
is  r. 

(g)  Find  the  area  common  to  the  disk  sketched  and  the  disk  r  <  a. 

3.  (a)  Find  the  moment  of  inertia  of  a  circular  disk  (uniform  density)  about  an  axis 

through  its  center  and  perpendicular  to  the  plane  of  the  disk. 

(b)  Find  the  moment  of  inertia  of  a  solid  right  circular  cylinder  (uniform  density) 
about  its  axis. 

(c)  Do  (a)  using  Problem  lc  and  the  perpendicular  axis  theorem  (Section  3,  Ex¬ 
ample  If). 

For  the  sphere  r  =  a,  find  by  integration: 

(a)  its  surface  area; 


4. 
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5. 


6. 

7. 


8. 

9. 

10. 


11. 


12. 


13. 


14. 


15. 


(b)  the  centroid  of  the  curved  surface  area  of  a  hemisphere; 

(c)  the  moment  of  inertia  of  the  whole  spherical  shell  (that  is,  surface  area)  about 
a  diameter  (assuming  constant  area  density); 

(d)  the  volume  of  the  ball  r  <  a; 

(e)  the  centroid  of  a  solid  half  ball. 

(a)  Write  a  triple  integral  in  spherical  coordinates  for  the  volume  inside  the  cone 
z2  =  x2  +  y 2  and  between  the  planes  z  =  1  and  z  =  2.  Evaluate  the  integral. 

(b)  Do  (a)  in  cylindrical  coordinates. 

Find  the  mass  of  the  solid  in  Problem  5  if  the  density  is  ( x 2  +  y2  +  z2)-1 .  Check 

your  work  by  doing  the  problem  in  both  spherical  and  cylindrical  coordinates. 

(a)  Using  spherical  coordinates,  find  the  volume  cut  from  the  ball  r  <  a  by  the 
cone  8  =  a  <  7r/2. 

(b)  Show  that  the  z  coordinate  of  the  centroid  of  the  volume  in  (a)  is  given  by  the 
formula  z  =  3a(l  +  cos  a)/8. 

For  the  solid  in  Problem  7,  find  Iz/M  if  a  =  7t/3  and  the  density  is  constant. 

Let  the  solid  in  Problem  7  have  density  =  cos  8.  Show  that  then  Iz  =  j^Ma2  sin2  a. 

(a)  Find  the  volume  inside  the  cone  3z2  =  x2  +  y2,  above  the  plane  z  =  2  and 
inside  the  sphere  x2  +  y2  +  z2  =  36.  Hint:  Use  spherical  coordinates. 

(b)  Find  the  centroid  of  the  volume  in  (a). 

Write  a  triple  integral  in  cylindrical  coordinates  for  the  volume  inside  the  cylinder 

x2  +  y2  =  4  and  between  z  =  2x2  +  y2  and  the  ( x ,  y)  plane.  Evaluate  the  integral. 

(a)  Write  a  triple  integral  in  cylindrical  coordinates  for  the  volume  of  the  solid 
cut  from  a  ball  of  radius  2  by  a  cylinder  of  radius  1,  one  of  whose  rulings  is  a 
diameter  of  the  ball.  Hint:  Take  the  axis  of  the  cylinder  parallel  to  the  z  axis; 
a  cross  section  of  the  cylinder  then  looks  like  the  figure  in  Problem  2. 

(b)  Write  a  triple  integral  for  the  moment  of  inertia  about  the  z  axis  of  a  uniform 
solid  occupying  this  volume. 

(c)  Evaluate  the  integrals  in  (a)  and  (b),  and  find  7  as  a  multiple  of  the  mass. 

(a)  Write  a  triple  integral  in  cylindrical  coordinates  for  the  volume  of  the  part  of 
a  ball  between  two  parallel  planes  which  intersect  the  ball. 

(b)  Evaluate  the  integral  in  (a).  Warning  hint:  Do  the  r  and  8  integrals  first. 

(c)  Find  the  centroid  of  this  volume. 

Express  the  integral 


as  an  integral  in  polar  coordinates  (r,  8)  and  so  evaluate  it. 
Find  the  cylindrical  coordinate  volume  element  by  Jacobians. 


Find  the  Jacobians  d(x,  y)/d(u,  v)  of  the  given  transformations  from  variables  x,  y  to 
variables  u,  v. 

!6.  x  =  i(u2  -v2), 
y  =  uv, 


(■ u  and  v  are  called  parabolic  cylinder  coordinates). 
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17.  x  =  a  cosh  u  cos  v, 

y  =  asinh-it  sinv,  (u  and  v  are  called  elliptic  cylinder  coordinates). 

18.  Prove  the  following  theorems  about  Jacobians. 

d(u,v)  d(x,y)  = 
d{x,y)d(u,v) 
d(x,y)  d(u,v)  _  d(x,y) 
d(u,  v)  d(s,  t)  d(s,  t)  ' 


Hint:  Multiply  the  determinants  (as  you  would  matrices)  and  show  that  each  ele¬ 
ment  in  the  product  determinant  can  be  written  as  a  single  partial  derivative.  Also 
see  Chapter  4,  Section  7. 


19.  In  the  integral 

r  oo  pc 

I  = 

Jo  Jo 

make  the  change  of  variables 


2  i  2 

x  +y 


1  +  (x2  —  y2)2 


e  2xy  dx  dy 


2  2 
u  =  x  —  y 

v  =  2  xy 


and  evaluate  I.  Hint:  Use  (4.8)  and  the  accompanying  discussion. 

20.  In  the  integral 


make  the  change  of  variables 

*=  |(r-s), 

1  . 

y=  2  (r  +  s)i 

and  evaluate  I.  Hints:  See  Problem  19.  To  find  the  r  and  s  limits,  sketch  the  area 
of  integration  in  the  (x,  y)  plane  and  sketch  the  r  and  s  axes.  Then  show  that  to 
cover  the  same  integration  area,  you  may  take  the  r  and  s  limits  to  be:  s  from  0  to 
r,  r  from  0  to  1. 

21.  Verify  equations  (4.18)  and  (4.19). 

22.  Use  equation  (4.18)  to  set  up  an  integral  for  the  length  of  wire  required  to  wind  a 
coil  spirally  about  a  cylinder  of  radius  1  in.,  and  length  1  ft,  if  there  are  three  turns 
per  inch. 

23.  A  loxodrome  or  rhumb  line  is  a  curve  on  the  earth’s  surface  along  which  a  ship  sails 
without  changing  its  course,  that  is,  such  that  it  crosses  the  meridians  at  a  constant 
angle  a.  Show  that  then  tana  =  sin 9d(f)/d0  ( 9  and  <j>  are  spherical  coordinates). 
Use  (4.19)  to  set  up  an  integral  for  the  distance  traveled  by  a  ship  along  a  rhumb 
line.  Show  that  although  a  rhumb  line  winds  infinitely  many  times  around  either 
the  north  or  the  south  pole,  its  total  length  is  finite. 

24.  Compute  the  gravitational  attraction  on  a  unit  mass  at  the  origin  due  to  the  mass 
(of  constant  density)  occupying  the  volume  inside  the  sphere  r  =  2 a  and  above  the 
plane  z  =  a.  Hint:  The  magnitude  of  the  gravitational  force  on  the  unit  mass  due  to 
the  element  of  mass  dM  at  (r,  9,  <j>)  is  ( G/r2)dM .  You  want  the  z  component  of  this 
since  the  other  components  of  the  total  force  are  zero  by  symmetry.  Use  spherical 
coordinates. 
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25.  The  volume  inside  a  sphere  of  radius  r  is  V  =  frrr3.  Then  dV  =  Anr2  dr  =  A  dr, 
where  A  is  the  area  of  the  sphere.  What  is  the  geometrical  meaning  of  the  fact  that 
the  derivative  of  the  volume  is  the  area?  Could  you  use  this  fact  to  find  the  volume 
formula  given  the  area  formula? 

26.  Use  the  parallel  axis  theorem  (Problem  3.1) 

(a)  and  Example  3,  to  find  the  moment  of  inertia  of  a  solid  ball  about  a  line  tangent 
to  it; 

(b)  and  Problem  3b  to  find  the  moment  of  inertia  of  a  solid  cylinder  about  a  ruling. 

27.  Use  the  spherical  coordinates  9  and  (ji  1°  find  the  area  of  a  zone  of  a  sphere  (that  is, 
the  spherical  surface  area  between  two  parallel  planes).  Hint:  See  dA  in  (4.5). 

28.  Find  the  center  of  mass  of  a  hemispherical  shell  of  constant  density  (mass  per  unit 
area)  by  using  double  integrals  and  the  area  element  dA  in  (4.5).  [Compare  your 
result  in  Problem  4(b).] 


5.  SURFACE  INTEGRALS 

In  the  preceding  sections  we  found  surface  areas,  moments  of  them,  etc.,  for  surfaces 
of  revolution.  We  now  want  to  consider  a  way  of  computing  surface  integrals  in 
general  whether  the  surface  is  a  surface  of  revolution  or  not.  Consider  a  part  of  a 
surface  as  in  Figure  5.1  and  its  projection  in  the  ( x,y )  plane.  We  assume  that  any 
line  parallel  to  the  z  axis  intersects  the  surface  only  once.  If  this  is  not  true,  we 
must  work  with  part  of  the  surface  at  a  time,  or  project  the  surface  into  a  different 
plane.  For  example,  if  the  surface  is  closed,  we  could  find  the  areas  of  the  upper 
and  lower  parts  separately.  For  a  cylinder  with  axis  parallel  to  the  z  axis  we  could 
project  the  front  and  back  parts  separately  into  the  (y,  z)  plane. 


Let  dA  (Figure  5.1)  be  an  element  of  surface  area  which  projects  onto  dxdy  in 
the  ( x ,  y)  plane  and  let  7  be  the  acute  angle  between  dA  (that  is,  the  tangent  plane 
at  cL4)  and  the  (x,y)  plane.  Then  we  have 


(5.1) 


dx  dy  =  dA  cos  7 


or 


dA  =  sec  7  dx  dy. 
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The  surface  area  is  then 
(5.2) 


sec  7  dx  dy 


where  the  limits  on  x  and  y  must  be  such  that  we  integrate  over  the  projected 
area  in  the  (x,y)  plane. 


Now  we  must  find  sec  7.  The  (acute)  angle  between  two  planes  is  the  same  as 
the  (acute)  angle  between  the  normals  to  the  planes.  If  n  is  a  unit  vector  normal 
to  the  surface  at  dA  (Figure  5.1),  then  7  is  the  (acute)  angle  between  n  and  the  z 
axis,  that  is,  between  the  vectors  n  and  k,  so  cos  7  =  |n  •  k|.  Let  the  equation  of 
the  surface  be  <f>{x,y,z)  =  const.  Recall  from  Chapter  4  just  after  equation  (9.14) 
that  the  vector 


(5.3) 


.  d(j>  .  d(j>  d(j> 

grad  </>  =  1  — +  j  — — h  k  — 
ox  ay  oz 


is  normal  to  the  surface  <j>(x,y,z)  =  const.  (Also  see  Chapter  6,  Section  6.)  Then 
n  is  a  unit  vector  in  the  direction  of  grad  </>,  so 


(5.4)  n  =  (grad  (/>)/|  grad  </>|. 

From  (5.3)  and  (5.4)  we  find 

k  k -grade/)  d<j>/dz 
|  grade/) |  |  grade/) |’ 

1  1 

sec  7  = - =  - - — , 

cos  7  |n  •  k| 


so 


(5.5) 


sec  7  = 


|  grad  0| 
\d<P/dz\ 


\dcf)/dz\ 


Often  the  equation  of  a  surface  is  given  in  the  form  2  =  f(x,y).  In  this  case 
(j)(x,  y,z)  =  z  —  f(x,  y),  so  d(f>/dz  =  1,  and  (5.5)  simplifies  to 


(5.6)  sec  7  =  \J  (d  f  /  dx)2  +  (df  /dy)2  +  1. 


We  then  substitute  (5.5)  or  (5.6)  into  (5.2)  and  integrate  to  find  the  area.  To  find 
centroids,  moments  of  inertia,  etc.,  we  insert  the  proper  factor  into  (5.2)  as  we  have 
discussed  in  Section  3. 
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Example  1.  Find  the  area  cut  from  the  upper  half  of  the  sphere 
x2  +  y2  +  z2  =  1  by  the  cylinder  x2  +  y2  —  y  =  0. 

This  is  the  same  as  the  area  on  the  sphere  which  projects 
onto  the  disk  x2  +  y2  —  y  <  0  in  the  (x,  y)  plane.  Thus  we 
want  to  integrate  (5.2)  over  the  area  of  this  disk.  Figure 
5.2  shows  the  disk  of  integration  (shaded)  and  the  equatorial 
circle  of  the  sphere  (large  circle).  We  compute  secy  from  the 
equation  of  the  sphere;  we  could  use  (5.6),  but  it  is  easier  in 
this  problem  to  use  (5.5): 


secy  = 


I  grad  0| 

\d(j)/dz\ 


4>  =  x2  +  y2  +  z2, 

^V/(2x)2  +  (22/)2  +  (2  s)2  =  i 


1 


\/i  -  x2  -  y2 


We  find  the  limits  of  integration  from  the  equation  of  the  shaded  disk,  x2+y2—y  <  0. 
Because  of  the  symmetry  we  can  integrate  over  the  first-quadrant  part  of  the  shaded 
area  and  double  our  result.  Then  the  limits  are: 


x  from  0  to  \/ y  —  y2, 
y  from  0  to  1. 


The  desired  area  is 


(5.7) 


A  =  2 


'2/=0 


dx  dy 

V1  -  x2  -  y2 


This  integral  is  simpler  in  polar  coordinates.  The  equation  of  the  cylinder  is  then 
r  =  sin0,  so  the  limits  are:  r  from  0  to  sin0,  and  6  from  0  to  7r/2.  Thus  (5.7) 
becomes 


(5.8) 


A  = 


r  dr  d6 
\/l  -  r2 


This  is  still  simpler  if  we  make  the  change  of  variable  z  =  \J\  —  r2 .  Then  dz  = 
— r dr /\/\  —  r2 ,  and  the  limits  r  =  0  to  sin#  become  2  =  1  to  cos6f  Thus  (5.8) 
becomes 

/*7t/2  /*cos  6 

(5.9)  A  =  —2  /  dzdO  =  TT-2. 

J  9—0  J  z= 1 


PROBLEMS,  SECTION  5 

For  these  problems,  the  most  important  sketch  is  the  projection  in  the  plane  of  integration, 
which  is  easy  to  do  by  hand.  However,  you  might  like  to  use  your  computer  to  plot  the 
corresponding  3  dimensional  picture. 

1.  Find  the  area  of  the  plane  x  —  2y  +  5z  =  13  cut  out  by  the  cylinder  x2  +  y2  =  9. 

2.  Find  the  surface  area  cut  from  the  cone  2x2  +  2 y2  =  5 z2,  z  >  0,  by  the  cylinder 
x2  +  y2  =  2  y. 

Find  the  area  of  the  paraboloid  x2  +  y2  =  z  inside  the  cylinder  x2  +  y2  =  9. 


3. 


Section  6 


Miscellaneous  Problems  273 


4.  Find  the  area  of  the  part  of  the  cone  2 z2  =  x2  +  y2  in  the  first  octant  cut  out  by 

the  planes  y  =  0,  and  y  =  x/y/H,  and  the  cylinder  x2  +  y2  =  4. 

5.  Find  the  area  of  the  part  of  the  cone  z2  =  3(a:2  +  y2)  which  is  inside  the  sphere 
x2  +y2  +z2  =  16. 

6.  In  Example  1,  find  the  area  of  the  cylinder  inside  the  sphere. 

7.  Find  the  area  of  the  part  of  the  cylinder  y2  +  z2  =  4  in  the  first  octant,  cut  out  by 

the  planes  x  =  0  and  y  =  x. 

8.  Find  the  area  of  the  part  of  the  cylinder  z  =  x  +  y2  that  lies  below  the  second- 
quadrant  area  bounded  by  the  x  axis,  x  =  —1,  and  y2  =  —x. 

9.  Find  the  area  of  the  part  of  the  cone  x2+y2  =  z2  that  is  over  the  disk  (x—  l)2+y2  <  1. 

10.  Find  the  area  of  the  part  of  the  sphere  of  radius  a  and  center  at  the  origin  which  is 
above  the  square  in  the  ( x,y )  plane  bounded  by  x  =  ±a/%/ 2  and  y  =  ±a/y/2.  Hint 
for  evaluating  the  integral:  Change  to  polar  coordinates  and  evaluate  the  r  integral 
first. 

11.  The  part  of  the  plane  x  +  y  +  z  =  1  which  is  in  the  first  octant  is  a  triangular  area 
(sketch  it).  Find  the  area  and  its  centroid  by  integration.  You  might  like  to  check 
your  work  by  geometry. 

12.  In  Problem  11,  let  the  triangle  have  a  density  (mass  per  unit  area)  equal  to  x.  Find 
the  total  mass  and  the  coordinates  of  the  center  of  mass. 

13.  For  the  area  of  Example  1,  find  the  2  coordinate  of  the  centroid. 

14.  For  the  area  in  Example  1,  let  the  mass  per  unit  area  be  equal  to  \x\.  Find  the  total 
mass. 

15.  For  a  uniform  mass  distribution  over  the  area  of  Example  1,  find  the  moment  of 
inertia  about  the  z  axis. 

16.  Find  the  centroid  of  the  surface  area  in  Problem  2. 


6.  MISCELLANEOUS  PROBLEMS 

As  needed,  use  a  computer  to  plot  graphs  and  to  check  values  of  integrals. 

1.  Find  the  volume  inside  the  cone  z2  =  x2  +  y2 ,  above  the  ( x,y )  plane,  and  between 
the  spheres  x2  +  y2  +  z2  =  1  and  x2  +  y2  +  z2  =  4.  Hint:  Use  spherical  coordinates. 

2.  Find  the  z  coordinate  of  the  centroid  of  the  volume  in  Problem  1. 

3.  Find  the  mass  of  the  solid  in  Problem  1  if  the  density  is  equal  to  z. 

4.  Find  the  moment  of  inertia  of  a  hoop  (wire  bent  to  form  a  circle  of  radius  R) 

(a)  about  a  diameter; 

(b)  about  a  tangent  line. 

5.  The  rectangle  in  the  figure  has  sides  2 a  and  26;  the  curve 
is  an  ellipse.  If  the  figure  is  rotated  about  the  dotted 
line  it  generates  three  solids  of  revolution:  a  cone,  an  2b 
ellipsoid,  and  a  cylinder.  Show  that  the  volumes  are  in  the 
ratio  1:2:3.  (See  L.H.  Lange,  American  Mathematical 
Monthly  vol.  88  (1981),  p.  339.) 
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6.  (a)  Find  the  area  inside  the  circle  r  =  2,  with  x  >  0  and  y  >  1. 

(b)  Find  the  centroid  of  the  area  in  (a). 

7.  For  a  lamina  of  density  1  in  the  shape  of  the  area  of  Problem  6,  find  its  moment  of 
inertia  about  the  z  axis. 

8.  For  the  solid  bounded  above  by  the  sphere  x2  +y2  +  z2  =4  and  below  by  a  horizontal 
plane  through  (0,  0, 1),  find 

(a)  the  volume  (see  Problem  6  and  Problem  3.12); 

(b)  the  z  coordinate  of  the  centroid  (use  cylindrical  coordinates). 

9.  Find  the  centroid  of  the  area  above  y  =  x2  and  below  y  =  c(c  >  0). 

10.  (a)  Find  the  centroid  of  the  area  between  the  x  axis  and  one  arch  of  y  =  sin  a;. 

(b)  Find  the  volume  formed  if  the  area  in  (a)  is  rotated  about  the  x  axis. 

(c)  Find  Ix  of  a  mass  of  constant  density  occupying  the  volume  in  (b) . 

11.  Show  that  the  z  coordinate  of  the  centroid  of  the  volume  inside  the  elliptic  cone 

^  =  J  +  0  <  z  <  h,  is  z=\h. 

(Note  that  the  result  is  independent  of  a  and  b.)  Hint:  To  evaluate  the  triple 
integrals,  let  z  =  hz' ,  x  =  ax' ,  y  =  by' ,  and  then  change  to  cylindrical  coordinates 
in  the  primed  system  (see  Example  4,  Section  4).  Compare  Example  2,  Section  4. 

12.  Find  the  mass  of  the  solid  inside  the  ellipsoid 


if  the  density  is  \xyz\.  Hint:  Evaluate  the  triple  integral  as  in  Example  4,  Section  4. 

13.  Find  the  surface  area  of  the  part  of  the  cylinder  x2  +  z2  =  a2  inside  the  cylinder 
x2  +  y2  =  a2.  Use  your  computer  to  graph  the  two  cylinders  on  the  same  axes. 

14.  Find  the  volume  that  is  inside  both  cylinders  in  Problem  13. 

15.  Find  Ix  and  Iy  for  a  mass  distribution  of  constant  density  occupying  the  solid  in 
Problem  14.  Hint:  Do  the  x  integration  last. 

16.  Find  the  centroid  of  the  first  quadrant  part  of  the  arc  x 2^3  +  y2^3  =  a2^3 .  Hint:  Let 
x  =  a  cos3  9,  y  =  a  sin3  6. 

17.  Find  the  moment  of  inertia  about  a  diagonal  of  a  framework  consisting  of  the  four 
sides  of  a  square  of  side  a. 

18.  Find  the  center  of  mass  of  the  solid  right  circular  cone  inside  r2  =  z2,  0  <  z  <  h,  if 
the  density  is  r2  =  x2  +  y2.  Use  cylindrical  coordinates. 

19.  For  the  cone  in  Problem  18,  find  Ix/M,  Iy/M,  Iz/M.  Also  find  I/M  about  a  line 
through  the  center  of  mass  parallel  to  the  x  axis. 

20.  (a)  Find  the  area  of  the  surface  z  =  1  +  x2  +  y2  inside  the  cylinder  x2  +  y2  =  1. 

(b)  Find  the  volume  inside  the  cylinder  between  the  surface  and  the  (x,  y)  plane. 

Use  cylindrical  coordinates. 

21.  Find  the  gravitational  attraction  on  a  unit  mass  at  the  origin  due  to  a  mass  (of 
constant  density)  occupying  the  volume  inside  the  cone  z2  =  x2  +  y2 ,  0  <  z  <  h. 
See  Problem  4.24. 
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22.  Find  Ix/M,  Iy/M,  Iz/M,  for  a  lamina  in  the  shape  of  an  ellipse  (x2 /a2)+(y2 /b2)  =  1. 
Hint:  See  Problem  11. 

23.  (a)  Find  the  centroid  of  the  solid  paraboloid  inside  z  =  x2  +  y2 ,  0  <  z  <  c. 


(b)  Repeat  part  (a)  if  the  density  is  p  =  r  =  \J  x2  +  y2. 

24.  Repeat  Problem  23a  for  the  paraboloid 

2  2 
z  x  y 

~  =  -o+  To-  0  <  2  <  c. 

c  a 2  tr 

25.  By  changing  to  polar  coordinates,  evaluate 


poo  no 

Jo  Jo 


a-\/x2+y2 


dx  dy. 


26.  Make  the  change  of  variables  u  =  x  —  y,  v  =  x  +  y,  to  evaluate  the  integral 


r 1  /■!-» 

dV  « 
Jo  Jo 


0(x-y)/(x+y) 


dx. 


27.  Make  the  change  of  variables  u  =  y/x,  v  =  x  +  y,  to  evaluate  the  integral 

(x  +  y)ex+y 


P 1  PX 

/  dx 
Jo  Jo 


■dy. 
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Vector  Analysis 


1.  INTRODUCTION 

In  Chapter  3,  Sections  4  and  5,  we  have  discussed  the  basic  ideas  of  vector  algebra. 
The  principal  topic  of  this  chapter  will  be  vector  calculus.  First  (Sections  2  and  3) 
we  shall  consider  some  applications  of  vector  products.  Then  (Section  4  ff.)  we 
shall  discuss  differentiation  and  integration  of  vector  functions.  You  have  probably 
seen  Newton’s  second  law  F  =  ma  written  as  F  =  mcPr/dt2.  You  may  have  met 
Gauss’s  law  in  electricity  which  uses  a  surface  integral  of  the  normal  component  of 
a  vector  (Section  10).  Derivatives  and  integrals  of  vector  functions  are  important 
in  almost  every  area  of  applied  mathematics.  Such  diverse  fields  as  mechanics, 
quantum  mechanics,  electrodynamics,  theory  of  heat,  hydrodynamics,  optics,  etc., 
make  use  of  the  vector  equations  and  theorems  we  shall  discuss  in  this  chapter. 

2.  APPLICATIONS  OF  VECTOR  MULTIPLICATION 

In  Chapter  3,  Section  4,  we  defined  the  scalar  or  dot  product  of  vectors  A  and  B, 
and  the  vector  or  cross  product  of  A  and  B  as  follows,  where  9  is  the  angle  (<  180°) 
between  the  vectors: 
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Let  us  consider  some  applications  of  these  definitions. 

Work  In  elementary  physics  you  learned  that  work  equals  force  times  displace¬ 
ment.  If  the  force  and  displacement  are  not  parallel,  then  the  component  of  the 
force  perpendicular  to  the  displacement  does  no  work. 


Figure  2.2  Figure  2.3 

The  work  in  this  case  is  the  component  of  the  force  parallel  to  the  displacement, 
multiplied  by  the  displacement;  that  is  W  =  (FcosQ)  ■  d  =  Fd cos  9  (Figure  2.2). 
This  can  now  conveniently  be  written  as 

(2.3)  W  =  Fdcosd  =  F  •  d. 

If  the  force  varies  with  distance,  and  perhaps  also  the  direction  of  motion  d  changes 
with  time,  we  can  write,  for  an  infinitesimal  vector  displacement  dr  (Figure  2.3) 

(2.4)  dW  =  F  •  dr. 

We  shall  see  later  (Section  8)  how  to  integrate  dW  in  (2.4)  to  find  the  total  work 
W  done  on  a  particle  which  is  pushed  along  some  path  by  a  variable  force  F. 


Figure  2.5 


Torque  In  doing  a  seesaw  or  lever  problem  (Figure  2.4),  you  multiply  force  times 
distance;  the  quantity  Fd  is  called  the  torque  or  moment*  of  F,  and  the  distance  d 

*If  the  force  F  is  due  to  a  weight  w  =  mg,  then  the  torque  about  O  in  Figure  2.4  is  mg  •  d  = 
g  •  ( md );  the  moment  of  inertia  (Chapter  5,  Section  3)  of  m  about  O  is  md? .  The  quantity  md  is 
called  the  moment  (or  first  moment)  of  m  about  O,  and  the  quantity  md?  is  called  the  moment  of 
inertia  (or  second  moment)  of  m  about  O.  By  extension,  we  call  mgd  the  moment  of  mg,  or  Fd 
the  moment  of  F.  For  an  object  which  is  not  a  point  mass,  the  quantities  md  and  md 2  become 
integrals  (Chapter  5,  Section  3). 
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from  the  fulcrum  O  to  the  line  of  action  of  F  is  the  lever  arm  of  F.  The  lever  arm 
is  by  definition  the  perpendicular  distance  from  O  to  the  line  of  action  of  F.  Then 
in  general  (Figure  2.5)  the  torque  (or  moment)  of  a  force  about  O  (really  about 
an  axis  through  O  perpendicular  to  the  paper)  is  defined  as  the  magnitude  of  the 
force  times  its  lever  arm;  in  Figure  2.5  this  is  Fr  sin  0.  Now  r  X  F  has  magnitude 
rF sin0,  so  the  magnitude  of  the  torque  is  |r  X  F|.  We  can  also  use  the  direction 
of  r  X  F  in  describing  the  torque,  in  the  following  way.  If  you  curve  the  fingers  of 
your  right  hand  in  the  direction  of  the  rotation  produced  by  applying  the  torque, 
then  your  thumb  points  in  a  direction  parallel  to  the  rotation  axis.  It  is  customary 
to  call  this  the  direction  of  the  torque.  By  comparing  Figures  2.5  and  2.1,  we  see 
that  this  is  also  the  direction  of  r  X  F.  With  this  agreement,  then,  r  X  F  is  the 
torque  or  moment  of  F  about  an  axis  through  O  and  perpendicular  to  the  plane  of 
the  paper  in  Figure  2.5. 

Angular  Velocity  In  a  similar  way,  a  vector  is  used  to  represent  the  angular 
velocity  of  a  rotating  body.  The  direction  of  the  vector  is  along  the  axis  of  rotation  in 
the  direction  of  progression  of  a  right-handed  screw  turned  the 
way  the  body  is  rotating.  Suppose  P  in  Figure  2.6  represents  a 
point  in  a  rigid  body  rotating  with  angular  velocity  uj.  We  can 
show  that  the  linear  velocity  v  of  point  P  is  v  =  u  X  r.  First 
of  all,  v  is  in  the  right  direction:  It  is  perpendicular  to  the 
plane  of  r  and  uj  and  in  the  right  sense.  Next  we  want  to  show 
that  the  magnitude  of  v  is  the  same  as  |w  X  r|  =  or  sin  61  But 
rsin0  is  the  radius  of  the  circle  in  which  P  is  traveling,  and  uj 
is  the  angular  velocity;  thus  (rsin0)w  is  |v|,  as  we  claimed. 


<•> 


3.  TRIPLE  PRODUCTS 


There  are  two  products  involving  three  vectors,  one  called  the  triple  scalar  product 
(because  the  answer  is  a  scalar)  and  the  other  called  the  triple  vector  product 
(because  the  answer  is  a  vector). 


B 


Figure  3.2 


Triple  Scalar  Product  This  is  written  A  •  (B  X  C).  There  is  a  useful  geo¬ 
metrical  interpretation  of  the  triple  scalar  product  (see  Figure  3.1).  Construct  a 
parallelepiped  using  A,B,C  as  three  intersecting  edges.  Then  |B  X  C|  is  the  area 
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of  the  base  (Figure  3.2)  because  |B  X  C|  =  |B|  C|  sin 9,  which  is  the  area  of  a 
parallelogram  with  sides  |B|,  |C|,  and  angle  9.  The  height  of  the  parallelepiped  is 
| A j  cos  <f>  (Figure  3.1).  Then  the  volume  of  the  parallelepiped  is 

|B|  |C|  sin#|A|  cos <j>  =  |B  x  C|  |A|  cos^  =  A  •  (B  x  C). 


If  <j>  >  90°,  this  will  come  out  negative,  so  in  general  we  should  say  that  the  volume 
is  (A  •  (B  X  C)|.  Any  side  may  be  used  as  base,  so,  for  example,  B  •  (C  X  A)  must 
also  be  either  plus  or  minus  the  volume.  There  are  six  such  triple  scalar  products, 
all  equal  except  for  sign  [or  twelve  if  you  count  both  the  type  A  •  (B  X  C)  and  the 
type  (B  x  C)  •  A)]. 

To  write  the  triple  scalar  product  in  component  form  we  first  write  B  X  C  in 
determinant  form  [Chapter  3,  equation  (4.19)]: 


(3.1) 


B  X  C 


i  j  k 

Bx  By  Bz 

CX  Cy  CZ 


Now  A  ■  (B  X  C)  =  AX(B  X  C)x  +  A.^B  X  C)y  +  Az( B  X  C)z,  and  this  is  exactly 
what  we  get  by  expanding,  by  elements  of  the  first  row,  the  determinant  in  (3.2) 
below;  this  determinant  is  then  equal  to  A  •  (B  X  C). 


(3.2) 

A  -  (B  x  C)  = 

Ax 

Bx 

Ay 

By 

Az 

Bz 

cx 

Cy 

cz 

Recalling  that  an  interchange  of  rows  changes  the  sign  of  a  determinant,  we  can 
now  easily  write  out  the  six  (or  twelve)  products  mentioned  above  with  their  proper 
signs.  You  should  convince  yourself  of,  and  then  remember,  the  following  facts:  The 
order  of  the  factors  is  all  that  counts;  the  dot  and  cross  may  be  interchanged.  If  the 
order  of  factors  is  cyclic  (one  way  around  the  circle  in  Figure  3.3),  all  such  triple 
scalar  products  are  equal.  If  you  go  the  other  way,  you  get  another  set  all  equal  to 
each  other  and  the  negatives  of  the  first  set.  For  example, 


(3.3)  (A  x  B)  C  =  A  -  (B  x  C) 

=  C  •  (A  x  B) 

=  —(A  x  C)  •  B,  etc. 

Since  it  doesn’t  matter  where  the  dot  and  cross  are, 
the  triple  scalar  product  is  often  written  as  (ABC), 
meaning  A  •  (B  X  C)  or  (A  X  B)  •  C. 


Triple  Vector  Product  This  is  written  A  X  (B  X  C).  Before  we  try  to  evaluate 
it,  we  can  make  the  following  observations.  B  X  C  is  perpendicular  to  the  plane  of 
B  and  C.  A  X  (B  X  C)  is  perpendicular  to  the  plane  of  A  and  (B  X  C);  we  are 
particularly  interested  in  the  fact  that  A  X  (B  X  C)  is  perpendicular  to  (B  X  C). 
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Now  (see  Figure  3.4)  any  vector  perpendicular  to  B  X  C  lies 
in  the  plane  perpendicular  to  B  X  C,  that  is,  the  plane  of  B 
and  C.  Thus  A  X  (B  X  C)  is  some  vector  in  the  plane  of  B 
and  C,  and  can  be  written  as  some  combination  aB  +  6C,  where 
a  and  b  are  scalars  which  we  want  to  find.  (See  Chapter  3, 
Section  8,  Problem  5.)  One  way  to  find  a  and  b  is  to  write  out 
A  X  (B  X  C)  in  component  form.  We  can  simplify  this  work 
by  choosing  our  coordinate  system  carefully;  recall  that  a  vector 
equation  is  true  independently  of  the  coordinate  system.  Given 
the  vectors  A,  B,  C,  we  take  the  x  axis  along  B,  and  the  y  axis 
in  the  plane  of  B  and  C;  then  B  X  C  is  in  the  z  direction.  The 
vectors  in  component  form  relative  to  these  axes  are: 


B  x  C 


Figure  3.4 


B  =  Bxi, 

(3.4)  C  =  Cxi+Cyj, 

A  =  Axi  +  Ay  j  +  A~k. 

Using  (3.4)  we  find 


B  x  C  =  Bxi  x  [Cx i  +  Cy  j)  =  BXCV( i  x  j)  =  BxCy k. 
(3.5)  A  x  (B  x  C)  =  AxBxCy{ i  x  k)  +  AvBxCy{ j  x  k) 

=  AxBxCy  (—  j)  +  AyBxCy(i). 


We  would  like  to  write  A  X  (B  X  C)  in  (3.5)  as  a  combination  of  B  and  C;  we  can 
do  this  by  adding  and  subtracting  AXBXCX i: 

(3.6)  A  x  (B  x  C)  =  —AXBX(CX i  +  Cy  j)  +  (AyCy  +  AXCX)BX i. 

Each  of  these  expressions  is  something  simple  in  terms  of  the  vectors  in  (3.4): 

AXBX  =  A  •  B,  AyCy  AXCX  —  A  •  C, 

C,i  •  C„j  -  C.  BA  -  B. 

Using  (3.7)  in  (3.6),  we  get 

(3.8)  A  x  (B  x  C)  =  (A  •  C)B  —  (A  •  B)C. 

This  important  formula  should  be  learned,  but  not  memorized  in  terms  of  letters, 
because  that  is  confusing  when  you  want  some  other  combination  of  the  same  letters. 
Learn  instead  the  following  three  facts: 


The  value  of  a  triple  vector  product  is  a  linear  combination  of  the 
two  vectors  in  the  parenthesis  [B  and  C  in  (3.8)];  the  coefficient  of 
each  vector  is  the  dot  product  of  the  other  two;  the  middle  vector 
in  the  triple  product  [B  in  (3.8)]  always  has  the  positive  sign. 


This  method  also  covers  triple  vector  products  with  the  parenthesis  first;  by  (3.9), 
the  value  of  (B  X  C)  X  A  is  (A  •  B)C  —  (A  •  C)B.  This  is  correct  since  it  is  just 
the  negative  of  what  we  had  above  for  A  X  (B  X  C). 
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Applications  of  the  Triple  Scalar  Product  We  have  shown  that  the  torque  of 
a  force  F  about  an  axis  may  be  written  as  r  X  F  in  one  special  case,  namely  when 
r  and  F  are  in  a  plane  perpendicular  to  the  axis.  Now  let  us  consider  the  general 
case  of  finding  the  torque  produced  by  a  force  F  about  any  given  line  (axis)  L  in 
Figure  3.5.  Let  r  be  a  vector  from  some  (that  is,  any)  point 
on  L  to  the  line  of  action  of  F;  let  O  be  the  tail  of  r.  Then  we 
define  the  torque  about  the  point  O  to  be  r  X  F.  Note  that  this 
cannot  contradict  our  previous  discussion  of  torque  because  we 
were  considering  torque  about  a  line  before,  and  this  definition 
is  of  torque  about  a  point.  However,  we  shall  show  how  the  two 
notions  are  connected.  Also  notice  that  r  X  F  is  not  changed 
if  the  head  of  r  is  moved  along  F;  for  this  just  adds  a  multiple 
of  F  to  r,  and  F  X  F  =  0  (see  Problem  10). 


Figure  3.5 


We  shall  now  show  that  the  torque  of  F  about  the  line  L  through  O  is  n  •  (r  X  F) , 
where  n  is  a  unit  vector  along  L.  To  simplify  the  calculation,  choose  the  positive  z 
axis  in  the  direction  n;  then  n  =  k.  Think  of  a  door  hinged  to  rotate  about  the  z 


Figure  3.6 

axis  as  in  Figure  3.6.  Let  a  force  F  be  applied  to  it  at  the  head  of  the  vector  r.  We 
first  find  the  torque  of  F  about  the  2  axis  by  elementary  methods  and  definition. 
Break  F  into  its  components;  the  z  component  is  parallel  to  the  rotation  axis  and 
produces  no  torque  about  it  (pulling  straight  up  or  down  on  a  door  handle  does  not 
tend  to  open  or  close  the  door!).  The  x  and  y  components  can  be  seen  better  if  we 
draw  them  in  the  (x,y)  plane  (Figure  3.7;  note  that  the  x  and  y  axes  are  rotated 
90°  clockwise  from  their  usual  position  in  order  to  compare  this  figure  more  easily 
with  Figure  3.6).  The  torque  about  the  z  axis  produced  by  Fx  and  Fy  is  xFy  —  yFx 
by  the  elementary  definition  of  torque.  We  want  to  show  that  this  is  the  same  as 
n  •  (r  X  F)  or  here  k  •  (r  X  F).  Using  (3.2)  we  find 


k  •  (r  X  F) 


0  0  1 

x  y  z 

F  F  F 

±  x  ^  y  z 


=  xFy  -  yFx 


To  summarize: 
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In  Figure  3.5,  the  torque  of  F  about  point  O  is  r  X  F.  The  torque 
(3.10)  of  F  about  the  line  L  through  O  is  n  •  (r  X  F)  where  n  is  a  unit 
vector  along  L. 


This  proof  can  easily  be  given  without  reference  to  a  coordinate  system.  Let 
the  symbols  ||  and  _L  stand  for  parallel  and  perpendicular  to  the  given  rotation  axis 
n.  Then  any  vector  (F  or  r,  say)  can  be  written  as  the  sum  of  a  vector  parallel 
to  the  axis  and  a  vector  perpendicular  to  the  axis  (that  is,  somewhere  in  the  plane 
perpendicular  to  n): 

r  =  r_L+r|[,  F  =  Fj_  +  Fy. 

Then  the  torque  about  O  produced  by  F  is 

r  X  F  =  (r±  +  ry)  X  (Fj_  +  Fy) 

=  riXFi  +  rj_  xFy  Try  xF_l  +  ry  xFy . 

The  last  term  is  zero  (cross  product  of  parallel  vectors).  Also  ry  and  Fy  are  parallel 
to  n;  therefore  their  cross  products  with  anything  are  in  the  plane  perpendicular  to 
n,  and  the  dot  product  of  n  with  these  is  zero.  Hence  we  have 

n  •  (r  X  F)  =  n  •  (r j_  xFj_). 

Now  r  j_  and  F  j_  are  in  a  plane  perpendicular  to  n;  thus  the  torque  about  n  produced 
by  F_l  is  (by  Section  2)  r^xFj_.  But  since  only  the  component  of  F  perpendicular 
to  n  produces  a  torque  about  n,  r±  xF_l  is  the  total  torque  about  n  produced  by  F. 
The  vector  torque  rj^  xFj_  is  in  the  in  direction  since  r_L  and  Fj_  are  perpendicular 
to  n;  the  dot  product  of  this  vector  torque  with  the  unit  vector  n  gives  a  scalar 
torque  of  the  same  magnitude;  the  ±  sign  indicates  whether  the  torque  is  in  the  +n 
or  the  n  direction. 


Example  1.  If  F  =  i  +  3j  —  k  acts  at  the  point  (1, 1, 1),  find  the  torque  of  F  about  the 
line  r  =  3i  +  2k  +  (2i  —  2j  +  k)f. 

We  first  find  the  vector  torque  about  a  point  on  the  line,  say  the  point  (3,  0, 2). 
By  (3.10)  and  Figure  3.5,  this  is  r  X  F  where  r  is  the  vector  from  the  point  about 
which  we  want  the  torque,  to  the  point  at  which  F  acts,  that  is,  from  (3,0,2)  to 
(1, 1, 1);  then  r  =  (1, 1, 1)  —  (3, 0,  2)  =  (—2, 1,  —1).  The  vector  torque  is 


r  X  F  = 


i  j  k 

-2  1  -1 
13-1 


2i  —  3j  —  7k. 


The  torque  about  the  line  is  n  •  (r  X  F)  where  n  is  a  unit  vector  along  the  line, 
namely  n  =  i  (2i  —  2j  +  k) .  Then  the  torque  about  the  line  is 


n  •  (r  X  F)  =  |(2i  -  2j  +  k)  •  (2i  -  3 j  -  7k)  =  1. 
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Example  2.  As  another  application  of  the  triple  scalar  product,  let’s  find  the  Jacobian  we 
used  in  Chapter  5,  Section  4  for  changing  variables  in  a  multiple  integral.  As  you 
know,  in  rectangular  coordinates  the  volume  element  is  a  rectangular  box  of  volume 
dx  dy  dz.  In  other  coordinate  systems,  the  volume  element  may  be  approximately  a 
parallelepiped  as  in  Figure  3.1.  We  want  a  formula  for  the  volume  element  in  this 
case.  (See,  for  example,  the  cylindrical  and  spherical  coordinate  volume  elements 
in  Chapter  5,  Figures  4.4  and  4.5.) 

Suppose  we  are  given  formulas  for  x ,  y,  z  as  functions  of  new  variables  u,  v, 
w.  Then  we  want  to  find  the  vectors  along  the  edges  of  the  volume  element  in  the 
u,  v,  w  system.  Suppose  vector  A  in  Figure  3.1  is  along  the  direction  in  which  u 
increases  while  v  and  w  remain  constant.  Then  if  dr  =  i  dx  +  j  dy  +  k  dz  is  a  vector 
in  this  direction,  we  have 


dr 


du 


dx  .  dy 


dz 


A  =  —du=  ivr+j^-  +  k^-  du- 


du  du 


du 


Similarly  if  B  is  along  the  increasing  v  edge  of  the  volume  element  and  C  is  along 
the  increasing  w  edge,  we  have 


Then  by  (3.2) 


B  =  £*,= 
ov 

C=^dw  = 

dw 


.  dx  .  dy 
dv  dv 
.  dx  .  dy 
dw  dw 


dz 

dv 


dv, 


dz 

dw 


dw. 


A  •  (B  x  C)  = 


dx 

dy_ 

dz 

du 

du 

du 

dx 

dy 

dz 

dv 

dv 

dv 

dx 

dy 

dz 

dw 

dw 

dw 

du  dv  dw  =  J  dudv  dw 


where  J  is  the  Jacobian  of  the  transformation  from  x,  y,  z  to  u,  v,  w.  Recall  from 
the  discussion  of  (3.2)  that  the  triple  scalar  product  may  turn  out  to  be  positive 
or  negative.  Since  we  want  a  volume  element  to  be  positive,  we  use  the  absolute 
value  of  J.  Thus  the  u,  v,  w  volume  element  is  |  J|  dudvdw  as  stated  in  Chapter  5, 
Section  4. 


Applications  of  the  Triple  Vector  Product  In  Figure  3.8 
(compare  Figure  2.6),  suppose  the  particle  to  is  at  rest  on  a  ro¬ 
tating  rigid  body  (for  example,  the  earth).  Then  the  angular 
momentum  L  of  to  about  point  O  is  defined  by  the  equation 
L  =  r  X  (tov)  =  mr  X  v.  In  the  discussion  of  Figure  2.6,  we 
showed  that  v  =  u>  X  r.  Thus,  L  =  mr  X  (u>  X  r).  See  Prob¬ 
lem  16  and  also  Chapter  10,  Section  4. 

As  another  example,  it  is  shown  in  mechanics  that  the  cen¬ 
tripetal  acceleration  of  m  in  Figure  3.8isa  =  wx(uxr).  See 
Problem  17. 
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PROBLEMS,  SECTION  3 

1.  If  A  =  2i  —  j  — k,  B  =  2i  —  3j  +  k,  C  =  j  +  k,  find  (A-B)C,  A(B-C),  (A  x  B)-C, 

A  •  (B  x  C),  (A  x  B)  x  C,  A  x  (B  x  C). 

For  Problems  2  to  6,  given  A  =  i  +  j  —  2k,  B  =  2i  j  +  3k,  C  =  j  —  5k: 

2.  Find  the  work  done  by  the  force  B  acting  on  an  object  which  undergoes  the  dis¬ 

placement  C. 

3.  Find  the  total  work  done  by  forces  A  and  B  if  the  object  undergoes  the  displacement 
C.  Hint:  Can  you  add  the  two  forces  first? 

4.  Let  O  be  the  tail  of  B  and  let  A  be  a  force  acting  at  the  head  of  B.  Find  the  torque 
of  A  about  O;  about  a  line  through  O  perpendicular  to  the  plane  of  A  and  B;  about 
a  line  through  O  parallel  to  C. 

5.  Let  A  and  C  be  drawn  from  a  common  origin  and  let  C  rotate  about  A  with  an 
angular  velocity  of  2  rad/sec.  Find  the  velocity  of  the  head  of  C. 

6.  In  Problem  5,  draw  B  with  its  tail  at  the  head  of  A.  If  the  figure  is  rotating  as  in 
Problem  5,  find  the  velocity  of  the  head  of  B.  With  the  same  diagram,  let  B  be  a 
force;  find  the  torque  of  B  about  the  head  of  C,  and  about  the  line  C. 

7.  A  force  F  =  2i  —  3j  +  k  acts  at  the  point  (1,  5,  2).  Find  the  torque  due  to  F 

(a)  about  the  origin; 

(b)  about  the  y  axis; 

(c)  about  the  line  x/2  =  y/1  =  z/{—2). 

8.  A  vector  force  with  components  (1,2,3)  acts  at  the  point  (3,2, 1).  Find  the  vector 
torque  about  the  origin  due  to  this  force  and  find  the  torque  about  each  of  the 
coordinate  axes. 

9.  The  force  F  =  2i  —  j  —  5k  acts  at  the  point  (—5,  2, 1).  Find  the  torque  due  to  F 
about  the  origin  and  about  the  line  2x  =  — 4j/  =  —2. 

10.  In  Figure  3.5,  let  r'  be  another  vector  from  O  to  the  line  of  F.  Show  that  r'  X  F  = 
r  X  F.  Hint:  r  —  r'  is  a  vector  along  the  line  of  F  and  so  is  a  scalar  multiple  of  F. 
(The  scalar  has  physical  units  of  distance  divided  by  force,  but  this  fact  is  irrelevant 
for  the  vector  proof.)  Show  also  that  moving  the  tail  of  r  along  n  does  not  change 
n  •  r  X  F.  Hint:  The  triple  scalar  product  is  not  changed  by  interchanging  the  dot 
and  the  cross. 

11.  Write  out  the  twelve  triple  scalar  products  involving  A,  B,  and  C  and  verify  the 
facts  stated  just  above  (3.3). 

12.  (a)  Simplify  (A  •  B)2  —  [(A  X  B)  X  B]  •  A  by  using  (3.9). 

(b)  Prove  Lagrange’s  identity :  (A  xB)-(CxD)  =  (A  -  C)(B  •  D)  —  (A  -  D)(B  •  C). 

13.  Prove  that  the  triple  scalar  product  of  (A  X  B),  (B  X  C),  and  (C  X  A),  is  equal  to 
the  square  of  the  triple  scalar  product  of  A,  B,  and  C.  Hint:  First  let  (B  X  C)  =  D, 
and  evaluate  (A  x  B)  x  D.  [See  Am.  J.  Phys.  66,  739  (1998).] 

14.  Prove  the  Jacobi  identity.  A  x  (B  x  C)  +  B  x  (C  x  A)  +  C  x  (A  x  B)  =  0.  Hint: 
Expand  each  triple  product  as  in  equations  (3.8)  and  (3.9). 
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15.  In  the  figure  ui  is  a  unit  vector  in  the  direction  of 
an  incident  ray  of  light,  and  113  and  U2  are  unit 
vectors  in  the  directions  of  the  reflected  and  re¬ 
fracted  rays.  If  u  is  a  unit  vector  normal  to  the 
surface  AB ,  the  laws  of  optics  say  that  #1  =  #3  and 
m  sin  9\  =  712  sin  #2,  where  m  and  ri2  are  constants 
(indices  of  refraction).  Write  these  laws  in  vector 
form  (using  dot  or  cross  products). 


16.  In  the  discussion  of  Figure  3.8,  we  found  for  the  angular  momentum,  the  formula 
L  =  mr  X  (w  X  r).  Use  (3.9)  to  expand  this  triple  product.  If  r  is  perpendicular  to 
u;,  show  that  you  obtain  the  elementary  formula,  angular  momentum  =  mvr. 

17.  Expand  the  triple  product  for  a  =  a>  X  (u>  X  r)  given  in  the  discussion  of  Figure 

3.8.  If  r  is  perpendicular  to  ui  (Problem  16),  show  that  a  =  —  ui2r,  and  so  find 

the  elementary  result  that  the  acceleration  is  toward  the  center  of  the  circle  and  of 

magnitude  v2/r. 

18.  Two  moving  charged  particles  exert  forces  on  each  other  because  each  one  creates 
a  magnetic  field  in  which  the  other  moves  (see  Problem  4.6).  These  two  forces  are 
proportional  to  vi  X  [V2  X  r]  and  V2  X  [vi  X  (— r)]  where  r  is  the  vector  joining  the 
particles.  By  using  (3.9),  show  that  these  forces  are  equal  and  opposite  (Newton’s 
third  “law”)  if  and  only  if  r  X  (vi  X  V2)  =  0.  Compare  Problem  14. 

19.  The  force  F  =  i  +  3j  +  2k  acts  at  the  point  (1, 1, 1). 

(a)  Find  the  torque  of  the  force  about  the  point  (2,  —1,  5).  Careful!  The  vector  r 
goes  from  (2,  —1,  5)  to  (1, 1, 1). 

(b)  Find  the  torque  of  the  force  about  the  line  r  =  2i—  j  +  5k  +  (i  —  j  +  2k)f.  Note 
that  the  line  goes  through  the  point  (2,  —1,  5). 

20.  The  force  F  =  2i  —  5k  acts  at  the  point  (3,  —1,  0).  Find  the  torque  of  F  about  each 
of  the  following  lines. 

(a)  r  =  (2i  —  k)  +  (3j  —  4k)t. 

(b)  r  =  i  +  4j  +  2k  +  (2i  +  j  —  2k)t . 

4.  DIFFERENTIATION  OF  VECTORS 

If  A  =  i Ax  +  jAy  +  k Az,  where  i,  j,  k  are  fixed  unit  vectors  and  Ax,  Ay,  Az  are 
functions  of  t,  then  we  define  the  derivative  dA  / dt  by  the  equation 


(4.1) 


dA  ,dAx  .dAy  ,dAz 
dt  1  dt  ^  ^  dt  ^  dt 


Thus  the  derivative  of  a  vector  A  means  a  vector  whose  components  are  the  deriva¬ 
tives  of  the  components  of  A. 


286  Vector  Analysis 


Chapter  6 


Example  1. 


Let  (x,  y ,  z)  be  the  coordinates  of  a  moving  particle  at  time  t:  then  x,  y,  z  are 
functions  of  t.  The  vector  displacement  of  the  particle  from  the  origin  at  time  t 
is 


(4.2) 


r  =  ix  +  jy  +  k  z, 


where  r  is  a  vector  from  the  origin  to  the  particle  at  time  t.  We  say  that  r  is 
the  position  vector  or  vector  coordinate  of  the  particle.  The  components  of  the 
velocity  of  the  particle  at  time  t  are  dx/dt,  dy/dt,  dz/dt  so  the  velocity  vector  is 


(4.3) 


dr  ,dx  ,dy  dz 

dt  1  dt  dt  dt ' 


The  acceleration  vector  is 


(4.4) 


dv  . d2x 
dt  dt 2 


.d2y 

Jdt2 


d2z 

'It2' 


The  product  of  a  scalar  and  a  vector  and  the  dot  and  cross  products  of  vectors 
are  differentiated  by  the  ordinary  calculus  rules  for  differentiating  a  product,  with 
one  word  of  caution:  The  order  of  the  factors  must  be  kept  in  a  cross  product.  You 
can  easily  prove  equations  (4.5)  below  by  writing  out  components  (Problem  1)  and 
using  (4.1). 


(4.5) 


d  ,  .  ,  da  .  dA 
—  (aA)  =  —A  +  a——, 
dt y  dt  dt 


U  A. 
dtK 


*  dB 

dt 


dA 

dt 

dA 


B, 


-j-(A  xB)  =  Ax^  +  ;f  xB. 
dV  ’  dt  dt 


The  second  term  in  (d/dt){ A  •  B)  can  be  written  B  •  dA / dt  if  you  like  since  A  •  B  = 
B  •  A.  But  the  corresponding  term  in  (d/dt)( A  X  B)  must  not  be  turned  around 
unless  you  put  a  minus  sign  in  front  of  it  since  A  X  B  =  -B  X  A. 


Example  2.  Consider  the  motion  of  a  particle  in  a  circle  at  constant  speed.  We  can  then 
write 


(4.6) 


r2  =  r  •  r  =  const., 
v2  =  v  •  v  =  const. 


If  we  differentiate  these  two  equations  using  (4.5),  we  get 


dr 

2r  •  —  =  0 

dt 

2v^  =  0 

dt 


or  r  •  v  =  0, 
or  v  •  a  =  0. 


(4.7) 
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Also  differentiating  r  •  v  =  0,  we  get 

(4.8)  r-a  +  v-v  =  0  or  r*a=  —  v2. 

The  first  of  equations  (4.7)  says  that  r  is  perpendicular  to  v;  the  second  says  that 
a  is  perpendicular  to  v.  Therefore  a  and  r  are  either  parallel  or  antiparallel  (since 
the  motion  is  in  a  plane)  and  the  angle  9  between  a  and  r  is  either  0°  or  180°.  From 

(4.8)  and  the  definition  of  scalar  product,  we  have 

(4.9)  r  •  a  =  |r|  |a|  cos  9  =  —v2. 


Thus  we  see  that  cos  9  is  negative,  so  9  =  180°.  Then  from  (4.9)  we  get 

(4.10)  |r|  |a|(— 1)  =  —  v2  or  a=— . 

r 

We  have  just  given  a  vector  proof  that  for  motion  in  a  circle  at  constant  speed  the 
acceleration  is  toward  the  center  of  the  circle  and  of  magnitude  v2/r. 

So  far  we  have  written  vectors  only  in  terms  of  their  rectangular  components 
using  the  unit  basis  vectors  i,  j,  k.  It  is  often  convenient  to  use  other  coordinate 
systems,  for  example  polar  coordinates  in  two  dimensions  and  spherical  or  cylin¬ 
drical  coordinates  in  three  dimensions  (see  Chapter  5,  Section  4,  and  Chapter  10, 
Sections  8  and  9) .  We  shall  consider  using  vectors  in  various  coordinate  systems  in 
detail  in  Chapter  10,  but  it  will  be  useful  to  discuss  briefly  here  the  use  of  plane 
polar  coordinates.  In  Figure  4.1,  think  of  starting  at  the  point  (x,y)  or  (r,  9)  and 
moving  along  the  line  9  =  const,  in  the  direction  of  increasing  r.  We  call  this  the 
“r  direction”;  we  draw  a  unit  vector  (that  is,  a  vector  of  length  1)  in  this  direction 


Figure  4.2 


and  label  it  er.  Similarly,  think  of  moving  along  the  circle  r  =  const,  in  the  direction 
of  increasing  9.  We  call  this  the  “6  direction”;  we  draw  a  unit  vector  tangent  to 
the  circle  and  label  it  eg.  These  two  vectors  er  and  eg  are  the  polar  coordinate 
unit  basis  vectors  just  as  i  and  j  are  the  rectangular  unit  basis  vectors.  We  can 
now  write  any  given  vector  in  terms  of  its  components  in  the  directions  er  and 
eg  (by  finding  its  projections  in  these  directions).  There  is  a  complication  here, 
however.  In  rectangular  coordinates,  the  vectors  i  and  j  are  constant  in  magnitude 
and  direction.  The  polar  coordinate  unit  basis  vectors  are  constant  in  magnitude 
but  their  directions  change  from  point  to  point  (Figure  4.2).  Thus  in  calculating 
the  derivative  of  a  vector  written  in  polar  coordinates,  we  must  differentiate  the 
basis  vectors  as  well  as  the  components  [compare  (4.1)  where  we  differentiate  the 
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components  only.]  One  straightforward  way  to  do  this  is  to  express  the  vectors  er 
and  eg  in  terms  of  i  and  j.  From  Figure  4.3,  we  see  that  the  x  and  y  components 
of  er  are  cos  9  and  sin  9.  Thus  we  have 


(4.11) 


er  =  icos6*  +  j  sin  9. 


Similarly  (Problem  7)  we  find 


(4.12) 


eg  =  —  i  sin  9  +  j  cos  9. 


Differentiating  er  and  eg  with  respect  to  t,  we  get 


(4.13) 


der 

dt 

deg 

dt 


.  .  d9  .  d6 

— l  sin  9  — -  +  j  cos  0  — 
dt  dt 


d9 

~e0M’ 


,  d9 


.  d9 


— icos  9  — - jsin„  , 

dt  dt 


=  —e 


d9_ 

dt 


We  can  now  use  (4.13)  in  calculating  the  derivative  of  any  vector  which  is  written 
in  terms  of  its  polar  components. 


Example  3.  Given  A 
We  get 


Arer  +  Ageg,  where  Ar  and  Ag  are  functions  of  t,  find  d.A/dt. 


dA  dAr  der  dAg  deg 

-iu=e’-dr  +  A’-^r+e,^r+A‘-dr 


Using  (4.13),  we  find 


dA 

dt 


dAr  d9  dAg  d9 

=  er — — — |-  egAr—  +  eg—- - erAg—. 

dt  dt  dt  dt 


We  can  find  higher-order  derivatives  if  we  like  by  differentiating  again  using  (4.13) 
each  time  to  evaluate  the  derivatives  of  er  and  eg. 
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PROBLEMS,  SECTION  4 

1.  Verify  equations  (4.5)  by  writing  out  the  components. 

2.  Let  the  position  vector  (with  its  tail  at  the  origin)  of  a  moving  particle  be  r  =  r(t)  = 
t2 i  —  2tj  +  ( t 2  +  2f)k,  where  t  represents  time. 

(a)  Show  that  the  particle  goes  through  the  point  (4,  —4, 8).  At  what  time  does  it 
do  this? 

(b)  Find  the  velocity  vector  and  the  speed  of  the  particle  at  time  t;  at  the  time 
when  it  passes  though  the  point  (4,  —4,  8). 

(c)  Find  the  equations  of  the  line  tangent  to  the  curve  described  by  the  particle 
and  the  plane  normal  to  this  curve,  at  the  point  (4,  —4,  8). 

3.  As  in  Problem  2,  if  the  position  vector  of  a  particle  is  r  =  (4  +  3f)i  +  t3 j  —  5£k,  at 
what  time  does  it  pass  through  the  point  (1,  —1,  5)?  Find  its  velocity  at  this  time. 
Find  the  equations  of  the  line  tangent  to  its  path  and  the  plane  normal  to  the  path, 
at  (1,  —1,  5). 

4.  Let  r  =  r(f)  be  a  vector  whose  length  is  always  1  (it  may  vary  in  direction).  Prove 
that  either  r  is  a  constant  vector  or  dv/dt  is  perpendicular  to  r.  Hint:  Differentiate 

r  •  r. 

5.  The  position  of  a  particle  at  time  t  is  given  by  r  =  i  cos  t  +  j  sin  t  +  kt.  Show  that 
both  the  speed  and  the  magnitude  of  the  acceleration  are  constant.  Describe  the 
motion. 

6.  The  force  acting  on  a  moving  charged  particle  in  a  magnetic  field  B  is  F  =  q(y  X  B) 
where  q  is  the  electric  charge  of  the  particle,  and  v  is  its  velocity.  Suppose  that  a 
particle  moves  in  the  (x,  y)  plane  with  a  uniform  B  in  the  z  direction.  Assuming 
Newton’s  second  law,  m  dv/dt  —  F,  show  that  the  force  and  velocity  are  perpendic¬ 
ular  and  that  both  have  constant  magnitude.  Hint:  Find  ( d/dt){y  ■  v). 

7.  Sketch  a  figure  and  verify  equation  (4.12). 

8.  In  polar  coordinates,  the  position  vector  of  a  particle  is  r  =  rer.  Using  (4.13),  find 
the  velocity  and  acceleration  of  the  particle. 

9.  The  angular  momentum  of  a  particle  m  is  defined  by  L  =  mr  X  [dv/dt)  (see  end  of 
Section  3).  Show  that 

dL  d2  r 

— —  =  mr  X  -ttt. 

dt  dt2 

10.  If  V(t)  is  a  vector  function  of  t,  find  the  indefinite  integral 


5.  FIELDS 

Many  physical  quantities  have  different  values  at  different  points  in  space.  For 
example,  the  temperature  in  a  room  is  different  at  different  points:  high  near  a  reg¬ 
ister,  low  near  an  open  window,  and  so  on.  The  electric  field  around  a  point  charge 
is  large  near  the  charge  and  decreases  as  we  go  away  from  the  charge.  Similarly, 
the  gravitational  force  acting  on  a  satellite  depends  on  its  distance  from  the  earth. 
The  velocity  of  flow  of  water  in  a  stream  is  large  in  rapids  and  in  narrow  channels 
and  small  over  flat  areas  and  where  the  stream  is  wide.  In  all  these  examples  there 
is  a  particular  region  of  space  which  is  of  interest  for  the  problem  at  hand;  at  every 
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point  of  this  region  some  physical  quantity  has  a  value.  The  term  field,  is  used  to 
mean  both  the  region  and  the  value  of  the  physical  quantity  in  the  region  (for  ex¬ 
ample,  electric  field,  gravitational  field).  If  the  physical  quantity  is  a  scalar  (for 
example,  temperature),  we  speak  of  a  scalar  field.  If  the  quantity  is  a  vector 
(for  example,  electric  field,  force,  or  velocity),  we  speak  of  a  vector  field.  Note 
again  a  point  which  we  discussed  in  “endpoint  problems”  in  Chapter  4,  Section  10: 
Physical  problems  are  often  restricted  to  certain  regions  of  space,  and  our  mathe¬ 
matics  must  take  account  of  this. 


Z 


Figure  5.1 

A  simple  example  of  a  scalar  field  is  the  gravitational  potential  energy  near  the 
earth;  its  value  is  V  =  mgz  at  every  point  of  height  z  above  some  arbitrary  reference 
level  [which  we  take  as  the  (x,  y )  plane].  Suppose  that  on  a  hill  (Figure  5.1)  we  mark 
a  series  of  curves  each  corresponding  to  some  value  of  z  (curves  of  constant  elevation, 
often  called  contour  lines  or  level  lines).  Any  curve  or  surface  on  which  a  potential 
is  constant  is  called  an  equipotential.  Thus  these  level  lines  are  equipotentials  of  the 
gravitational  field  since  along  any  one  curve  the  value  of  the  gravitational  potential 
energy  mgz  is  constant.  The  horizontal  planes  which  intersect  the  hill  in  these 
curves  are  equipotential  surfaces  (or  level  surfaces)  of  the  gravitational  field.  (See 
Problems,  Section  6  for  more  examples.) 

As  another  example,  let  us  ask  for  the  equipotential  surfaces  in  the  field  of  an 
electric  point  charge  q.  The  potential  is  V  =  9*109<7/r  (in  SI  units)  at  a  point  which 
is  a  distance  r  from  the  charge.  The  potential  V  is  constant  if  r  is  constant;  that 
is,  the  equipotentials  of  this  electric  field  are  spheres  with  centers  at  the  charge. 
Similarly  we  could  imagine  drawing  a  set  of  surfaces  (probably  very  irregular)  in  a 
room  so  that  at  every  point  of  a  single  surface  the  temperature  would  be  constant. 
These  surfaces  would  be  like  equipotentials;  they  are  called  isothermals  when  the 
constant  quantity  is  the  temperature. 


6.  DIRECTIONAL  DERIVATIVE;  GRADIENT 

Suppose  that  we  know  the  temperature  T(x,y,z)  at  every  point  of  a  room,  say, 
or  of  a  metal  bar.  Starting  at  a  given  point  we  could  ask  for  the  rate  of  change 
of  the  temperature  with  distance  (in  degrees  per  centimeter)  as  we  move  away 
from  the  starting  point.  The  chances  are  that  the  temperature  increases  in  some 
directions  and  decreases  in  other  directions,  and  that  it  increases  more  rapidly  in 
some  directions  than  others.  Thus  the  rate  of  change  of  temperature  with  distance 
depends  upon  the  direction  in  which  we  move;  consequently  it  is  called  a  directional 
derivative.  In  symbols,  we  want  to  find  the  limiting  value  of  AT  /As  where  As  is  an 
element  of  distance  (arc  length)  in  a  given  direction,  and  AT  is  the  corresponding 
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change  in  temperature;  we  write  the  directional  derivative  as  dT/ds.  We  could  also 
ask  for  the  direction  in  which  dT/ds  has  its  largest  value;  this  is  physically  the 
direction  from  which  heat  flows  (that  is,  heat  flows  from  hot  to  cold,  in  the  opposite 
direction  from  the  maximum  rate  of  temperature  increase). 

Before  we  discuss  how  to  calculate  directional  derivatives,  consider  another  ex¬ 
ample.  Suppose  we  are  standing  at  a  point  on  the  side  of  the  hill  of  Figure  5.1  (not 
at  the  top),  and  ask  the  question  “In  what  direction  does  the  hill  slope  downward 
most  steeply  from  this  point?”  This  is  the  direction  in  which  you  would  start  to 
slide  if  you  lost  your  footing;  it  is  the  direction  most  people  would  probably  call 
“straight”  down.  We  want  to  make  this  vague  idea  more  precise.  Suppose  we  move 
a  small  distance  As  on  the  hill;  the  vertical  distance  Az  which  we  have  gone  may 
be  positive  (uphill)  or  negative  (downhill)  or  zero  (around  the  hill).  Then  Az/ As 
and  its  limit  dz/ds  depend  upon  the  direction  in  which  we  go;  dz/ds  is  a  directional 
derivative.  The  direction  of  steepest  slope  is  the  direction  in  which  dz/ds  has  its 
largest  absolute  value.  Notice  that  since  the  gravitational  potential  energy  of  a 
mass  to  is  V  =  mgz ,  maximizing  dz/ds  is  the  same  as  maximizing  dV/ds,  where 
the  equipotentials  on  the  hill  are  V (x,  y)  =  mgz(x,  y)  =  const. 

Let  us  now  state  and  solve  the  general  problem  of  finding  a  directional  derivative. 
We  are  given  a  scalar  field,  that  is,  a  function  y,  z)  [or  <j>(x,  y)  in  a  two- variable 
problem;  the  following  discussion  applies  to  two- variable  problems  if  we  simply  drop 
terms  and  equations  containing  z].  We  want  to  find  d(f>/ds,  the  rate  of  change  of 
4>  with  distance,  at  a  given  point  (xo,yo,zo)  and  in  a  given  direction.  Let  u  = 
io  +  jb  +  kc  be  a  unit  vector  in  the  given  direction.  In  Figure  6.1,  we  start  at 
{xo,  yo,Zo)  and  go  a  distance  s  (s  >  0)  in  the  direction  u  to  the  point  (x,y,z);  the 
vector  joining  these  points  is  us  since  u  is  a  unit  vector.  Then, 

( x ,  y,  z)  -  (x0,  yo,  z0)  =  us  =  (ai  +  bj  +  ck)s  ^ 

or 

!x  =  Xo  +  as, 
y  =  2/o  +  bs, 

Z  =  Zo  +  cs. 


(*o,  ro>  zo) 


Figure  6.1 


Equations  (6.1)  are  the  parametric  equations  of  the  line  through  (xo,  2/o,  ~o)  hr  the 
direction  u  [see  Chapter  3,  equation  (5.8)]  with  the  distance  s  (instead  of  t)  as  the 
parameter,  and  with  u  (instead  of  A)  as  the  vector  along  the  line.  From  (6.1)  we 
see  that  along  the  line,  x,  y ,  and  z  are  each  functions  of  a  single  variable,  namely  s 
[all  the  other  letters  in  (6.1)  are  given  constants].  If  we  substitute  x,  y ,  2  in  (6.1) 
into  <p(x,  y,  z),  then  </>  becomes  a  function  of  just  the  one  variable  s.  That  is,  along 
the  straight  line  (6.1),  <f>  is  a  function  of  one  variable,  namely  the  distance  along  the 
line  measured  from  (xq,  yo,  zq).  Since  <t>  depends  on  s  alone,  we  can  find  d<f)/ds: 


(6.2) 


d(f> 

ds 


d(j>  dx 
dx  ds 
d(f> 


d<\)  dy  d(f>  dz 

dy  ds  dz  ds 

d(j> 


Ab 


dx  dy 


dz 


This  is  the  dot  product  of  u  with  the  vector  i {d<f>/dx)  +\{d<f>/dy)  +  k {d<f>/dz).  This 
vector  is  called  the  gradient  of  <f>  and  is  written  grad^  or  V0  (read  “del  0”).  By 
definition 
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Example  1.  Find  the  directional  derivative  of  4>  =  x2y  +  xz  at  (1,  2,  —1)  in  the  direction 

A  =  2i  —  2j  +  k. 

Here  u  is  a  unit  vector  obtained  by  dividing  A  by  |A|.  Then  we  have 


u=  -(2i-2j  +  k). 


Using  (6.3)  we  get 


_  ,  .  dcj)  .  dcj)  dcj)  2. 

V0  =  i  —  +  j  —  +  k—  =  (2  xy  +  z)i  +  x2}  +  a’k, 


at  the  point  (1,  2,  —1)  =  3i  +  j  +  k. 


Then  from  (6.4)  we  find 


-f  &t  (1)  2,  —1)  =  V c, 
as 


u  =  2 - 

3 


The  gradient  of  a  function  has  useful  geometrical 
and  physical  meanings  which  we  shall  now  inves¬ 
tigate.  From  (6.4),  using  the  definition  of  a  dot 
product,  and  the  fact  that  |u|  =  1,  we  have 

(6.5)  -f-  =  |V</>|  cos#, 

as 


d(f}  . 

*  in  direction  u 


where  9  is  the  angle  between  u  and  the  vector  n. 

n.  0  =  const. 

V(/>.  Thus  d(j)/ds  is  the  projection  of  V^>  on  the  \ 

direction  u  (Figure  6.2).  We  find  the  largest  value 

of  d(j)/ds  (namely  |V(/>|)  if  we  go  in  the  direction  Figure  6.2 

of  V0  (that  is,  9  =  0  in  Figure  6.2).  If  we  go  in 

the  opposite  direction  (that  is,  9  =  180°  in  Figure  6.2)  we  find  the  largest  rate  of 
decrease  of  (/>,  namely  d<j)/ds  =  —  |V0|. 


Figure  6.2 


Example  2.  Suppose  that  the  temperature  T  at  the  point  (x,  y,  z)  is  given  by  the  equation 
T  =  x2  —  y2  +  xyz  +  273.  In  which  direction  is  the  temperature  increasing  most 
rapidly  at  (—1,  2, 3),  and  at  what  rate?  Here  VT  =  (2x+yz)i  +  (—2y+ xz)j  +  xyk  = 
4i  —  7j  —  2k  at  (—1,  2, 3),  and  the  increase  in  temperature  is  fastest  in  the  direction 
of  this  vector.  The  rate  of  increase  is  dT/ds  =  |VTj  =  ^/16  +  49  +  4  =  \/69.  We 
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can  also  say  that  the  temperature  is  decreasing  most  rapidly  in  the  direction  —  VT; 
in  this  direction,  dT/ds  =  —  \/69-  Heat  flows  in  the  direction  -  VT  (that  is,  from 
hot  to  cold). 


p(*0.  ><>• 


Next  suppose  u  is  tangent  to  the  surface  <f>  =  const,  at  the  point  P(xo,yo,  Zq) 
(Figure  6.3).  We  want  to  show  that  d<j>/ds  in  the  direction  u  is  then  equal  to 
zero.  Consider  A<f>/A s  for  paths  PA,  PB ,  PC, 
etc.,  approaching  the  tangent  u.  Since  <p  =  const, 
on  the  surface,  and  P,  A,  B,  C,  etc.  are  all  on 
the  surface,  A <fi  =  0,  and  A(f>/As  =  0  for  such 
paths.  But  d(f>/ds  in  the  tangent  direction  is  the 
limit  of  Acj)/ As  as  As  — ■>  0  (that  is,  as  PA,  PB, 
etc.,  approach  u),  so  d(j>/ds  in  the  direction  u  is 
zero  also.  Then  for  u  along  the  tangent  to  4>  =  const.,  •  u  =  0;  this  means  that 
V0  is  perpendicular  to  u.  Since  this  is  true  for  any  u  tangent  to  the  surface  at  the 
point  (x0,  yo,  zq),  then  at  that  point: 


Figure  6.3 


The  vector  V(^  is  perpendicular  (normal)  to  the  surface  <f>  =const. 


Since  |  V(^|  is  the  value  of  the  directional  derivative  in  the  direction  normal  (that  is, 
perpendicular)  to  the  surface,  it  is  often  called  the  normal  derivative  and  written 
|V0|  =  d<j)/dn. 

We  now  see  that  the  direction  of  largest  rate  of  change  of  a  given  function  cf> 
with  distance  is  perpendicular  to  the  equipotentials  (or  level  lines)  4>  =  const.  In 
the  temperature  problem,  the  direction  of  maximum  dT / ds  is  then  perpendicular  to 
the  isothermals.  At  any  point  this  is  the  direction  of  VT  and  is  called  the  direction 
of  the  temperature  gradient.  In  the  problem  of  the  hill,  the  direction  of  steepest 
slope  at  any  point  is  perpendicular  to  the  level  lines,  that  is,  along  Wz  or  VIA 

Example  3.  Given  the  surface  x3y2z  =  12,  find  the  equations  of  the  tangent  plane  and 
normal  line  at  (1,  —2,  3). 

This  is  a  level  surface  of  the  function  w  =  x3y2z,  so  the  normal  direction  is  the 
direction  of  the  gradient 


Vw  =  3x2y2zi  +  2x3yzj  +  x3y2k  =  36i  —  12j  +  4k  at  (1,— 2,3). 

A  simpler  vector  in  the  same  direction  is  9i  —  3j  +  k.  Then  (see  Chapter  3,  Section 
5)  the  equation  of  the  tangent  plane  is 

9(®-l)-3(y  +  2)  +  (z-3)  =  0, 


and  the  equations  of  the  normal  line  are 

x-l_y+2_z-3 


In  (6.3)  we  have  written  the  gradient  in  terms  of  its  rectangular  components. 
It  is  useful  to  write  it  in  cylindrical  and  spherical  coordinates  also.  (Note  that 
this  includes  polar  coordinates  when  z  =  0).  In  cylindrical  coordinates  we  want 
the  components  of  V(/>  in  the  directions  er,  eg,  and  e,  =  k.  According  to  (6.4), 
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the  component  of  V/  in  any  direction  u  is  the  directional  derivative  df  /ds  in  that 
direction.  (We  are  changing  the  function  from  <j>  to  /  since  </>  is  used  as  an  angle  in 
spherical,  and  sometimes  in  cylindrical  and  polar,  coordinates.)  The  element  of  arc 
length  ds  in  the  r  direction  is  dr  so  the  directional  derivative  in  the  r  direction  is 
df  /  dr  ( 9  and  z  constant)  which  we  write  as  df  /dr.  In  the  9  direction,  the  element 
of  arc  length  is  r  d6  (Chapter  5,  Section  4)  so  the  directional  derivative  in  the  9 
direction  is  df/(rdO)  (with  r  and  z  constant)  which  we  write  as  (1  /r)df  /d9.  Thus 
we  have  in  cylindrical  coordinates  (or  polar  without  the  2  term) 


(6.7) 

„  ,  df  1  df  df 

V/ =  er-^- +e0- +e,—  m 

or  r  oO  oz 

cylindrical  coordinates. 

In  a  similar  way  we  can  show  (Problem  21)  that 

(6.8) 

df  1  df  1  df 

in  spherical  coordinates. 

r  dr  8 9  r  d9  +  ^  r  sin  </>  d<f 

PROBLEMS,  SECTION  6 

1.  Find  the  gradient  of  w  =  x2y3z  at  (1,  2,  —1). 

2.  Starting  from  the  point  (1, 1),  in  what  direction  does  the  function  (j>  =  x2  —  y2  +  2xy 
decrease  most  rapidly? 

3.  Find  the  derivative  of  xy2  +  yz  at  (1, 1,  2)  in  the  direction  of  the  vector  2i  —  j  +  2k. 

4.  Find  the  derivative  of  zex  cos  y  at  (1,  0,  n/3)  in  the  direction  of  the  vector  i  +  2j. 

5.  Find  the  gradient  of  <j>  =  zsiny  —  xz  at  the  point  (2, 7r/2,  —  1) .  Starting  at  this 

point,  in  what  direction  is  f  decreasing  most  rapidly?  Find  the  derivative  of  (j>  in 

the  direction  2i  +  3j. 

6.  Find  a  vector  normal  to  the  surface  x2  +  y2  —  z  =  0  at  the  point  (3, 4,  25).  Find  the 
equations  of  the  tangent  plane  and  normal  line  to  the  surface  at  that  point. 

7.  Find  the  direction  of  the  line  normal  to  the  surface  x2y  +  y2z  +  z2x  +  1  =  0  at  the 
point  (1,2, —1).  Write  the  equations  of  the  tangent  plane  and  normal  line  at  this 
point. 

8.  (a)  Find  the  directional  derivative  of  f>  =  x2  +  sin  y  —  xz  in  the  direction  i  +  2j  —  2k 

at  the  point  (1,  n/2,  —3). 

(b)  Find  the  equation  of  the  tangent  plane  and  the  equations  of  the  normal  line 
to  f  =  5  at  the  point  (1,  n/2,  —3). 

9.  (a)  Given  <f>  =  x2  —  y2z,  find  V</>  at  (1, 1, 1). 

(b)  Find  the  directional  derivative  of  rf>  at  (1, 1, 1)  in  the  direction  i  —  2j  +  k. 

(c)  Find  the  equations  of  the  normal  line  to  the  surface  x2  —  y2z  =  0  at  (1, 1, 1). 

For  Problems  10  to  14,  use  a  computer  as  needed  to  make  plots  of  the  given  surfaces  and 
the  isothermal  or  equipotential  curves.  Try  both  3D  graphs  and  contour  plots. 
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10.  If  the  temperature  in  the  (x,  y)  plane  is  given  by  T  =  xy  —  x,  sketch  a  few  isothermal 
curves,  say  for  T  =  0,1,  2,  —1,  —2.  Find  the  direction  in  which  the  temperature 
changes  most  rapidly  with  distance  from  the  point  (1,1),  and  the  maximum  rate  of 
change.  Find  the  directional  derivative  of  T  at  (1, 1)  in  the  direction  of  the  vector 
3i  —  4j .  Heat  flows  in  the  direction  -  VT  (perpendicular  to  the  isothermals).  Sketch 
a  few  curves  along  which  heat  would  flow. 

11.  (a)  Given  <f)  —  x2  —  y2,  sketch  on  one  graph  the  curves  <f>  =  4,  <f>  =  1,  <j>  =  0, 

< p  =  —1,  rf>  =  —4.  If  <j>  is  the  electrostatic  potential,  the  curves  <j>  =  const,  are 
equipotentials,  and  the  electric  field  is  given  by  E  =  —  V</>.  If  (f)  is  temperature, 
the  curves  4>  =const.  are  isothermals  and  V</>  is  the  temperature  gradient;  heat 
flows  in  the  direction  —  V</>. 

(b)  Find  and  draw  on  your  sketch  the  vectors  —  V0  at  the  points  (*,  y)  =  (±1,  ±1), 
(0,±2),  (±2,0).  Then,  remembering  that  V<)>  is  perpendicular  to  <j>  =  const., 
sketch,  without  computation,  several  curves  along  which  heat  would  flow  [see (a)]. 

12.  For  Problem  11, 

(a)  Find  the  magnitude  and  direction  of  the  electric  field  at  (2,  1). 

(b)  Find  the  direction  in  which  the  temperature  is  decreasing  most  rapidly  at 
(-3,2). 

(c)  Find  the  rate  of  change  of  temperature  with  distance  at  (1,2)  in  the  direction 
3i  -  j. 

13.  Let  <j>  =  ex  cos  y.  Let  <j>  represent  either  temperature  or  electrostatic  potential.  Refer 
to  Problem  11  for  definitions  and  find: 

(a)  The  direction  in  which  the  temperature  is  increasing  most  rapidly  at  (1,  —n/4) 
and  the  magnitude  of  the  rate  of  increase. 

(b)  The  rate  of  change  of  temperature  with  distance  at  (0, 7r/3)  in  the  direction 
i  +  jt/3. 

(c)  The  direction  and  magnitude  of  the  electric  field  at  (0,  n). 

(d)  The  magnitude  of  the  electric  field  at  x  =  —1,  any  y. 

14.  (a)  Suppose  that  a  hill  (as  in  Fig.  5.1)  has  the  equation  z  =  32  —  x2  —  4 y2,  where 

z  =  height  measured  from  some  reference  level  (in  hundreds  of  feet).  Sketch  a 
contour  map  (that  is,  draw  on  one  graph  a  set  of  curves  a  =  const.);  use  the 
contours  z  =  32,  19,  12,  7,  0. 

(b)  If  you  start  at  the  point  (3,  2)  and  in  the  direction  i  +  j,  are  you  going  uphill 
or  downhill,  and  how  fast? 

15.  Repeat  Problem  14b  for  the  following  points  and  directions. 

(a)  (4,  —2),  i  +  j  (b)  (—3, 1),  4i  +  3j 

(c)  (2,  2),  — 3i  +  j  (d)  (—4,  —  1) ,  4i  —  3j 

16.  Show  by  the  Lagrange  multiplier  method  that  the  maximum  value  of  d(f>/ds  is  |V0|. 
That  is,  maximize  d<j)/ds  given  by  (6.3)  subject  to  the  condition  a2  +  b2  +  c2  =  1.  You 
should  get  two  values  (±)  for  the  Lagrange  multiplier  A,  and  two  values  (maximum 
and  minimum)  for  d(j>/ds.  Which  is  the  maximum  and  which  is  the  minimum? 

17.  Find  Vr,  where  r  =  \J  x2  +  y2,  using  (6.7)  and  also  using  (6.3).  Show  that  your 
results  are  the  same  by  using  (4.11)  and  (4.12). 

As  in  Problem  17,  find  the  following  gradients  in  two  ways  and  show  that  your  answers 

are  equivalent. 

18.  Vo;  19-  V?/  20.  V(r2) 

21.  Verify  equation  (6.8);  that  is,  find  V/  in  spherical  coordinates  as  we  did  for  cylin¬ 
drical  coordinates.  Hint :  What  is  ds  in  the  <j>  direction?  See  Chapter  5,  Figure  4.5. 
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7.  SOME  OTHER  EXPRESSIONS  INVOLVING  V 

If  we  write  V</>  as  [i(d/dx)  +  j (9/9y)  +  k (d/dz)\cj),  we  can  then  call  the  bracket  V. 
By  itself  V  has  no  meaning  (just  as  d/dx  alone  has  no  meaning;  we  must  put  some 
function  after  it  to  be  differentiated).  However,  it  is  useful  to  use  V  much  as  we 
use  d/dx  to  indicate  a  certain  operation. 


We  call  V  a  vector  operator  and  write 


(7.1) 


V  = 


.  9  .  9  d 

*9^ +J%+k9? 


It  is  more  complicated  than  d/dx  (which  is  a  scalar  operator)  because  V  has 
vector  properties  too. 


So  far  we  have  considered  V i f>  where  (f)  is  a  scalar;  we  next  want  to  consider  whether 
V  can  operate  on  a  vector. 

Suppose  V(a;,  y,  z)  is  a  vector  function,  that  is,  the  three  components  14,  Vy,  Vz 
of  V  are  functions  of  x,  y,  z: 

V(x,V,z)  =  iVx(x,y,z)  +j  Vv(x,y,z)  +  k.Vz  (x,y,z). 

(The  subscripts  mean  components,  not  partial  derivatives.)  Physically,  V  represents 
a  vector  field  (for  example,  the  electric  field  about  a  point  charge).  At  each  point 
of  space  there  is  a  vector  V,  but  the  magnitude  and  direction  of  V  may  vary  from 
point  to  point.  We  can  form  two  useful  combinations  of  V  and  V.  We  define  the 
divergence  of  V,  abbreviated  divV  or  V  •  V,  by  (7.2): 


(7.2) 


V  •  V  =  div  V  = 


914 

dx 


dVy_ 

dy 


914 

dz 


We  define  the  curl  of  V,  written  V  X  V,  by  (7.3): 


You  should  study  these  expressions  to  see  how  we  are  using  V  as  “almost”  a  vec¬ 
tor.  The  definitions  of  divergence  and  curl  are  the  partial  derivative  expressions,  of 
course.  However,  the  similarity  of  the  formulas  (7.2)  and  (7.3)  to  those  for  A  •  B 
and  A  X  B  helps  us  to  remember  V  •  V  and  V  X  V.  But  you  must  remember  to 
put  the  partial  derivative  “components”  of  V  before  the  components  of  V  in  each 
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term  [for  example,  in  evaluating  the  determinant  in  (7.3)].  Note  that  V  •  V  is  a 
scalar  and  V  X  V  is  a  vector  (compare  A  •  B  and  A  X  B).  We  shall  discuss  later 
the  meaning  and  some  of  the  applications  of  the  divergence  and  the  curl  of  a  vector 
function. 

The  quantity  V0  in  (6.3)  is  a  vector  function;  we  can  then  let  V  =  V0  in 
(7.2)  and  find  V  •  V0  =  divgrad0.  This  is  a  very  important  expression  called  the 
Laplacian  of  0 ;  it  is  usually  written  as  V20.  From  (6.3)  and  (7.2),  we  have 


(7.4) 


_  d  dcj)  d  d(f> 

,  =  V  •  V0  =  div  grad  0  =  — 

ox  ox  oy  ay 


d2(j) 
dx 2 


<920 
dy 2 


d2(j) 

Yz 2 


(the  Laplacian). 


d  dcj) 
dz  dz 


The  Laplacian  is  part  of  several  important  equations  in  mathematical  physics: 


V20=  — 


V"0  = 


=  0 

1  d2cf> 

a ?  dt2 
1  <90 


a* 


dt 


Laplace’s  equation, 
wave  equation. 

diffusion,  heat  conduction,  Schrodinger  equation. 


These  equations  arise  in  numerous  problems  in  heat,  hydrodynamics,  electricity 
and  magnetism,  aerodynamics,  elasticity,  optics,  etc.;  we  shall  discuss  solving  such 
equations  in  Chapter  13. 

There  are  many  other  more  complicated  expressions  involving  V  and  one  or  more 
scalar  or  vector  functions,  which  arise  in  various  applications  of  vector  analysis.  For 
reference  we  list  a  table  of  such  expressions  at  the  end  of  the  chapter  (page  339). 
Notice  that  these  are  of  two  kinds:  (1)  expressions  involving  two  applications  of 
V  such  as  V  •  V()  =  V20;  (2)  combinations  of  V  with  two  functions  (vectors  or 
scalars)  such  as  V  X  (0V).  We  can  verify  these  expressions  simply  by  writing  out 
components.  However,  it  is  usually  simpler  to  use  the  same  formulas  we  would  use 
if  V  were  an  ordinary  vector,  being  careful  to  remember  that  V  is  also  a  differential 
operator. 


Example  1.  Evaluate  V  X  (V  X  V).  We  use  (3.8)  for  A  x  (B  x  C)  being  careful  to  write 
both  V’s  before  the  vector  function  V  which  they  must  differentiate.  Then  we  get 

V  X  (V  X  V)  =  V(V  •  V)  -  (V  •  V)V 
=  V(V-  V)  -  V2V. 

This  is  a  vector  as  it  should  be;  the  Laplacian  of  a  vector,  V2V,  simply  means  a 
vector  whose  components  are  V214,  S72Vv,  V2VZ. 

Example  2.  Find  V  •  (0V),  where  0  is  a  scalar  function  and  V  is  a  vector  function.  Here 
we  must  differentiate  a  product,  so  our  result  will  contain  two  terms.  We  could 
write  these  as 


(7.5) 


V  •  (0V)  =  •  (0V)  +  Vv  •  (0V) 
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where  the  subscripts  on  V  indicate  which  function  is  to  be  differentiated.  Since  4> 
is  a  scalar,  it  can  be  moved  past  the  dot.  Then 

•  (^v)  =  (v0<A)  •  v  =  v  •  (V0), 

where  we  have  removed  the  subscript  in  the  last  step  since  V  no  longer  appears 
after  V.  Actually  you  may  see  in  books  (V0)  •  V  meaning  that  only  the  (j>  is  to  be 
differentiated,  but  it  is  clearer  to  write  it  as  V  •  (V(/>).  [Be  careful  with  (V0)  X  V, 
however;  assuming  that  this  means  that  only  (f>  is  to  be  differentiated,  the  clear  way 
to  write  it  is  —V  X  (V0);  note  the  minus  sign.]  In  the  second  term  of  (7.5),  0  is  a 
scalar  and  is  not  differentiated;  thus  it  is  just  like  a  constant  and  we  can  write  this 
term  as  <(>(V  •  V).  Collecting  our  results,  we  have 

(7.6)  V-  (</>V)  =  V-  V0  +  0(V-  V). 

In  Chapter  10,  Section  9,  we  will  derive  the  formulas  for  divV  =  V  •  V  and 
V2/  in  cylindrical  and  spherical  coordinates.  However,  it  is  useful  to  have  the 
results  for  reference,  so  we  state  them  here.  Actually,  these  can  be  done  as  partial 
differentiation  problems  (see  Chapter  4,  Section  11),  but  the  algebra  is  messy. 


PROBLEMS,  SECTION  7 

The  purpose  in  doing  the  following  simple  problems  is  to  become  familiar  with  the  formulas 
we  have  discussed.  So  a  good  study  method  is  to  do  them  by  hand  and  then  check  your 
results  by  computer. 

Compute  the  divergence  and  the  curl  of  each  of  the  following  vector  fields. 


1. 

r  =  xi  +  y  j  +  zk 

2. 

r  =  x'l  +  yj 

3. 

V  =  zi  +  y  j  +  xk 

4. 

V  =  yi  +  zj  +  xk 

5. 

V  =  x2i  +  y2  j  +  z2k 

6. 

V  =  x2yi  +  y2x j  +  xyzk 

7. 

V  =  x  sin  y  i  +  cos  y  j  +  xyk 

8. 

V  =  sinh  z  i  +  2y  j  +  *  cosh  z  k 

Section  8 


Line  Integrals  299 


Calculate  the  Laplacian  V2  of  each  of  the  following  scalar  fields. 


9.  x3  —  3xy2  +  y3 


11. 


V%2  -y2 

13.  xy{x2  +  y2  —  5  z2) 

15.  xyz(x2  —  2y2  +  z2) 


10.  In  (x2+y2) 

12.  (x  +  y)-1 

14.  (x2  +  y2  +  z2)-1^2 

16.  ln(a;2  +  y2  +  z2) 


17.  Verify  formulas  (b),  (c),  (d),  (g),  (h),  (i),  (j),  (k)  of  the  table  of  vector  identities 
at  the  end  of  the  chapter.  Hint  for  (j):  Start  by  expanding  the  two  triple  vector 
products  on  the  right. 

For  r  =  xi  +  y  j  +  zk,  evaluate 

18.  V  X  (k  X  r)  19. 


V-  ( 


20.  V  X  (  -r-r 
r 


8.  LINE  INTEGRALS 

In  Section  2,  we  discussed  the  fact  that  the  work  done  by  a  force  F  on  an  object 
which  undergoes  an  infinitesimal  vector  displacement  dr  can  be  written  as 

(8.1)  dW  =  F  •  dr. 

Suppose  the  object  moves  along  some  path  (say  A  to  B  in  Fig.  8.1),  with  the  force 
F  acting  on  it  varying  as  it  moves.  For  example,  F  might  be  the  force  on  a  charged 
particle  in  an  electric  field;  then  F  would  vary  from  point  to  point,  that  is  F  would 
be  a  function  of  x,  y,  z.  However,  on  a  curve,  x,  y,  z  are  related  by  the  equations  of 
the  curve.  In  three  dimensions  it  takes  two  equations  to  determine  a  curve  (as  an 
intersection  of  two  surfaces;  for  example,  consider  the  equations  of  a  straight  line 
in  Chapter  3,  Section  5).  Thus  along  a  curve  there  is  only  one  independent  variable; 


X 


■y 


Figure  8.1 

we  can  then  write  F  and  dr  =  i  dx+  j  dy  +  kdz  as  functions  of  a  single  variable.  The 
integral  of  dW  =  F  •  dr  along  the  given  curve  then  becomes  an  ordinary  integral 
of  a  function  of  one  variable  and  we  can  evaluate  it  to  find  the  total  work  done 
by  F  in  moving  an  object  in  Figure  8.1  from  A  to  B.  Such  an  integral  is  called  a 
line  integral.  A  line  integral  means  an  integral  along  a  curve  (or  line),  that  is,  a 
single  integral  as  contrasted  to  a  double  integral  over  a  surface  or  area,  or  a  triple 
integral  over  a  volume.  The  essential  point  to  understand  about  a  line  integral  is 
that  there  is  one  independent  variable,  because  we  are  required  to  remain  on  a 
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curve.  In  two  dimensions,  the  equation  of  a  curve  might  be  written  y  =  f(x),  where 
x  is  the  independent  variable.  In  three  dimensions,  the  equations  of  a  curve  (for 
example,  a  straight  line)  can  be  written  either  like  (6.6)  (where  we  could  take  x  as 
the  independent  variable  and  find  y  and  2  as  functions  of  x),  or  (6.1)  (where  s  is  the 
independent  variable  and  x,  y,  z  are  all  functions  of  s) .  To  evaluate  a  line  integral, 
then,  we  must  write  it  as  a  single  integral  using  one  independent  variable. 

Example  1.  Given  the  force  F  =  xy  i  —  y2  j,  find  the  work  done  by  F  along  the  paths 
indicated  in  Figure  8.2  from  (0,  0)  to  (2, 1). 

Since  r  =  x  i  +  y  j  on  the  (x,  y)  plane,  we  have 

dr  =  ldx  +  j  dy, 

F  •  dr  =  xy  dx  —  y2  dy. 

We  want  to  evaluate 

(8.2)  W  =  f  (xy  dx  —  y2  dy) . 


y 


First  we  must  write  the  integrand  in  terms  of  one  variable.  Along  path  1  (a  straight 
line),  y  =  ^ x ,  dy  =  ^dx.  Substituting  these  values  into  (8.2),  we  obtain  an  integral 
in  the  one  variable  x.  The  limits  for  x  (Figure  8.2)  are  0  to  2.  Thus,  we  get 


We  could  just  as  well  use  y  as  the  independent  variable  and  put  x  =  2 y,  dx  =  2 dy, 
and  integrate  from  0  to  1.  (You  should  verify  that  the  answer  is  the  same.) 


Along  path  2  in  Figure  8.2  (a  parabola),  y  =  \x2 ,  dy  =  \xdx.  Then  we  get 


W2 


16 


1  2  j  1 

x  ■  —x  dx - x 

4  16 


4  —xdx 
2 


2 

3 


dx 
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Along  path  3  (the  broken  line) ,  we  have  to  use  a  different  method.  We  integrate 
first  from  (0,  0)  to  (0, 1)  and  then  from  (0, 1)  to  (2, 1)  and  add  the  results.  Along 
(0, 0)  to  (0, 1),  x  =  0  and  dx  =  0  so  we  must  use  y  as  the  variable.  Then  we  have 


'y= o 


(0  •  y  •  0  -  y2dy)  = 


1 

3 


Along  (0, 1)  to  (2, 1),  y  =  1,  dy  =  0,  so  we  use  x  as  the  variable.  We  have 


/  (x  ■  1  •  dx  —  1  •  0)  = 
Jx= o 


=  2. 


Then  the  total  W3  =  —  i  +  2  =  |. 


Path  4  illustrates  still  another  technique.  Instead  of  using  either  x  or  y  as  the 
integration  variable,  we  can  use  a  parameter  t.  For  x  =  2 t  ,  y  =  t  ,  we  have 
dx  =  6 12  dt,  dy  =  2 tdt.  At  the  origin,  t  =  0,  and  at  (2, 1),  t  =  1.  Substituting  these 
values  into  (8.2),  we  get 

W4  =  /  (2 t3  ■  t2  •  6f2  (if  -  f4  •  2 1  dt)  =  [  (12 17  -  2 f5)  dt  =  ^ 

io  Jo  8  6  6 


Example  2.  Find  the  value  of 


/xdy  —  y  dx 
x2  +  y2 


along  each  of  the  two  paths  indicated  in  Figure  8.3  from  (—1,0)  to  (1,0).  [Notice 
that  we  could  have  written  I  =  fF-dr  with  F  =  (— i y  +  x])/(x2  +  y2);  however, 
there  are  also  many  other  kinds  of  problems  in  which  line  integrals  may  arise.] 


(0, 1) 


Along  the  circle  it  is  simplest  to  use  polar  coordinates;  then  r  =  1  at  all  points 
of  the  circle  and  6  is  the  only  variable.  We  then  have 

x  =  cos  9,  dx  =  —  sin  9  d9 , 
y  =  sin9,  dy  =  cos9d9}  x2  +  y2  =  1, 
xdy  —  y  dx  cos2  9d9  —  sin  9(—  sin  9)  d9 


x2  +  y2  1 

At  (—1, 0),  9  =  7r;  at  (1,  0),  9  =  0.  Then  we  get 


=  d9. 


h=  d9  =  -7 r. 


■o 
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Along  path  2,  we  integrate  from  (—1,0)  to  (0,1)  and  from  (0,1)  to  (1,0)  and 
add  the  results.  The  first  straight  line  has  the  equation  y  =  x  +  1;  then  dy  =  dx, 
and  the  integral  is 


f°  x  dx  —  (x  +  1)  dx 
/_!  X2  +  {x+l)2 


—dx 


—2  dx 


2x2  +  2x+l  J_ i  (2a; +  1)2  +  1 
o 

=  —  arctan(2a;  +  1) 


-l 


=  —  arc  tan  1  +  arc  tan(— 1) 

7T 
2' 


=  4  +  (-4) 

Along  the  second  straight  line  y  =  1  —  x,  dy  =  —dx,  and  the  integral  is 


f1  xdx  +  (1  —  x)  dx  I"1  —2  dx 

Jo  X2  +  (1  -  x)2  J0  (2x  -  l)2  +  1 

7T 

=  ~2' 


arctan(2a; 


1 


1) 


0 


Adding  the  results  for  the  integrals  along  the  two  parts  of  path  2,  we  get  I 2  =  —  7r. 


Conservative  Fields  Notice  that  in  Example  1  the  answers  were  different  for 
different  paths,  but  in  Example  2  they  are  the  same.  (See  Section  11,  however.) 
We  can  give  a  physical  meaning  to  these  facts  if  we  interpret  the  integrals  in  all 
cases  as  the  work  done  by  a  force  on  an  object  which  moves  along  the  path  of 
integration.  Suppose  you  want  to  get  a  heavy  box  across  a  sidewalk  and  up  into  a 
truck.  Compare  the  work  done  in  dragging  the  box  across  the  sidewalk  and  then 
lifting  it,  with  the  work  done  in  lifting  it  and  then  swinging  it  across  in  the  air. 
In  the  first  case  work  is  done  against  friction  in  addition  to  the  work  required  to 
lift  the  box;  in  the  second  case  the  only  work  done  is  that  required  to  lift  the  box. 
Thus  we  see  that  the  work  done  in  moving  an  object  from  one  point  to  another  may 
depend  on  the  path  the  object  follows;  in  fact,  it  usually  will  when  there  is  friction. 
Our  example  1  was  such  a  case.  A  force  field  for  which  W  =  f  F  •  dr  depends  upon 
the  path  as  well  as  the  endpoints  is  called  nonconservative ;  physically  this  means 
that  energy  has  been  dissipated,  say  by  friction.  There  are  however,  conservative 
fields  for  which  f  F  •  dr  is  the  same  between  two  given  points  regardless  of  what 
path  we  calculate  it  along.  For  example,  the  work  done  in  raising  a  mass  m  to  the 
top  of  a  mountain  of  height  h  is  W  =  mgli  whether  we  lift  the  mass  straight  up  a 
cliff  or  carry  it  up  a  slope,  as  long  as  no  friction  is  involved.  Thus  the  gravitational 
field  is  conservative. 

It  is  useful  to  be  able  to  recognize  conservative  and  nonconservative  fields  before 
we  do  the  integration.  We  shall  see  later  (Section  11)  that  ordinarily  curlF  =  0  [see 
(7.3)  for  the  definition  of  curl]  is  a  necessary  and  sufficient  condition  for  f  F  •  dr  to 
be  independent  of  the  path,  that  is,  curlF  =  0  for  conservative  fields  and  curlF  7!  0 
for  nonconservative  fields.  (See  Section  11  for  a  more  careful  discussion  of  this.) 
It  is  not  hard  to  see  why  this  is  usually  so.  Suppose  that  for  a  given  F  there  is  a 
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function  W  (x,  y,  z )  such  that 


(8.3) 


^  .dW  .dW 

F  =  VW  =  i—  +J^— 

ox  oy 

dW 


Fx  = 


dW 

~dx* 


F  = 

V  dy  ’ 


+  k 

F,  = 


dW 

Hz' 

dW 

dz 


Then  assuming  that  d2W/dxdy  =  d2W/dydx,  etc.  (see  Chapter  4,  end  of  Section  1), 
we  get  from  (8.3) 


(8.4) 


dFx 

dy 


d2W 

dydx 


dFy 

dx 


and  similarly 


dFy 

dz 


dFz 
dy  ' 


dFx 

dz 


dFz 

dx 


Using  the  definition  (7.3)  of  curlF,  we  see  that  equations  (8.4)  say  that  the  three 
components  of  curlF  are  equal  to  zero.  Thus  if  F  =  VW,  then  curlF  =  0.  Con¬ 
versely  (as  we  shall  show  later),  if  curlF  =  0,  then  we  can  find  a  function  W(x,  y,  z) 
for  which  F  =  VW.  Now  if  F  =  VW,  we  can  write 


(8.5) 


„  ,  „Tir  ,  dW  ,  dW  ,  dW  , 

F  •  dr  =  VW  ■  dr  =  — — dx  +  —— dy  +  —— dz  =  dW, 
ox  oy  oz 


pB  r>B 

/  F-dr  =  dW  =W(B)  -W{A), 
J A  J  A 


where  W(B)  and  W(A)  mean  the  values  of  the  function  W  at  the  endpoints  A  and 
B  of  the  path  of  integration.  Since  the  value  of  the  integral  depends  only  on  the 
endpoints  A  and  B,  it  is  independent  of  the  path  along  which  we  integrate  from  A 
to  B,  that  is,  F  is  conservative. 


Potentials  In  mechanics,  if  F  =  VW  (that  is,  if  F  is  conservative),  then  W  is  the 
work  done  by  F.  For  example,  if  a  mass  m  falls  a  distance  z  under  gravity,  the  work 
done  on  it  is  ingz.  If,  however,  we  lift  the  mass  a  distance  z  against  gravity,  the 
work  done  by  the  force  F  of  gravity  is  W  =  — mgz  since  the  direction  of  motion  is 
opposite  to  F.  The  increase  in  potential  energy  of  m  in  this  case  is  </>  =  +mgz,  that 
is,  W  =  —cj),  or  F  =  —  V(f>.  The  function  (j)  is  called  the  potential  energy  or  the  scalar 
potential  of  the  force  F.  (Of  course,  <f>  can  be  changed  by  adding  any  constant;  this 
corresponds  to  a  choice  of  the  zero  level  of  the  potential  energy  and  has  no  effect 
on  F.)  More  generally  for  any  vector  V,  if  curlV  =  0,  there  is  a  function  <f,  called 
the  scalar  potential  of  V,  such  that  V  =  —V(f>.  (This  is  the  customary  definition 
of  scalar  potential  in  mechanics  and  electricity;  in  hydrodynamics  many  authors 
define  the  velocity  potential  so  that  V  =  +V<f.) 

Now  suppose  that  we  are  given  F  or  dW  =  F  •  dr,  and  we  find  by  calculation 
that  curlF  =  0.  We  then  know  that  there  is  a  function  W  and  we  want  to  know 
how  to  find  it  (up  to  an  arbitrary  additive  constant  of  integration).  To  do  this  we 
can  calculate  the  line  integral  in  (8.5)  from  some  reference  point  A  to  the  variable 
point  B  along  any  convenient  path;  since  the  integral  is  independent  of  the  path 
when  curlF  =  0,  this  process  gives  the  value  of  W  at  the  point  B.  (There  is, 
of  course,  an  additive  constant  in  W  whose  value  depends  on  our  choice  of  the 
reference  point  A.) 
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Example  3.  Show  that 

(8.6) 


F  =  (2  xy  —  z3)i  +  x2j  —  (3  xz2  +  l)k 


is  conservative,  and  find  a  scalar  potential  <f>  such  that  F  =  — V0. 
We  find 


(8.7) 


V  X  F  = 


d_ 

dx 


d_ 

dy 


d_ 

dz 


2  xy  —  z3  x2  —3  xz2  —  1 


=  0, 


so  F  is  conservative.  Then 


(8.8) 


W  = 


/  F  •  dr  =  (2 xy  —  z 3)  dx  +  x2  dy  —  (3 xz2  +  1)  dz 

J  A  J  A 


is  independent  of  the  path.  Let  us  choose  the  origin  as  our  reference  point  and 
integrate  (8.8)  from  the  origin  to  the  point  ( x,y,z ).  As  the  path  of  integration,  we 
choose  the  broken  line  from  (0, 0,  0)  to  (x,  0,  0)  to  (x,  y,  0)  to  (x,  y,  z).  From  (0, 0,  0) 
to  (x,  0, 0),  we  have  y  =  z  =  0,  dy  =  dz  =  0,  so  the  integral  is  zero  along  this  part 
of  the  path.  From  (x,  0,  0)  to  (x,  y,  0),  we  have  x  =  const.,  2  =  0,  dx  =  dz  =  0,  so 
the  integral  is 

rv  rv 

/  x2  dy  =  x2  dy  =  x2y. 

Jo  Jo 

From  (x,y,  0)  to  ( x,  y,z )  we  have  x  =  const.,  y  =  const.,  dx  =  dy  =  0,  so  the 
integral  is 

—  /  (3 xz2  +  1  )dz  =  —  xz3  —  z. 

Jo 

Adding  the  three  results,  we  get 


(8.9) 


W  =  x2y  —  xz3  —  z, 


or 


(8.10) 


'  =  —W  =  —x2y  +  xz3  +  z. 


Example  4.  Find  the  scalar  potential  for  the  electric  field  due  to  a  point  charge  q  at  the 
origin. 

Recall  that  the  electric  field  at  a  point  r  =  ix  +  yy  +  kz  means  the  force  on  a 
unit  charge  at  r  due  to  q  and  is  (in  Gaussian  units) 


(8.11) 


q_r 


(This  is  Coulomb’s  law  in  electricity.)  If  we  take  the  zero  level  of  the  potential 
energy  at  infinity,  then  the  scalar  potential  <f>  means  the  negative  of  the  work  done 
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by  the  field  on  the  unit  charge  as  the  charge  moves  from  infinity  to  the  point  r. 
This  is 


(8.12) 


oo  to  r 


r  to  oo 


r  •  dr 

r3 


It  is  simplest  to  evaluate  the  line  integral  using  the  spherical  coordinate  variable  r 
along  a  radial  line.  This  is  justified  by  showing  that  curlE  =  0,  that  is,  that  E  is 
conservative  (Problem  19).  Since  the  differential  of  (r  •  r)  can  be  written  as  either 
d(r  •  r)  =  2r  •  dr  or  as  d(r  •  r)  =  d(r2)  =  2r  dr,  we  have  r  •  dr  =  r  dr  and  (8.12)  gives 


(8.13) 


r  lr 


1 

r 


It  is  interesting  to  obtain  r  •  dr  =  r  dr  geometrically; 
let  us  see  that  A  •  dA  =  AdA.  The  vector  dA  means 
a  change  in  the  vector  A;  a  vector  can  change  in  both 
magnitude  and  direction  (Figure  8.4).  The  scalar  A 
means  |A|;  the  scalar  dA  means  d|A|.  Thus  dA  is  the 
increase  in  length  of  A  and  is  not  the  same  as  |dA|.  In 
fact,  from  Figure  8.4,  we  see  that 

(8.14)  A  •  dA  =  |  A |  |dAj  cos  a  =  A  dA 

since  dA  =  |dA|  cos  a.  For  the  vector  r,  we  have 

r  =  ix  +  jy  +  kz, 
dr  =  i  dx  +  j  dy  +  k  dz, 

|  dr  |  =  \J dx2  +  dy2  +  dz2  =  ds  (see  Chapter  5), 

(8.15)  r  =  |r|  =  \/  x2  +  y2  +  z2, 

dr  =  i(a:2  +  y2  +  z2ff1^2(2x  dx  +  2 ydy  +  2 z  dz) 

=  -(r  •  dr), 
r 

as  above. 


in  fact,  for  any  vector  A, 


Figure  8.4 


Exact  Differentials  The  differential  dW  in  (8.5)  of  a  function  W(x,  y,  z)  is  called 
an  exact  differential.  We  could  then  say  that  curl  F  =  0  is  a  necessary  and  sufficient 
condition  for  F  •  dr  to  be  an  exact  differential  (but  see  Section  11).  To  make  this 
clear,  let  us  consider  some  examples  in  which  F-dr  is,  or  is  not,  an  exact  differential. 


Example  5.  Consider  the  function  W  in  (8.9). 

Then 

(8.16)  dW  =  (2 xy  —  z3)  dx  +  x2  dy  —  (3 xz2  +  1  )dz. 

Here  dW  is  an  exact  differential  by  definition  since  we  got  it  by  differentiating  a 
function  W.  We  can  easily  verify  that  if  we  write  dW  =  F  •  dr,  then  equations  (8.4) 
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are  true: 

^)  =  2x=?-(2xy-z‘), 

(8.17)  ^-(-3 xz1  -  1)  =  —3s2  =  -  23), 

£<-*«» -l)-0- £(,»). 

You  should  observe  carefully  how  to  get  (8.17)  from  (8.16):  the  equations  (8.17)  say 
that,  in  (8.16),  the  partial  derivative  with  respect  to  x  of  the  coefficient  of  dy  equals 
the  partial  derivative  with  respect  to  y  of  the  coefficient  of  dx,  and  similarly  for  the 
other  pairs  of  variables.  The  equations  (8.17)  are  called  the  reciprocity  relations  in 
thermodynamics;  in  mechanics  they  are  the  components  of  curlF  =  0  [see  (8.7)]. 
In  both  cases  they  are  true  assuming  that  the  mixed  second  partial  derivatives  are 
the  same  in  either  order,  for  example  d2W / dxdy  =  d2W / dydx  (see  Chapter  4,  end 
of  Section  1). 

We  obtained  dW  in  (8.16)  by  taking  the  differential  of  (8.9);  now  suppose  we 
start  with  a  given  dW  =  F  •  dr. 

Example  6.  Let  us  consider 

(8.18)  dW  =  F  •  dr  =  (2 xy  —  z3)  dx  +  x2  dy  +  (3 xz2  +  1)  dz. 

This  is  almost  the  same  as  (8.16);  just  the  sign  of  the  dz  term  is  changed.  Then 
two  of  the  equations  corresponding  to  (8.17)  do  not  hold,  so  curlF  ^  0,  and  dW  is 
not  an  exact  differential.  We  ask  whether  there  is  a  function  W  of  which  (8.18)  is 
the  differential;  the  answer  is  “No”  because  if  there  were,  the  mixed  second  partial 
derivatives  of  W  woidd  be  equal,  and  so  curlF  would  be  zero.  Equations  like  (8.18) 
often  occur  in  applications.  When  dW  is  not  exact,  then  F  is  a  nonconservative 
force,  and  f  F  •  dr,  which  is  the  work  done  by  F,  depends  not  only  on  the  points  A 
and  B  but  also  upon  the  path  along  which  the  object  moves.  As  we  have  said,  this 
happens  when  there  are  friction  forces. 

PROBLEMS,  SECTION  8 

1.  Evaluate  the  line  integral  f  ( x 2  —  y 2)  dx  —  2 xy  dy  along  each  of  the  following  paths 
from  (0,  0)  to  (1,  2). 

(a)  y  =  2x2. 

(b)  x  =  t2 ,  y  =  2t. 

(c)  y  =  0  from  x  =  0  to  x  =  2;  then  along  the  straight  line  joining  (2,  0)  to  (1,  2). 

2.  Evaluate  the  line  integral  f(x  +  2 y)  dx  —  2xdy  along  each  of  the  following  closed 
paths,  taken  counterclockwise: 

(a)  the  circle  x2  +  y2  =  1; 

(b)  the  square  with  corners  at  (1, 1),  (—1, 1),  (—1,  —1),  (1,  —  1); 

(c)  the  square  with  corners  (0,1),  (—1,0),  (0,-1),  (1,0). 
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3. 


Evaluate  the  line  integral  f  xydx  +  x  dy  from  (0,  0)  to 
(1,2)  along  the  paths  shown  in  the  sketch. 


4.  Evaluate  the  line  integral  fcy2  dx  +  2 xdy  +  dz,  where  C  connects  (0,0,0)  with 

(1,1,1), 


(a)  along  straight  lines  from  (0,  0,  0)  to  (1,  0,  0)  to  (1, 0, 1)  to  (1, 1, 1); 

(b)  on  the  circle  x2  +  y2  —  2y  =  0  to  (1,  1,  0)  and  then  on  a  vertical  line  to  (1, 1, 1). 


5.  Find  the  work  done  by  the  force  F  =  x2yi  —  xy2 j 
along  the  paths  shown  from  (1, 1)  to  (4,  2). 


6.  Find  the  work  done  by  the  force  F  =  (2 xy  —  3)i  +  a:2j  y 

in  moving  an  object  from  (1,  0)  to  (0, 1)  along  each  of  (o,  p, 

the  three  paths  shown: 

^ - 

> 

1 

(a)  straight  line, 

(b)  circular  arc, 

(c)  along  lines  parallel  to  the  axes. 

1  (1,'0) 

7.  For  the  force  field  F  =  (y  +  z) i  —  (x  +  z) j  +  (x  +  y)k,  find  the  work  done  in  moving 
a  particle  around  each  of  the  following  closed  curves: 

(a)  the  circle  x2  +  y2  =  1  in  the  (x,  y)  plane,  taken  counterclockwise; 

(b)  the  circle  x2  +  z2  =  1  in  the  (z,  x)  plane,  taken  counterclockwise; 

(c)  the  curve  starting  from  the  origin  and  going  successively  along  the  x  axis  to 
(1,  0,  0),  parallel  to  the  2  axis  to  (1, 0, 1),  parallel  to  the  (y,  z)  plane  to  (1, 1, 1), 
and  back  to  the  origin  along  x  =  y  =  2; 

(d)  from  the  origin  to  (0,  0,  2n)  on  the  curve  x  =  1  —  cost,  y  =  sint,  z  =  t,  and 
back  to  the  origin  along  the  2  axis. 

Verify  that  each  of  the  following  force  fields  is  conservative.  Then  find,  for  each,  a  scalar 
potential  (f>  such  that  F  =  —  V0. 


8. 

F  = 

9. 

F  = 

10. 

F  = 

11. 

F  = 

12. 

F  = 

13. 

F  = 

14. 

F  = 

15. 

F  = 

i  —  2j  —  y  k. 

( 3x2yz  —  3y)i  +  ( x3z  —  3x  )j  +  ( x3y  +  2z)'k. 
—kr,  r  =  \x  +  jj/  +  k2,  k  =  const. 
y  sin  2*  i  +  sin2a:j. 
yi  +  xj  +  k. 

z2  sinh  y  j  +  2z  cosh  y  k. 

y  x 

y* 1  -  X2y2  +  s/l  —  x2y2  ^ 

2x  cos2  y  i  —  (a:2  +  1)  sm2y  j. 
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16.  Given  Fi  =  2xi  —  2yzj  —  y2 k  and  F2  =  j/i  —  :rj, 


(a)  Are  these  forces  conservative?  Find  the  potential  cor¬ 
responding  to  any  conservative  force. 

(b)  For  any  nonconservative  force,  find  the  work  done  if  it 
acts  on  an  object  moving  from  (—1,  —1)  to  (1, 1)  along 
each  of  the  paths  shown. 


17.  Which,  if  either,  of  the  two  force  fields 


Fi  =  -yi  +  xj  +  zk,  F2  =  yi  +  xj  +  zk 


is  conservative?  Calculate  for  each  field  the  work  done  in  moving  a  particle  around 
the  circle  x  =  cost,  y  =  sinf  in  the  (x,y)  plane. 

18.  For  the  force  field  F  =  —  yi  +  x j  +  zk,  calculate  the  work  done  in  moving  a  particle 
from  (1,0,0)  to  (— 1,0, 7r) 

(a)  along  the  helix  x  =  cost,  y  =  sint,  z  =  t; 

(b)  along  the  straight  line  joining  the  points. 

Do  you  expect  your  answers  to  be  the  same?  Why  or  why  not? 

19.  Show  that  the  electric  field  E  of  a  point  charge  [equation  (8.11)]  is  conservative. 
Write  <j>  in  (8.13)  in  rectangular  coordinates,  and  find  E  =  —  V0  using  both  rect¬ 
angular  coordinates  (6.3)  and  cylindrical  coordinates.  Verify  that  your  results  are 
equivalent  to  (8.11). 

20.  For  motion  near  the  surface  of  the  earth,  we  usually  assume  that  the  gravitational 
force  on  a  mass  m  is 

F  =  —  mgk, 

but  for  motion  involving  an  appreciable  variation  in  distance  r  from  the  center  of 
the  earth,  we  must  use 

T ,  C  C  r  C 

F  = - 2er  = - o  1  T  =  2  r’ 

where  C  is  a  constant.  Show  that  both  these  F’s  are  conservative,  and  find  the 
potential  for  each. 

21.  Consider  a  uniform  distribution  of  total  mass  m!  over  a  spherical  shell  of  radius  r' . 
The  potential  energy  (j>  of  a  mass  m  in  the  gravitational  field  of  the  spherical  shell  is 


ct>  =  < 


const. 

Cm' 

r 


if  m  is  inside  the  spherical  shell, 

if  m  is  outside  the  spherical  shell,  where  r  is  the  distance 
from  the  center  of  the  sphere  to  m,  and  C  is  a  constant. 


Assuming  that  the  earth  is  a  spherical  ball  of  radius  R  and  constant  density,  find 
the  potential  and  the  force  on  a  mass  m  outside  and  inside  the  earth.  Evaluate  the 
constants  in  terms  of  the  acceleration  of  gravity  g,  to  get 


F,  m9fe„ 

and 

<j>  = 

mgR 2 

m  outside  the  earth; 

r2 

r  ’ 

F__mgr 

R  ’ 

and 

<t>  = 

^-,R2), 

m  inside  the  earth. 

Hint:  To  find  the  constants,  recall  that  at  the  surface  of  the  earth  the  magnitude  of 
the  force  on  m  is  mg. 
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9.  GREEN'S  THEOREM  IN  THE  PLANE 

The  fundamental  theorem  of  calculus  says  that  the  integral  of  the  derivative  of  a 
function  is  the  function,  or  more  precisely: 

(9-1)  J  =  f(b)~  f(a). 

We  are  going  to  consider  some  useful  generalizations  of  this  theorem  to  two  and 
three  dimensions.  The  divergence  theorem  and  Stokes’  theorem  (Sections  10  and 
11)  are  very  important  in  electrodynamics  and  other  applications;  in  this  section  we 
will  find  two-dimensional  forms  of  these  theorems.  First  we  develop  an  underlying 
useful  theorem  relating  an  area  integral  to  the  line  integral  around  its  boundary 
(see  applications  in  examples  and  problems  and  also  Chapter  14,  Section  3). 


Figure  9.1 

Recall  that  we  know  how  to  evaluate  line  integrals  (Section  8),  and  that  we 
learned  in  Chapter  5  to  evaluate  double  integrals  over  areas  in  the  (x,  y)  plane.  We 
are  going  to  consider  areas  (such  as  those  in  Figure  9.1  or  in  Chapter  5,  Figure  2.7) 
for  which  we  can  evaluate  the  double  integral  over  the  area  either  with  respect  to 
x  first  or  with  respect  to  y  first.  Look  at  Figure  9.1.  We  want  to  find  a  relation 
between  a  double  integral  over  the  area  A  and  a  line  integral  around  the  curve  C,  for 
simple  closed  curves  C.  (A  simple  curve  does  not  cross  itself;  for  example,  it  is  not 
a  figure  8.)  Now  in  Figure  9.1,  the  upper  part  of  C  between  points  1  and  2  is  given 
by  an  equation  y  =  yu{x)  and  the  lower  part  by  an  equation  y  =  yi(x).  (Think 
of  solving  the  equation  of  a  circle  for  yu(x)  =  \J  1  —  x2  and  yi(x)  =  —  a/1  —  x2.) 
Similarly  in  Figure  9.1,  we  can  find  Xi(y)  and  xr(y)  for  the  left  and  right  parts  of 
C  between  points  3  and  4. 

Let  P{x,  y)  and  Q(x,  y)  be  continuous  functions  with  continuous  first  derivatives. 
We  are  going  to  show  that  the  double  integral  of  dP(x ,  y) /dy  over  the  area  A  is  equal 
to  the  line  integral  of  P  around  C.  We  write  the  double  integral  using  Figure  9.1 
to  integrate  first  with  respect  to  y,  and  do  the  y  integration  by  equation  (9.1)  with 
t  =  y  to  get: 
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Now  we  have  our  answer — we  just  have  to  recognize  it!  Think  how  you  would 
evaluate  the  line  integral  of  P(x,  y)dx  along  the  lower  part  of  C  in  Figure  9.1  from 
point  1  to  point  2.  You  would  substitute  y  =  yi{x)  into  P(x,y)  and  integrate  from 
x  =  a  to  b  (see  Section  8). 

fb 

(9.3)  /  P(x,  yi)  dx  =  line  integral  of  P  dx 
J  a 

along  lower  part  of  C  from  point  1  to  point  2. 

This  is  one  of  the  terms  in  (9.2).  Similarly,  to  find  the  line  integral  of  P(x,y)dx 
along  the  upper  part  of  C  from  point  2  to  point  1,  we  substitute  y  =  yu(x)  and 
integrate  from  b  to  a. 

(9.4)  /  P(x,  yu)  dx  =  line  integral  of  P  dx 

Jb 

along  upper  part  of  C  from  point  2  to  point  1. 


Combining  (9.3)  and  (9.4)  gives  us  the  line  integral  all  the  way  around  C  in  the 
counterclockwise  direction,  that  is,  so  that  A  is  always  on  our  left  as  we  go  around 
C .  (The  symbol  j>  means  an  integral  around  a  closed  curve  back  to  the  starting 
point.)  Then,  from  (9.2),  we  have 


(9.5) 


c 


P  dx  =  — 


0P(x,  y) 
dy 


dx  dy. 


Repeating  the  calculation  but  integrating  first  with  respect  to  x,  we  find 


(9.6) 


dy  j  dx=  j  [Q(xr,y)  -  Q(xi,y)\dy 

/  rp  j  V/  <Xj  /  « 


Adding  (9.5)  and  (9.6)  and  using  the  notation  dA  to  mean  the  boundary  of  A  (that 
is,  C)  we  have 


Green’s  theorem  in  the  plane: 

(9.7)  JJ  dxdy  =  j f  (P  dx  +  Q  dy) 

A 

The  line  integral  is  counterclockwise  around  the  boundary  of  area  A. 


Using  Green’s  theorem  we  can  evaluate  either  a  line  integral  around  a  closed  path 
or  a  double  integral  over  the  area  inclosed,  whichever  is  easier  to  do.  If  the  area  is 
not  of  the  simple  type  we  have  assumed,  it  may  be  possible  to  cut  it  into  pieces  (see 
Figure  9.2)  so  that  our  proof  applies  to  each  piece.  Then  the  line  integrals  along 
the  dotted  cuts  in  Figure  9.2  are  in  opposite  directions  for  adjacent  pieces,  and  so 


Section  9 


Green’s  Theorem  in  the  Plane  311 


cancel.  Thus  the  theorem  is  valid  for  this  more  general  area  and  its  inclosing  curve. 
In  fact,  we  can  even  close  up  Figure  9.2  creating  an  area  with  a  hole  in  the  middle. 


Figure  9.2 

Green’s  theorem  still  holds,  but  now  the  line  integral  consists  of  a  counterclockwise 
integral  around  the  outside  plus  a  clockwise  integral  around  the  hole  as  you  can 
see  in  Figure  9.2.  We  say  that  this  area  is  not  ’’simply  connected” — see  further 
discussion  of  this  in  Section  11. 


Example  1.  In  Example  1,  Section  8,  we  found  the  line 
integral  (8.2)  along  several  paths  (Figure  8.2).  Sup¬ 
pose  we  want  the  line  integral  in  Figure  8.2  around 
the  closed  loop  (Figure  9.3)  from  (0,0)  to  (2,1)  and 
back  as  shown.  From  Section  8,  Example  1,  this  is 
the  work  done  along  path  2  minus  the  work  done 
along  path  3  (since  we  are  now  going  in  the  opposite 
direction);  we  find  W2  —  W3  =  |  —  |  =  —  1.  Let  us 
theorem.  From  (8.2)  and  (9.7)  we  have 


(0,  0) 


Figure  9.3 

evaluate  this  using  Green’s 


W  =  (b  xy  dx  —  y  dy  = 
I  dA 


— x  dx  dy  =  — 


A 

f  1  r2Vv 


d  .  2.  d  ' 

di{~v  > -  di{xv) 


dx  dy 


xdxdy  =  —1 


/  y= 0  J X—Q 


as  before. 

Example  2.  In  Section  8,  we  discussed  conservative  forces  for  which  work  done  is  inde¬ 
pendent  of  the  path.  By  Green’s  theorem  (9.7),  the  work  done  by  a  force  F  around 
a  closed  path  in  the  ( x ,  y)  plane  is 

W  =  fjF,  ix  +  «  =  //  -  |()  *  <%■ 

A 

If  ( dFy  / dx)  —  ( dFx  / dy)  =  0  (note  that  this  is  the  2  component  of  curl  F  =  0)  then 
W  around  any  closed  path  is  zero,  which  means  that  the  work  from  one  point  to 
another  is  independent  of  the  path  (also  see  Section  11). 

The  functions  P(x,  y)  and  Q{x,  y)  in  (9.7)  are  arbitrary;  we  may  choose  them  to 
suit  our  purposes.  Note  that  a  two-dimensional  vector  function  \Vx{x,  y)  +  jVy  (x,  y) 
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contains  two  functions,  Vx  and  Vy.  In  the  next  two  examples,  we  are  going  to  define 
P  and  Q  in  terms  of  Vx  and  Vy  in  order  to  obtain  two  useful  results. 


Example  3.  We  define: 

(9.8)  Q  =  VX,  P=  - Vy ,  where  V  =  iVx  +jVy. 


Then 


(9.9) 

by  (7.2)  with  Vz  =  0. 


dQ_dP  =  d v*  +  dvl  =  dwv 
dx  dy  dx  dy 

Along  the  curve  bounding  an  area  A  (Figure  9.4)  the  vector 


(9.10) 


dr  =  i  dx  +  j  dy  (tangent) 


is  a  tangent  vector,  and  the  vector 


(9.11)  n  ds  =  i  dy  —  j  dx  (outward  normal), 

where  n  is  a  unit  vector  and  ds  =  \J dx2  +  dy2 , 


is  a  normal  vector  (perpendicular  to  the  tangent)  pointing  out  of  area  A.  Using 

(9.11)  and  (9.8),  we  can  write 

(9.12)  Pdx  +  Qdy  =  -Vydx  +  Vx  dy  =  (i!4  +  jUy)  •  (idy-jx) 

=  V  •  nds. 


Then  substitute  (9.9)  and  (9.12)  into  (9.7)  to  get 


(9.13) 


div  Vdxdy=  V  •  n  ds. 

JdA 


This  is  the  divergence  theorem  in  two  dimensions.  It  can  be  extended  to  three 
dimensions  (also  see  Section  10).  Let  r  represent  a  volume;  then  dr  (read  boundary 
of  t)  means  the  closed  surface  area  of  t.  Let  dr  mean  a  volume  element  and  let  da 
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mean  an  element  of  surface  area.  At  each  point  of  the  surface,  let  n  be  a  unit  vector 
perpendicular  to  the  surface  and  pointing  outward.  Then  the  divergence  theorem 
in  three  dimensions  says  (also  see  Section  10) 


(9.14) 


div  V  dr  = 


V  •  n  da. 


Divergence  theorem 


r 


dr 


Example  4.  To  see  another  application  of  (9.7)  to  vector  functions,  we  let 
(9.15)  Q  =  Vy,  P  =  VX.  where  V  =  iVx+jVy. 

Then 


(9.16) 


dQ  dP  _  dVv  8VX 

dx  dy  dx  dy  ^CUr  ^  ’ 


by  (7.3)  with  Vz  =  0.  Equations  (9.10)  and  (9.15)  give 

(9.17)  Pdx  +  Q  dy  =  (iVx  +$Vy)  •  (i  dx  +  j  dy)  =  V  •  dr. 

Substituting  (9.16)  and  (9.17)  into  (9.7),  we  get 


(9.18) 


(curlV)  •  kdxdy  =  ®  V  •  dr 


dA 


This  is  Stokes’  theorem  in  two  dimensions.  It  can  be  extended 
to  three  dimensions  (Section  11).  Let  cr  be  an  open  surface  (for 
example,  a  hemisphere);  then  da  means  the  curve  bounding 
the  surface  (Figure  9.5).  Let  n  be  a  unit  vector  normal  to  the 
surface.  Then  Stokes’  theorem  in  three  dimensions  is  (also  see 
Section  11) 


da" 


Figure  9.5 


(9.19) 


JJ  (curl  V)  •  n  da 


V  •  dr. 


!  da 


Stokes’  theorem. 


The  direction  of  integration  for  the  line  integral  is  as  shown  in  Figure  9.5  (see  also 
Section  11). 


PROBLEMS,  SECTION  9 


1.  Write  out  the  equations  corresponding  to  (9.3)  and  (9.4)  for  f  Q  dy  between  points 
3  and  4  in  Figure  9.2,  and  add  them  to  get  (9.6). 

In  Problems  2  to  5  use  Green’s  theorem  [formula  (9.7)]  to  evaluate  the  given  integrals. 

2.  2 xdy  —  3 y  dx  around  the  square  with  vertices  (0,  2),  (2,  0),  (—2, 0),  and  (0,  —2). 
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3.  <fc  xy  dx  +  x2  dy ,  where  C  is  as  sketched. 


4.  fc  ex  cos  y  dx  —  ex  sin  y  dy,  where  C  is  the  broken  line  from 
A  =  (In  2,0)  to  D  —  (0,1)  and  then  from  D  to  B  = 

(—In 2,0).  Hint:  Apply  Green’s  theorem  to  the  integral 
around  the  closed  curve  ADBA. 

5.  fc(yex  —  1)  dx+ex  dy,  where  C  is  the  semicircle  through  (0,  —10),  (10,  0),  and  (0, 10). 
(Compare  Problem  4.) 

6.  For  a  simple  closed  curve  C  in  the  plane  show  by  Green’s  theorem  that  the  area 
inclosed  is 

A  =  \f  (xdy-ydx). 

7.  Use  Problem  6  to  show  that  the  area  inside  the  ellipse  x  =  a  cos  6,  y  =  bsin(9, 
0  <  6  <  27t,  is  A  =  nab. 

8.  Use  Problem  6  to  find  the  area  inside  the  curve  x 2^3  +  y 2^3  =  4. 

9.  Apply  Green’s  theorem  with  P  =  0,  Q  =  1 * 2  to  the  triangle  with  vertices  (0,0), 
(0,3),  (3,0).  You  will  then  have  ff  xdxdy  over  the  triangle  expressed  as  a  very 
simple  line  integral.  Use  this  to  locate  the  centroid  of  the  triangle.  (Compare 
Chapter  5,  Section  3.) 

Evaluate  each  of  the  following  integrals  in  the  easiest  way  you  can. 

10.  j>(2 y  dx  —  3x  dy)  around  the  square  bounded  by  x  =  3,  x  =  5,  y  =  1  and  y  =  3. 

11-  So  (xsina;  —  y)  dx  +  (x  —  y2)dy,  where  C  is  the  triangle  in  the  ( x,y )  plane  with 
vertices  (0,0),  (1, 1),  and  (2,0). 

12-  JV  —  x 2)  dx  +  ( 2xy  +  3)  dy  along  the  x  axis  from  (0,  0)  to  {VE,  0)  and  then  along 
a  circular  arc  from  (\/5,  0)  to  (1,  2). 


10.  THE  DIVERGENCE  AND  THE  DIVERGENCE  THEOREM 


We  have  defined  (in  Section  7)  the  divergence  of  a  vector  function  V(x,  y,  z)  as 


(10.1) 


div  V  =  V  •  V  = 


dVx 

dx 


dVz 

dz 


We  now  want  to  investigate  the  meaning  and  use  of  the  divergence  in  physical 
applications. 

Consider  a  region  in  which  water  is  flowing.  We  can  imagine  drawing  at  every 
point  a  vector  v  equal  to  the  velocity  of  the  water  at  that  point.  The  vector  function 
v  then  represents  a  vector  field.  The  curves  tangent  to  v  are  called  stream  lines. 
We  could  in  the  same  way  discuss  the  flow  of  a  gas,  of  heat,  of  electricity,  or  of 
particles  (say  from  a  radioactive  source) .  We  are  going  to  show  that  if  v  represents 
the  velocity  of  flow  of  any  of  these  things,  then  divv  is  related  to  the  amount  of 
the  substance  which  flows  out  of  a  given  volume.  This  could  be  different  from  zero 
either  because  of  a  change  in  density  (more  air  flows  out  than  in  as  a  room  is  heated) 
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or  because  there  is  a  source  or  sink  in  the  volume  (alpha  particles  flow  out  of  but  not 
into  a  box  containing  an  alpha-radioactive  source) .  Exactly  the  same  mathematics 
applies  to  the  electric  and  magnetic  fields  where  v  is  replaced  by  E  or  B  and  the 
quantity  corresponding  to  outflow  of  a  material  substance  is  called  flux. 


Figure  10.1 

For  our  example  of  water  flow,  let  V  =  vp,  where  p  is  the  density  of  the  water. 
Then  the  amount  of  water  crossing  in  time  t  an  area  A'  which  is  perpendicular  to 
the  direction  of  flow,  is  (see  Figure  10.1)  the  amount  of  water  in  a  cylinder  of  cross 
section  A'  and  length  vt.  This  amount  of  water  is 

(10.2)  (vt)(A')(p). 

The  same  amount  of  water  crosses  area  A  (see  Figure  10.1)  whose  normal  is  inclined 
at  angle  9  to  v.  Since  A!  =  Acos0, 

(10.3)  vtA'  p  =  vtpA  cos  6. 

Then  if  water  is  flowing  in  the  direction  v  making  an  angle  9  with  the  normal  n  to 
a  surface,  the  amount  of  water  crossing  unit  area  of  the  surface  in  unit  time  is 

(10.4)  up  cos  9  =  F  cos  9  =  V  •  n 
if  n  is  a  unit  vector. 

Now  consider  an  element  of  volume  dx  dy  dz  in  the  region  through  which  the 
water  is  flowing  (Figure  10.2).  Water  is  flowing  either  in  or  out  of  the  volume 
dx  dy  dz  through  each  of  the  six  surfaces  of  the  volume  element;  we  shall  calculate 
the  net  outward  flow.  In  Figure  10.2,  the  rate  at  which  water  flows  into  dxdydz 


Figure  10.2 

through  surface  1  is  [by  (10.4)]  V  •  i  per  unit  area,  or  (V  •  i)  dydz  through  the  area 
dy  dz  of  surface  1.  Since  V  •  i  =  Vx,  we  find  that  the  rate  at  which  water  flows 
across  surface  1  is  14  dydz.  A  similar  expression  gives  the  rate  at  which  water  flows 
out  through  surface  2,  except  that  Vx  must  be  the  x  component  of  V  at  surface  2 
instead  of  at  surface  1.  We  want  the  difference  of  the  two  Vx  values  at  two  points, 
one  on  surface  1  and  one  on  surface  2,  directly  opposite  each  other,  that  is,  for  the 
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same  y  and  z.  These  two  values  of  Vx  differ  by  A 14  which  can  be  approximated 
(as  in  Chapter  4)  by  dVx.  For  constant  y  and  z,  dVx  =  (dVx/dx)  dx.  Then  the 
net  outflow  through  these  two  surfaces  is  the  outflow  through  surface  2  minus  the 
inflow  through  surface  1,  namely, 


(10.5) 


[{Vx  at  surface  2)  —  ( Vx  at  surface  1  )\dydz 


dVx 

dx 


dx 


dy  dz. 


We  get  similar  expressions  for  the  net  outflow  through  the  other  two  pairs  of  opposite 
surfaces: 

through  top  and  bottom,  and 
through  the  other  two  sides. 

Then  the  total  net  rate  of  loss  of  water  from  dx  dy  dz  is 

(ot/  OT  r  ot r  \ 

-—A  +  —A  _|_  —A  )  c[x  dy  dz  =  div  V  dxdydz  or  V  •  V  dx  dy  dz. 
ox  dy  dz  J 

If  we  divide  (10.7)  by  dx  dy  dz,  we  have  the  rate  of  loss  of  water  per  unit  volume. 
This  is  the  physical  meaning  of  a  divergence:  It  is  the  net  rate  of  outflow  per  unit 
volume  evaluated  at  a  point  (let  dx  dy  dz  shrink  to  a  point).  This  is  outflow  of  actual 
substance  for  liquids,  gases,  or  particles;  it  is  called  flux  for  electric  and  magnetic 
fields.  You  should  note  that  this  is  somewhat  like  a  density.  Density  is  mass  per  unit 
volume ,  but  it  is  evaluated  at  a  point  and  may  vary  from  point  to  point.  Similarly, 
the  divergence  is  evaluated  at  each  point  and  may  vary  from  point  to  point. 

As  we  have  said,  div  V  may  be  different  from  zero  either  because  of  time  variation 
of  the  density  or  because  of  sources  and  sinks.  Let 


(10.6) 


dx  dy  dz 
dy 

-r-21  dx  dy  dz 
dz 


ip  =  source  density  minus  sink  density 

=  net  mass  of  fluid  being  created  (or  added  via  something  like  a  minute 
sprinkler  system)  per  unit  time  per  unit  volume; 
p  =  density  of  the  fluid  =  mass  per  unit  volume; 
dp/dt  =  time  rate  of  increase  of  mass  per  unit  volume. 

Then: 


Rate  of  increase  of  mass  in  dx  dy  dz  =  rate  of  creation  minus  rate  of  outward  flow, 
or  in  symbols 


-Jp-dx  dy  dz 
dt 


ip  dx  dy  dz  —  V  •  V  dx  dy  dz. 


Canceling  dxdydz,  we  have 


or 

(10.8) 


dp 

dt 


Ip-V -V 


V  -V  =  ip 


dp 
dt ' 
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If  there  are  no  sources  or  sinks,  then  ip  =  0;  the  resulting  equation  is  often  called 
the  equation  of  continuity.  (See  Problem  15.) 


If  dp/dt  =  0,  then 

(10.10)  V-V  =  ip. 

In  the  case  of  the  electric  field,  the  “sources”  and  “sinks”  are  electric  charges  and 
the  equation  corresponding  to  (10.10)  is  divD  =  ip,  where  ip  is  the  charge  density 
and  D  is  the  electric  displacement.  For  the  magnetic  field  B  you  would  expect 
the  sources  to  be  magnetic  poles;  however,  there  are  no  free  magnetic  poles,  so 
div  B  =  0  always. 

We  have  shown  that  the  mass  of  fluid  crossing  a  plane 
area  A  per  unit  time  is  AV  •  n,  where  n  is  a  unit  vec¬ 
tor  normal  to  A ,  v  and  p  are  the  velocity  and  density 
of  the  fluid,  and  V  =  wp.  Consider  any  closed  surface, 
and  let  da  represent  an  area  element  on  the  surface  (Fig¬ 
ure  10.3).  For  example:  for  a  plane,  da  =  dxdy ;  for  a 
spherical  surface, 

da  =  r2  sin  9  dO  d<p.  Figure  10.3 

Let  n  be  the  unit  vector  normal  to  da  and  pointing  out  of  the  surface  (n  varies 
in  direction  from  point  to  point  on  the  surface) .  Then  the  mass  of  fluid  flowing  out 
through  da  is  V  •  nda  by  (10.4)  and  the  total  outflow  from  the  volume  inclosed  by 
the  surface  is 

(10.11)  JJv-nda, 

where  the  double  integral  is  evaluated  over  the  closed  surface. 

We  showed  previously  [see  (10.7)]  that  for  the  volume  element  dr  =  dxdydz: 

(10.12)  The  outflow  from  dr  is  V  •  V  dr. 

For  simplicity,  we  proved  this  for  a  rectangular  coordinate  volume  element  dx  dy  dz. 
With  extra  effort  we  could  prove  it  more  generally,  say  for  volume  elements  with 
slanted  sides  or  for  spherical  coordinate  volume  elements.  From  now  on  we  shall 
assume  that  dr  includes  more  general  volume  element  shapes. 

It  is  worth  noticing  here  another  way  [besides  (7.2)]  of  defining  the  divergence. 
If  we  write  (10.11)  for  the  surface  of  a  volume  element  dr,  we  have  two  expressions 
for  the  total  outflow  from  dr,  and  these  must  be  equal.  Thus 


(10.13) 


V  •  V  dr  = 


V  -nda. 


surface 
of  dr 
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The  value  of  V  •  V  on  the  left  is,  of  course,  an  average  value  of  V  •  V  in  dr,  but  if 
we  divide  (10.13)  by  dr  and  let  dr  shrink  to  a  point,  we  have  a  definition  of  V  •  V 
at  the  point: 

(10.14)  V  V  =  lim  —  [[  V  •  n  da. 

dr  dr  J  J 

surface 
of  dr 

If  we  start  with  (10.14)  as  the  definition  of  V  •  V,  then  the  discussion  leading  to 
(10.7)  is  a  proof  that  V  •  V  as  defined  in  (10.14)  is  equal  to  V  •  V  as  defined  in 
(7.2). 


The  Divergence  Theorem  See  (10.17).  The  divergence  theorem  is  also  called 
Gauss’s  theorem,  but  be  careful  to  distinguish  this  mathematical  theorem  from 
Gauss’s  law  which  is  a  law  of  physics;  see  (10.23). 

Consider  a  large  volume  r;  imagine  it  cut  up  into  volume 
elements  dri  (a  cross  section  of  this  is  shown  in  Figure  10.4). 

The  outflow  from  each  dri  is  V  •  V  dr*;  let  us  add  together 
the  outflow  from  all  the  dr;  to  get 

(10.15)  £v-v  dr. i. 

i  Figure  10.4 


We  shall  show  that  (10.15)  is  the  outflow  from  the  large  volume  r.  Consider  the 
flow  between  the  elements  marked  a  and  b  in  Figure  10.4  across  their  common  face. 
An  outflow  from  a  to  &  is  an  inflow  (negative  outflow)  from  b  to  a,  so  that  in  the 
sum  (10.15)  such  outflows  across  interior  faces  cancel.  The  total  sum  in  (10.15) 
then  equals  just  the  total  outflow  from  the  large  volume.  As  the  size  of  the  volume 
elements  tends  to  zero,  this  sum  approaches  a  triple  integral  over  the  volume, 


(10.16) 


V  dr. 


We  have  shown  that  both  (10.11)  and  (10.16)  are  equal  to  the  total  outflow  from 
the  large  volume;  hence  they  are  equal  to  each  other,  and  we  have  the  divergence 
theorem  as  stated  in  (9.14): 


(10.17)  ///v-Vdr=  ff  V-nda. 

Divergence  theorem 

J  J  J  J  J 

volume  r  surface 

inclosing  r 

(n  points  out  of  the  closed  surface  a.) 

Notice  that  the  divergence  theorem  converts  a  volume  integral  into  an  integral  over 
a  closed  surface  or  vice  versa;  we  can  then  evaluate  whichever  one  is  the  easier 
to  do. 

In  (10.17)  we  have  carefully  written  the  volume  integral  with  three  integral  signs 
and  the  surface  integral  with  two  integral  signs.  However,  it  is  rather  common  to 
write  only  one  integral  sign  for  either  case  when  the  volume  or  area  element  is 
indicated  by  a  single  differential  (dr,  dV,  etc.,  for  volume;  da,  dA,  dS,  etc.,  for 
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surface  area).  Thus  we  might  write  ///dr  or  / dr  or  ///  dxdydz,  all  meaning  the 
same  thing.  When  the  single  integral  sign  is  used  to  indicate  a  surface  or  volume 
integral,  you  must  see  from  the  notation  (r  for  volume,  a  for  area),  or  the  words 
under  the  integral,  what  is  really  meant.  To  indicate  a  surface  integral  over  a  closed 
surface  or  a  line  integral  around  a  closed  curve,  the  symbol  /  is  often  used.  Thus 
we  might  write  either  //  da  or  /  da  for  a  surface  integral  over  a  closed  surface.  A 
different  notation  for  the  integrand  V  •  n  da  is  often  used.  Instead  of  using  a  unit 
vector  n  and  the  scalar  magnitude  da ,  we  may  write  the  vector  dcr  meaning  a  vector 
of  magnitude  da  in  the  direction  n;  thus  dcr  means  exactly  the  same  thing  as  n  dcr, 
and  we  may  replace  V  •  ndcr  by  V  •  dcr  in  (10.17). 


Example  of  the  Divergence  Theorem  Let  V  =  ix  + 

j y  +  kz  and  evaluate  /  V  -  ndcr  over  the  closed  surface  of  the 
cylinder  shown  in  Figure  10.5. 

By  the  divergence  theorem  this  is  equal  to  /  V  •  V  dr  over 
the  volume  of  the  cylinder.  (Note  that  we  are  using  single 
integral  signs,  but  the  notation  and  words  make  it  clear  which 
integral  is  a  volume  integral  and  which  a  surface  integral.)  We 
find  from  the  definition  of  divergence 


V.V  =  — +  ^ 

dx  dy 


Figure  10.5 


Then  by  (10.17) 


V  •nda  = 


V  •  V  dr  = 


3  dr  =  3 


dr 


surface  of  volume  of 

cylinder  cylinder 


=  3  times  volume  of  cylinder  =  3na2h. 


It  is  harder  to  evaluate  /  V  •  n  da  directly,  but  we  might  do  it  to  show  an  example 
of  calculating  a  surface  integral  and  to  verify  the  divergence  theorem  in  a  special 
case.  We  need  the  surface  normal  n.  On  the  top  surface  (Figure  10.5)  n  =  k,  and 
there  V-n  =  V*k=z  =  d.  Then 


V  •  n  dcr  =  h  /  da  =  h  ■ 


top  surface  of 
cylinder 


On  the  bottom  surface,  n  =  k,  V  •  n  =  —2  =  0;  hence  the  integral  over  the  bottom 
surface  is  zero.  On  the  curved  surface  we  might  see  by  inspection  that  the  vector 
ix  +  j  y  is  normal  to  the  surface,  so  for  the  curved  surface  we  have 

ix  +  iy  iz+j  y 

n  =  = - . 

\J x2  +  y2  a 


If  the  vector  n  is  not  obvious  by  inspection,  we  can  easily  find  it;  recall  (Section 
6)  that  if  the  equation  of  a  surface  is  <p(x,  y,z)  =  const.,  then  V0  is  perpendicular 
to  the  surface.  In  this  problem,  the  equation  of  the  cylinder  is  x2  +  y2  =  a2  \  then 
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4>  =  x2  +  y2,  =  2xi  +  2j/j,  and  we  get  the  same  unit  vector  n  as  above.  Then 

for  the  curved  surface  we  find 


V  •  n  = 


x 2  +  y 2 
a 


—  =  a, 
a 


V  •  n  da  =  a 


da  =  a  •  (area  of  curved  surface)  =  a  •  2n ah. 


curved 

surface 


The  value  of  j>  V*n  da  over  the  whole  surface  of  the  cylinder  is  then  Tra2h+2Tra2h  = 
3na2h  as  before. 


Gauss’s  Law  The  divergence  theorem  is  very  important  in  electricity.  In  order  to 
see  how  it  is  used,  we  need  a  law  in  electricity  known  as  Gauss’s  law.  Let  us  derive 
this  law  from  the  more  familiar  Coulomb’s  law  (8.11).  Coulomb’s  law  (written  this 
time  in  SI  units)  gives  for  the  electric  field  at  r  due  to  a  point  charge  q  at  the  origin 


(10.18)  E  =  - - -er.  Coulomb’s  law 

47reor^ 


(eo  is  a  constant  called  the  permittivity  of  free  space  and  - - 

47re0 

The  electric  displacement  D  is  defined  (in  free  space)  by  D  = 


=  9  •  109  in  SI  units.) 
eoE;  then 


(10.19) 


D  = 


£ 

47 rr2  r 


Let  a  be  a  closed  surface  surrounding  the  point  charge  q  at  the  origin;  let  da  be 
an  element  of  area  of  the  surface  at  the  point  r,  and  let  n  be  a  unit  normal  to  da 
(Figures  10.3  and  10.6).  Also  (Figure  10.6)  let  dA  be  the  projection  of  da  onto  a 
sphere  of  radius  r  and  center  at  O  and  let  dQ  be  the  solid  angle  subtended  by  da 
(and  dA)  at  O.  Then  by  definition  of  solid  angle 

(10.20)  dCl  =  4 dA . 

From  Figure  10.6  and  equations  (10.19)  and  (10.20),  we  get 

(10.21)  D  •  n  da  =  D  cos  9  da  =  D  dA  =  — ■  r 2  dVL  =  — q  dfl 

47t  r2  47t 
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We  want  to  find  the  surface  integral  of  D  •  n  dcr  over  the  closed  surface  <7;  by  (10.21) 
this  is 

(10.22)  j)  Y)-nda=-^~  J  dfl  =  ■  47T  =  q  (q  inside  cr). 

closed  total 

surface  a  solid  angle 

This  is  a  simple  case  of  Gauss’s  law  when  we  have  only  one  point  charge  q\  for  most 
purposes  we  shall  want  Gauss’s  law  in  the  forms  (10.23)  or  (10.24)  below.  Before 
we  derive  these,  we  should  note  carefully  that  in  (10.22)  the  charge  q  is  inside  the 
closed  surface  cr.  If  we  repeat  the  derivation  of  (10.22)  for  a  point  charge  q  outside 
the  surface  (Problem  13),  we  find  that  in  this  case 


D  •  nda  =  0. 


closed  g 


Next  suppose  there  are  several  charges  qi  inside  the  closed  surface.  For  each  q,: 
and  the  Di  corresponding  to  it,  we  could  write  an  equation  like  (10.22).  But  the 
total  electric  displacement  vector  D  at  a  point  due  to  all  the  qi  is  the  vector  sum 
of  the  vectors  D,; .  Thus  we  have 

D  •  nda 

closed 
surface  a 


=  Y,  f  D  i-nda  =  ^ Qj . 


closed 
surface  a 


Therefore  for  any  charge  distribution  inside  a  closed  surface 


(10.23) 


D  •  nda  =  total  charge  inside  the  closed  surface. 


closed 

surface 


Gauss’s  law 


If,  instead  of  isolated  charges,  we  have  a  charge  distribution  with  charge  density  p 
(which  may  vary  from  point  to  point),  then  the  total  charge  is  J  pdr,  so 


(10.24) 

®  D  •  n  da  = 

/  pdr. 

Gauss’s  law 

J 

closed 
surface  a 

J 

volume 
bounded  by  cr 

Since  (by  Problem  13)  charges  outside  the  closed  surface  cr  do  not  contribute  to  the 
integral,  (10.23)  and  (10.24)  are  correct  if  D  is  the  total  electric  displacement  due  to 
all  charges  inside  and  outside  the  surface.  The  total  charge  on  the  right-hand  side 
of  these  equations  is,  however,  just  the  charge  inside  the  surface  a.  Either  (10.23) 
or  (10.24)  is  called  Gauss’s  law. 

We  now  want  to  see  the  use  of  the  divergence  theorem  in  connection  with  Gauss’s 
law.  By  the  divergence  theorem,  the  surface  integral  on  the  left-hand  side  of  (10.23) 
or  (10.24)  is  equal  to 

V  •  D  dr. 

volume 
bounded  by  a 
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Then  (10.24)  can  be  written  as 


V  •  D  dr  = 


Since  this  is  true  for  every  volume,  we  must  have  V-D  =  p;  this  is  one  of  the  Maxwell 
equations  in  electricity.  What  we  have  done  is  to  start  by  assuming  Coulomb’s  law; 
we  have  derived  Gauss’s  law  from  it,  and  then  by  use  of  the  divergence  theorem,  we 
have  derived  the  Maxwell  equation  V-D  =  p.  From  a  more  sophisticated  viewpoint, 
we  might  take  the  Maxwell  equation  as  one  of  our  basic  assumptions  in  electricity. 
We  could  then  use  the  divergence  theorem  to  obtain  Gauss’s  law: 


(10.25) 


D  •  n  da  = 


V  •  D  dr  = 


p  dr 


closed 
surface  a 


volume  r 
inside  cr 


volume  r 


=  total  charge  inclosed  by  cr. 


From  Gauss’s  law  we  could  then  derive  Coulomb’s  law  (Problem  14);  more  generally 
we  can  often  use  Gauss’s  law  to  obtain  the  electric  field  produced  by  a  given  charge 
distribution  as  in  the  following  example. 


Example.  Find  E  just  above  a  very  large  conducting  plate  carrying  a  surface  charge  of 
C  coulombs  per  square  meter  on  each  surface. 

The  electric  field  inside  a  conductor  is  zero  when  we  are  considering  an  electro¬ 
statics  problem  (otherwise  current  would  flow).  From  the  symmetry  of  the  problem 
(all  horizontal  directions  are  equivalent),  we  can  say  that  E  (and  D)  must  be  vertical 
as  shown  in  Figure  10.7.  We  now  find  f  D  •  nda  over  the  box  whose  cross  section 
is  shown  by  the  dotted  lines.  The  integral  over  the  bottom  surface  is  zero  since 
D  =  0  inside  the  conductor.  The  integral  over  the  vertical  sides  is  zero  because  D 
is  perpendicular  to  n  there.  On  the  top  surface  D  •  n  =  |D|  and  f  D  •  nda  =  |D| - 
(surface  area).  By  (10.25)  this  is  equal  to  the  charge  inclosed  by  the  box,  which 
is  C-  (surface  area).  Thus  we  have  |D |  •  (surface  area)  =  C  ■  (surface  area)  ,  or 
|D|  =  C  and  |E|  =  C/e0. 
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Figure  10.7 


PROBLEMS,  SECTION  10 


1.  Evaluate  both  sides  of  (10.17)  if  V  =  r  =  ia:  +  j y  +  k z,  and  r  is  the  volume 
x2  +  y2  +  z2  <  1,  and  so  verify  the  divergence  theorem  in  this  case. 

2.  Given  V  =  x2i  +  y2j  +  z2 k,  integrate  V  •  nda  over  the  whole  surface  of  the  cube 
of  side  1  with  four  of  its  vertices  at  (0,  0,  0),  (0,  0, 1),  (0, 1,  0),  (1,  0,  0).  Evaluate  the 
same  integral  by  means  of  the  divergence  theorem. 
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Evaluate  each  of  the  integrals  in  Problems  3  to  8  as  either  a  volume  integral  or  a  surface 
integral,  whichever  is  easier. 

3.  ff  r  •  n  da  over  the  whole  surface  of  the  cylinder  bounded  by  x2  +  y2  =  1,  z  =  0, 
and  a  =  3;  r  means  ix  +  jy  +  kz. 

4.  ff~V-n  da  if  V  =  x  cos2  y  i  +  xz  j  +  z  sin2  y  k  over  the  surface  of  a  sphere  with  center 
at  the  origin  and  radius  3. 

5.  fff  (V  ■  F)  dr  over  the  region  x2  +  y2  +  z2  <  25,  where 

F  =  (x2  +  y2  +  z2)(x  i  +  y$  +  zk). 


6.  fff  V  •  V  dr  over  the  unit  cube  in  the  first  octant,  where 

V  =  (x3  -  x2)y\  +  (y3  -  2 y2  +  y)x j  +  (z2  -  l)k. 

7.  ff  r  ■  nda  over  the  entire  surface  of  the  cone  with  base  x2  +  y2  <  16,  z  =  0,  and 
vertex  at  (0,  0,  3),  where  r  =  ix  +  j y  +  kz. 

8.  fff  V  •  V  dr  over  the  volume  x2  +  y2  <  4,  0  <  z  <  5,  V  =  (y/ x 2  +  y2  )(ix  +  jy). 

9 .  If  F  =  xi  +  yj ,  calculate  ff  F  •  n  da  over  the  part  of  the  surface  z  =  4  —  x2  —  y2  that  is 
above  the  ( x,y )  plane,  by  applying  the  divergence  theorem  to  the  volume  bounded 
by  the  surface  and  the  piece  that  it  cuts  out  of  the  (x,  y)  plane.  Hint:  What  is  F  •  n 
on  the  (x,  y)  plane? 

10.  Evaluate  ff  V  •  n  da  over  the  curved  surface  of  the  hemisphere  x2  +  y2  +  z2  =  9, 
z  >  0,  if  V  =  j/i  +  xzj  +  (2z  —  l)k.  Careful:  See  Problem  9. 

11.  Given  that  B  =  curl  A,  use  the  divergence  theorem  to  show  that  f  B  •  n  da  over  any 
closed  surface  is  zero. 

12.  A  cylindrical  capacitor  consists  of  two  long  concentric  metal  cylinders.  If  there 
is  a  charge  of  k  coulombs  per  meter  on  the  inside  cylinder  of  radius  Ri,  and  —k 
coulombs  per  meter  on  the  outside  cylinder  of  radius  R2,  find  the  electric  field  E 
between  the  cylinders.  Hint:  Use  Gauss’s  law  and  the  method  indicated  in  Figure 
10.7.  What  is  E  inside  the  inner  cylinder?  Outside  the  outer  cylinder?  (Again  use 
Gauss’s  law.)  Find,  either  by  inspection  or  by  direct  integration,  the  potential  <j> 
such  that  E  =  —  V</>  for  each  of  the  three  regions  above.  In  each  case  E  is  not 
affected  by  adding  an  arbitrary  constant  to  <f>.  Adjust  the  additive  constant  to  make 
<j>  a  continuous  function  for  all  space. 

13.  Draw  a  figure  similar  to  Figure  10.6  but  with  q  outside  the  surface.  A  vector  (like 
r  in  the  figure)  from  q  to  the  surface  now  intersects  it  twice,  and  for  each  solid 
angle  dQ  there  are  two  da’s,  one  where  r  enters  and  one  where  it  leaves  the  surface. 
Show  that  D  •  nda  is  given  by  (10.21)  for  the  da  where  r  leaves  the  surface  and  the 
negative  of  (10.21)  for  the  da  where  r  enters  the  surface.  Hence  show  that  the  total 
fD  ■  nda  over  the  closed  surface  is  zero. 

14.  Obtain  Coulomb’s  law  from  Gauss’s  law  by  considering  a  spherical  surface  a  with 
center  at  q. 

15.  Suppose  the  density  p  of  a  fluid  varies  from  point  to  point  as  well  as  with  time, 
that  is,  p  =  p(x,y,  z,t).  If  we  follow  the  fluid  along  a  streamline,  then  x,  y,  z  are 
functions  of  t  such  that  the  fluid  velocity  is 
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Show  that  then  dp/dt  =  dp/dt  +  v  •  Vp.  Combine  this  equation  with  (10.9)  to  get 

„V.v  +  J=0. 

(Physically,  dp/dt  is  the  rate  of  change  of  density  with  time  as  we  follow  the  fluid 
along  a  streamline;  dp/dt  is  the  corresponding  rate  at  a  fixed  point.)  For  a  steady 
state  (that  is,  time- independent),  dp/dt  =  0,  but  dp/dt  is  not  necessarily  zero. 
For  an  incompressible  fluid,  dp/dt  =  0;  show  that  then  V  •  v  =  0.  (Note  that 
incompressible  does  not  necessarily  mean  constant  density  since  dp/dt  =  0  does  not 
imply  either  time  or  space  independence  of  p;  consider,  for  example,  a  flow  of  water 
mixed  with  blobs  of  oil.) 

16.  The  following  equations  are  variously  known  as  Green’s  first  and  second  identities 
or  formulas  or  theorems.  Derive  them,  as  indicated,  from  the  divergence  theorem. 

(1)  J  (<j>V2ip  +  'V(l)-Vi’)dT=  £  (0Vi/>)  •  nda. 

volume  r  closed 

inside  a  surface  a 

To  prove  this,  let  V  =  in  the  divergence  theorem. 


(2)  / 
volume  r 
inside  a 


(rt>'V2ip-'ip'V2<l>)  dr 


(0Vi/>  —  i/’V(/>)  •  ndo\ 


closed 
surface  a 


To  prove  this,  copy  Theorem  1  above  as  is  and  also  with  cj>  and  ip  interchanged; 
then  subtract  the  two  equations. 


11.  THE  CURL  AND  STOKES' THEOREM 

We  have  already  defined  curlV  =  V  X  V  [see  (7.3)]  and  have  considered  one 
application  of  the  curl,  namely,  to  determine  whether  or  not  a  line  integral  between 
two  points  is  independent  of  the  path  of  integration  (Section  8).  Here  is  another 
application  of  the  curl.  Suppose  a  rigid  body  is  rotating  with  constant  angular 
velocity  lj;  this  means  that  |u;|  is  the  magnitude  of  the  angular  velocity  and  uj  is 
a  vector  along  the  axis  of  rotation  (see  Figure  2.6).  Then  we  showed  in  Section 
2  that  the  velocity  v  of  a  particle  in  the  rigid  body  is  v  =  u  X  r,  where  r  is 
a  radius  vector  from  a  point  on  the  rotation  axis  to  the  particle.  Let  us  calculate 
V  X  v  =  Vx(wxr);  we  can  evaluate  this  by  the  method  described  in  Section  7.  We 
use  the  formula  for  the  triple  vector  product  A  X  (B  X  C)  =  (A  •  C)B  —  (A  ■  B)C, 
being  careful  to  remember  that  V  is  not  an  ordinary  vector — it  has  both  vector 
and  differential-operator  properties,  and  so  must  be  written  before  variables  that  it 
differentiates.  Then 


(11.1)  V  X  (uj  X  r)  =  (V  •  y)uj  —  ( uj  •  V)r. 

Since  uj  is  constant,  the  first  term  of  (11-1)  means 


(11.2) 


In  the  second  term  of  (11-1)  we  intentionally  wrote  uj  •  V  instead  of  V  •  uj  since  uj 
is  constant,  and  V  operates  only  on  r;  this  term  means 


d_ 

'  dx 


d_ 

'dy 


d_ 

'  dz 


(ia;  +  jy  +  kz)  =  i  u>x  +  j  u>y  +  k  ujz  =  uj 
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since  dy/dx  =  dz/dx  =  0,  etc.  Then 


(11.3)  Vxv  =  Vx  (uxr)  =  2u  or  ui  =  i(V  X  v). 

This  result  gives  a  clue  as  to  the  name  curlv  (or  rotation  v  or  rot  v  as  it  is 
sometimes  called).  For  this  simple  case  curlv  gave  the  angular  velocity  of  rotation. 
In  a  more  complicated  case  such  as  flow  of  fluid,  the  value  of  curlv  at  a  point  is  a 
measure  of  the  angular  velocity  of  the  fluid  in  the  neighborhood  of  the  point.  When 
V  X  v  =  0  everywhere  in  some  region,  the  velocity  field  v  is  called  irrotational  in 
that  region.  Notice  that  this  is  the  same  mathematical  condition  as  for  a  force  F 
to  be  conservative. 


(a)  Vortex 


(b)  Parallel  flow, 
constant  velocity 


(c)  Parallel  flow, 
variable  velocity 


Paddle  wheel 
probe 


(d)  Flow  around 
a  corner 


Figure  11.1 

Consider  a  vector  field  V  (for  example,  V  =  vp  for  flow  of  water,  or  V  =  force 
F).  We  define  the  circulation  as  the  line  integral  j>V  •  dr  around  a  closed  plane 
curve.  If  V  is  a  force  F,  then  this  integral  is  equal  to  the  work  done  by  the  force. 
For  flow  of  water,  we  can  get  a  physical  picture  of  the  meaning  of  the  circulation  in 
the  following  way.  Think  of  placing  a  tiny  paddle-wheel  probe  (Figure  11.1c)  in  any 
of  the  flow  patterns  pictured  in  Figure  11.1.  If  the  velocity  of  the  fluid  is  greater  on 
one  side  of  the  wheel  than  on  the  other,  for  example,  as  in  (c),  then  the  wheel  will 
turn.  Suppose  we  calculate  the  circulation  j>  V  •  dr  around  the  axis  of  the  paddle 
wheel  along  a  closed  curve  in  a  plane  perpendicular  to  the  axis  (plane  of  the  paper 
in  Figure  11.1).  If  V  =  vp  is  larger  on  one  side  of  the  wheel  than  the  other,  then 
the  circulation  is  different  from  zero,  but  if  [as  in  (b)]  V  is  the  same  on  both  sides, 
then  the  circulation  is  zero.  We  shall  show  that  the  component  of  curlV  along  the 
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axis  of  the  paddle  wheel  equals 


(11.4) 


lim  — 
da — *0  da 


V  •  dr 


where  da  is  the  area  inclosed  by  the  curve  along  which  we  calculate  the  circulation. 
The  paddle  wheel  then  acts  as  a  “curl  meter”  to  measure  curlV;  if  it  does  not 
rotate,  curlV  =  0;  if  it  does,  then  curlV  0.  In  (a),  curlV  ^  0  at  the  center  of 
the  vortex.  In  (b),  curlV  =  0.  In  (c),  curlV  ^  0  in  spite  of  the  fact  that  the  flow 
lines  are  parallel.  In  (d),  it  is  possible  to  have  curlV  =  0  even  though  the  stream 
lines  go  around  a  corner;  in  fact,  for  the  flow  of  water  around  a  corner,  curlV  =  0. 
What  you  should  realize  is  that  the  value  of  curlV  at  a 
circulation  in  the  neighborhood  of  the  point  and  not  on 
the  overall  flow  pattern. 

We  want  to  show  the  relation  between  the  circulation 
j>V-dr  and  curlV  for  a  given  vector  field  V.  Given  a 
point  P  and  a  direction  n,  let  us  find  the  component  of 
curlV  in  the  direction  n  at  P.  Draw  a  plane  through  P 
perpendicular  to  n  and  choose  axes  so  that  it  is  the  (a;,  y) 
plane  with  n  parallel  to  k.  Find  the  circulation  around 
an  element  of  area  da  centered  on  P.  (See  Figures  9.5 
and  11.2.)  By  (9.18)  with  area  A  replaced  by  the  element 
of  area  da ,  and  with  n  =  k 


point  depends  upon  the 


•4 


da  =  dx  dy 


P# 


dy 


dx 


Figure  11.2 


(11.5)  V  •  dr  =  J J (cmlV) -kdxdy  =  JJ (curl  V)  •  n  da 


around  da 


da 


da 


Note  that,  since  we  proved  (9.7)  and  so  (9.18)  for  non-rectangular  areas  A  (see 
Section  9),  da  here  may  be  more  general  than  dxdy ,  say  with  curved  or  slanted 
sides. 

We  assume  that  the  components  of  V  have  continuous  first  derivatives;  then 
curlV  is  continuous.  Thus  the  value  of  (curlV)  •  n  over  da  is  nearly  the  same 
as  (curlV)  •  n  at  P,  so  the  double  integral  in  (11.5)  is  approximately  the  value  of 
(curlV)  •  n  at  P  multiplied  by  da.  If  we  divide  (11.5)  by  da  and  take  the  limit  as 
da  0,  we  have  an  exact  equation 

(11.6)  (V  X  V)-n=  lim  -J-  /  V  •  dr. 

da—>0  da  J 

around  da 


This  equation  can  be  used  as  a  definition  of  curl  V ;  then  the  discussion  above  shows 
that  [see  equation  (9.16)]  the  components  of  curlV  are  those  given  in  our  previous 
definition  (7.3). 

In  evaluating  the  line  integral  we  must  go  around  the  area  element  da  as  in 
Figure  11.2  keeping  the  area  to  our  left.  Another  way  of  saying  this  is  that  we 
go  around  da  in  the  direction  indicated  by  n  and  the  right-hand  rule;  that  is,  if 
the  thumb  of  your  right  hand  points  in  the  direction  n,  your  fingers  curve  in  the 
direction  you  must  go  around  the  boundary  of  da  in  evaluating  the  line  integral. 
(See  Figure  11.2  with  n  =  k.) 
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Stokes’  Theorem  This  theorem  relates  an  integral  over  an  open  surface  to  the 
line  integral  around  the  curve  bounding  the  surface  (Figure  11.3).  A  butterfly 
net  is  a  good  example  of  what  we  are  talking  about;  the  net  is  the  surface  and 
the  supporting  rim  is  the  curve  bounding  the  surface.  The  surfaces  we  consider 
here  (and  which  arise  in  applications)  will  be  surfaces  which  could  be  obtained 
by  deforming  a  hemisphere  (or  the  butterfly  net  of  Figure  11.3).  In  particular, 
the  surfaces  we  consider  must  be  two-sided.  You  can  easily  construct  a  one-sided 
surface  by  taking  a  long  strip  of  paper,  giving  it  a  half  twist,  and  joining  the  ends 
(Figure  11.4).  A  belt  of  this  shape  is  sometimes  used  for  driving  machinery.  This 
surface  is  called  a  Moebius  strip,  and  you  can  verify  that  it  has  only  one  side  by 
tracing  your  finger  around  it  or  imagining  trying  to  paint  one  side.  Stokes’  theorem 
does  not  apply  to  such  surfaces  because  we  cannot  define  the  sense  of  the  normal 
vector  n  to  such  a  surface.  We  require  the  bounding  curve  to  be  simple  (that  is,  it 
must  not  cross  itself)  and  closed. 


Figure  11.3 


Figure  11.4 


Consider  the  kind  of  surface  we  have  described  and  imagine  it  divided  into 
area  elements  da  by  a  network  of  curves  as  in  Figure  11.5.  Draw  a  unit  vector  n 
perpendicular  to  each  area  element;  n,  of  course,  varies  from  element  to  element, 
but  all  n’s  must  be  on  the  same  side  of  the  two-sided  surface.  Each  area  element 


Figure  11.5 

is  approximately  an  element  of  the  tangent  plane  to  the  surface  at  a  point  in  da. 
Then,  as  in  (11.5),  we  have 


(11.7) 


V  •  dr 


X  V)  •  n  da 


around  da 


da 


for  each  element.  Recall  from  Section  9  and  the  comment  just  after  equation  (11.5), 
that  da  includes  area  elements  such  as  those  along  the  edges  in  Figure  11.5.  Then 
if  we  sum  the  equations  in  (11.7)  for  all  the  area  elements  of  the  whole  surface  area, 
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we  get 

(11.8) 


all  da 


<fv-dr  = 


(V 


surface  a 


X  V)  •  n  da. 


From  Figure  11.5  we  see  that  all  the  interior  line  integrals  cancel  because  along  a 
border  between  two  da's  the  two  integrals  are  in  opposite  directions.  Then  the  left 
side  of  (11-8)  becomes  simply  the  line  integral  around  the  outside  curve  bounding 
the  surface.  Thus  we  have  Stokes’  theorem  as  stated  in  (9.19): 


(11.9) 

j)  V  •  dr  =  jj  (V  X  V)  •  nda. 

Stokes’  theorem 

,  cu*Ye  surface  a 

bounding  a 

You  should  have  it  clearly  in  mind  that  this  is  for  an  open  surface  bounded  by  a 
simple  closed  curve.  Recall  the  example  of  a  butterfly  net.  Notice  that  Stokes’  the¬ 
orem  says  that  the  line  integral  j>  V  •  dr  is  equal  to  the  surface  integral  of  (V  X  V)  •  n 
over  any  surface  of  which  the  curve  is  a  boundary;  in  other  words,  you  don’t  change 
the  value  of  the  integral  by  deforming  the  butterfly  net!  An  easy  way  to  determine 
the  direction  of  integration  for  the  line  integral  is  to  imagine  collapsing  the  surface 
and  its  bounding  curve  into  a  plane;  then  the  “surface”  is  just  the  plane  area  inside 
the  curve  and  n  is  normal  to  the  plane.  The  direction  of  integration  is  then  given 
by  the  right-hand  rule  as  discussed  just  after  equation  (11.6). 

Example  1.  Given  V  =  4yi  +  xj  +  22k,  find  f (V  X  V)  •  nda  over  the  hemisphere  x 2  + 
y2  +  z2  =  a2,  z>  0. 

Using  (7.3),  we  find  that  V  X  V  =  —3k.  There  are  several  ways  we  could  do 
the  problem:  (a)  integrate  the  expression  as  it  stands;  (b)  use  Stokes’  theorem  and 
evaluate  j>V-dr  around  the  circle  x2  +  y2  =  a2  in  the  ( x ,  y)  plane;  (c)  use  Stokes’ 
theorem  to  say  that  the  integral  is  the  same  over  any  surface  bounded  by  this  circle, 
for  example,  the  plane  area  inside  the  circle!  Since  this  plane  area  is  in  the  ( x,y ) 
plane,  we  have 

n  =  k,  (V  x  V)  •  n  =  -3k  •  k  =  -3, 

so  the  integral  is 

—3  J  da  =  —3  •  na2  =  — 37ra2. 

This  is  the  easiest  way  to  do  the  problem;  however,  for  this  simple  case  it  is  not  too 
hard  by  the  other  methods.  We  shall  leave  (b)  for  you  to  do  and  do  (a).  Since  the 
surface  is  a  sphere  with  center  at  the  origin,  r  is  normal  to  it  (but  for  any  surface 
we  could  get  the  normal  from  the  gradient).  Then  on  the  surface 

r  r  ix  +  jy  +  kz 

n  jr  |  a  a 

(V  X  V)  •  n  =  —3k  •  —  =  —3  —  . 

a  a 
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We  want  to  evaluate  f  —3(z/a)  da  over  the  hemisphere.  In  spherical  coordinates 
(see  Chapter  5,  Section  4)  we  have 


z  =  r  cos  I 
2  . 


da  =  r  sin  9  dO  d(j>. 

For  our  surface  r  =  a.  Then  the  integral  is 

/•2tt  p'k/2 


/>2-7t 


r7r/2 


n  cos  0 

I  I  —3 - a2  sin  0d9d(j)  =  —  3a2  /  d(j>  I  sin  9  cos  0  d6 

l(/,=oJe=o  a  Jo  Jo 

=  —3a2  -2n  ■-  =  —3i ra2 
2 


(as  before). 


Ampere’s  Law  Stokes’  theorem  is  of  interest  in  electromagnetic  theory.  (Com¬ 
pare  the  use  of  the  divergence  theorem  in  connection  with  Gauss’s  law  in  Section 
10.)  Ampere’s  circuital  law  (in  SI  units)  says  that 


H  •  dr 


c 


I, 


where  H  =  B//xo,  B  is  the  magnetic  field,  p, o  is  a  constant  (called  the  permeability 
of  free  space),  C  is  a  closed  curve,  and  /  is  the  current  “linking”  C,  that  is  crossing 
any  surface  area  bounded  by  C .  The  surface  area  and  the  curve  C  are  related  just 
as  in  Stokes’  theorem  (butterfly  net  and  its  rim).  If  we  think  of  a  bundle  of  wires 
linking  a  closed  curve  C  (Figure  11.6)  and  then  spreading  out,  we  can  see  that  the 
same  current  crosses  any  surface  whose  bounding  curve  is  C. 

Just  as  Gauss’s  law  (10.23)  is  useful  in  computing  electric  fields,  so  Ampere’s 
law  is  useful  in  computing  magnetic  fields.  Consider,  for  example,  a  long  straight 
wire  carrying  a  current  /  (Figure  11.7).  At  a  distance  r  from  the  wire,  H  is  tangent 


Figure  11.6  Figure  11.7 


to  a  circle  of  radius  r  in  a  plane  perpendicular  to  the  wire.  By  symmetry,  |H|  same 
at  all  points  of  the  circle.  We  can  then  find  |H|  by  Ampere’s  law.  Taking  C  to  be 
the  circle  of  radius  r,  we  have 


r2n 


H  •  dr  = 


\H\rd6=  IHIr  ■  2ir  =  I 


c 


|H|  = 


I 

2iTr 


or 
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If,  in  Figure  11.6,  J  is  the  current  density  (current  crossing  unit  area  perpendic¬ 
ular  to  J),  then  J  •  nda  is  the  current  across  a  surface  element  da  [compare  (10.4)] 
and  JJ  J  •  n  da,  over  any  surface  a  bounded  by  C,  is  the  total  current  I  linking  C. 
Then  by  Ampere’s  law 


H  •  dr  = 


J  •  n  da. 


a 


By  Stokes’  theorem 


H  •  dr 


X  H)  •  n  da, 


G 


so  we  have 


(V  X  H)  •  nda  = 


J  •  n  da. 


G 


G 


Since  this  is  true  for  any  a,  we  have  VxH  =  J,  which  is  one  of  the  Maxwell  equa¬ 
tions.  Alternatively,  we  could  start  with  the  Maxwell  equation  and  apply  Stokes’ 
theorem  to  get  Ampere’s  law. 


Conservative  Fields  We  next  want 
to  state  carefully,  and  use  Stokes’  the¬ 
orem  to  prove,  under  what  conditions 
a  given  field  F  is  conservative  (see  Sec¬ 
tion  8).  First,  recall  that  in  physical 
problems  we  are  often  interested  only 
in  a  particular  region  of  space,  and  our 
formulas  (say  for  F)  may  very  well  be 
correct  only  in  that  region.  For  exam¬ 
ple,  the  gravitational  pull  of  the  earth 
on  an  object  is  proportional  to  1/r2  for 
r  >  earths’  radius  R,  but  this  is  not  a 
correct  formula  for  r  <  R  (see  Prob¬ 
lem  8.21).  The  electric  field  in  the  re¬ 
gion  between  the  plates  of  a  cylindrical 
capacitor  is  proportional  to  1/r  (problem  10.12),  but  only  in  this  region  is  this 
formula  correct.  We  must,  then,  consider  the  kind  of  region  in  which  a  given  field 
F  is  defined.  Consider  the  shaded  regions  in  Figure  11.8.  We  say  that  a  region  is 
simply  connected  if  any  simple^  closed  curve  in  the  region  can  be  shrunk  to  a  point 
without  encountering  any  points  not  in  the  region.  You  can  see  in  Figure  11.8c  that 
the  dotted  curve  surrounds  the  “hole”  and  so  cannot  be  shrunk  to  a  point  in  the 
region;  this  region  is  then  not  simply  connected.  The  “hole”  is  sometimes  only  a 
single  point,  but  this  is  enough  to  make  the  region  not  simply  connected.  In  three 
dimensions  the  region  between  cylindrical  capacitor  plates  (infinitely  long)  is  not 
simply  connected  since  a  loop  of  string  around  the  inner  cylinder  (see  cross  section, 
Figure  11.8c)  cannot  be  drawn  up  to  a  knot.  Similarly,  the  interior  of  an  inner 
tube  is  not  simply  connected.  The  region  between  two  concentric  spheres  is  simply 
connected,  however.  You  should  see  this  by  realizing  that  you  could  pull  up  into  a 


Not  simply  connected 

Figure  11.8 


'A  simple  curve  does  not  cross  itself;  for  example,  a  figure  eight  is  not  a  simple  curve. 
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knot,  a  loop  of  string  placed  anywhere  in  this  region.  We  shall  now  state  and  prove 
our  theorem. 


(11.10) 

If  the  components  of  F  and  their  first  partial  derivatives  are  continuous  in  a 
simply  connected  region,  then  any  one  of  the  following  five  conditions  implies  all 
the  others. 

(a)  curlF  =  0  at  every  point  of  the  region. 

(b)  j>  F  •  dr  =  0  around  every  simple  closed  curve  in  the  region. 

(c)  F  is  conservative,  that  is  ff  F  •  dr  is  independent  of  the  path  of  integration 
from  A  to  B.  (The  path  must,  of  course,  lie  entirely  in  the  region.) 

(d)  F  •  dr  is  an  exact  differential  of  a  single  valued  function. 

(e)  F  =  grad  W,  W  single- valued. 


We  shall  show  that  each  of  these  conditions  implies  the  one  following  it.  We 
can  use  Stokes’  theorem  to  prove  (b)  assuming  (a).  First  select  any  simple  closed 
curve  and  let  it  be  the  bounding  curve  for  the  surface  in  Stokes’  theorem.  Since 
the  region  is  simply  connected  we  can  think  of  shrinking  the  curve  to  a  point  in 
the  region;  as  it  shrinks  it  traces  out  a  surface  which  we  use  as  the  Stokes’  theorem 
surface.  Assuming  (a),  we  have  curlF  =  0  at  every  point  of  the  region  and  so  also 
at  every  point  of  the  surface.  Thus  the  surface  integral  in  Stokes’  theorem  is  zero 
and  therefore  the  line  integral  around  the  closed  curve  equals  zero.  This  gives  (b). 

To  show  that  (b)  implies  (c),  consider  any  two  paths  I  and  II 
from  A  to  B  (Figure  11.9).  From  (b)  we  have 

nB  pA 

F-dr+  F -dr  =  0.  1 

J  A  JB 

path  i  path  ii  Figure  11.9 


Since  an  integral  from  A  to  B  is  the  negative  of  an  integral  from  B  to  A,  we  have 


F-dr  - 


F  •  dr  =  0. 


Ja 

path  I 


Ja 

path  II 


which  is  (c). 

To  show  that  (c)  implies  (d),  select  some  reference  point  O  in  the  region  and 
calculate  f  F  •  dr  from  the  reference  point  to  every  other  point  of  the  region.  For 
each  point  P  we  find  a  single  value  of  the  integral  no  matter  what  path  of  integration 
we  choose  from  O  to  P.  Let  this  value  be  the  value  of  the  function  W  at  the  point 
P .  We  then  have  a  single-valued  function  W  such  that 


0  to  P 


F  •  dr  =  W(P). 
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Then  (since  F  is  continuous),  dW  =  F  •  dr,  that  is,  F  •  dr  is  the  differential  of  a 
single-valued  function  W.  Since  dW  =  VVF  •  dr  =  F  •  dr  for  arbitrary  dr,  we  have 
F  =  VW  which  is  (e). 

Finally,  (e)  implies  (a)  as  we  proved  in  Section  8.  (The  continuity  of  the  com¬ 
ponents  of  F  and  their  partial  derivatives  makes  the  second-order  mixed  partial 
derivatives  of  W  equal.)  Thus  we  have  shown  that  any  one  of  the  five  conditions 
(a)  to  (e)  implies  the  others  under  the  conditions  of  the  theorem.  It  is  worth  observ¬ 
ing  carefully  the  requirement  that  F  and  its  partial  derivatives  must  be  continuous 
in  a  simply  connected  region.  A  simple  example  makes  this  clear.  Look  at  Example 
2  in  Section  8;  you  can  easily  compute  curlF  and  find  that  it  is  zero  everywhere 
except  at  the  origin  (where  it  is  undefined).  You  might  then  be  tempted  to  assume 
that  j>  F  •  dr  =  0  around  any  closed  path.  But  we  found  that  F  •  dr  =  d6,  and  the 
integral  of  dd  along  a  circle  with  center  at  the  origin  is  27t.  What  is  wrong?  The 
trouble  is  that  F  does  not  have  continuous  partial  derivatives  at  the  origin,  and 
any  simply  connected  region  containing  the  circle  of  integration  must  contain  the 
origin.  Then  curlF  is  not  zero  at  every  point  inside  the  integration  curve.  Notice 
also  that  F  •  dr  =  dd  is  an  exact  differential,  but  not  of  a  single-valued  function;  d 
increases  by  27t  every  time  we  go  around  the  origin. 

A  vector  field  V  is  called  irrotational  (or  conservative  or  lamellar)  if  curl  V  =  0; 
in  this  case  V  =  grad  W,  where  W  (or  its  negative)  is  called  the  scalar  potential. 
If  div  V  =  0,  the  vector  field  is  called  solenoidal,  in  this  case  V  =  curl  A,  where  A 
is  a  vector  function  called  the  vector  potential.  It  is  easy  to  prove  (Problem  7.17d) 
that  if  V  =  V  X  A,  then  div  V  =  0.  It  is  also  possible  to  construct  an  A  (actually 
an  infinite  number  of  A’s)  so  that  V  =  curl  A  if  we  know  that  V  •  V  =  0. 


Example  2.  Given  V  =  i(x2  —  yz)  —  j2 yz  +  k (z2  —  2 zx),  find  A  such  that  V  =  V  X  A. 
We  find 


d-v  = 


\  9  ,  „  .  d  ,  o 

yz)  +  -{-2yt)  +  -{z 


=  2x  —  2z  +  2z  —  2x  =  0. 


2  zx) 


Thus  V  is  solenoidal  and  we  proceed  to  find  A.  We  are  looking  for  an  A  such  that 


(11.11)  V  =  curl  A  = 


d  d  d 
dx  dy  dz 


AX  Ay  Az 


=  i(x2  —  yz)  —  j2  yz  +  k  (z2  —  2  zx). 


There  are  many  A’s  satisfying  this  equation;  we  shall  show  first  how  to  find  one 
of  them  and  then  a  general  formula  for  all.  It  is  possible  to  find  an  A  with  one 
zero  component;  let  us  take  Ax  =  0.  Then  the  y  and  2  components  of  curl  A  each 
involve  just  one  component  of  A.  From  (11.11),  the  y  and  2  components  of  curl  A 
are 


-2  yz  = 


dAz 
dx  ’ 


z2  —  2 zx  = 


dAy_ 

dx 


(11.12) 
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If  we  integrate  (11.12)  partially  with  respect  to  x  (that  is,  with  y  and  z  constant), 
we  find  Ay  and  Az  except  for  possible  functions  of  y  and  z  which  could  be  added 
without  changing  (11.12): 


(11.13) 


Ay  =  z2x  -  zx 2  +  fi(y,  z), 
Az  =  2  xyz  +  f2(y,z). 


Substituting  (11.13)  into  the  x  component  of  (11.11),  we  get 


(11.14) 


x2  —  yz  = 


dAz 

dy 


dAy 

dz 


=  2  xz  +  —  2  zx  +  x2 

dy 


dfi 
dz  ' 


We  now  select  fi  and  to  satisfy  (11.14).  There  is  much  leeway  here  and  this 
can  easily  be  done  by  inspection.  We  could  take  =  0,  /i  =  \ yz 2,  or  /i  =  0, 
/2  =  —  \y2z,  and  so  forth.  Using  the  second  choice,  we  have 


(11.15)  A  =  j(z2x  —  zx2)  +  k(2  xyz  —  \)y2z). 

You  may  wonder  why  this  process  works  and  what  divV  =  0  has  to  do  with  it. 
We  can  answer  both  these  questions  by  following  the  above  process  with  a  general 

V  rather  than  a  special  example.  Given  that  divV  =  0,  we  want  an  A  such  that 

V  =  curl  A.  We  try  to  find  one  with  Ax  =  0.  Then  the  y  and  z  components  of 

V  =  curl  A  are 


(11.16) 

Then  we  have 


Vy  =  ~ 


dAz 

dx 


Vz  = 


dAy 

dx 


(11.17) 


Ay  — 


Vz  dx  +  f(y,z), 


Az 


Vydx  +  g(y,z). 


The  x  component  of  V  =  curl  A  is 


(11.18) 


-  dAz  _  dAy  - 

dy  dz 


Since  div  V  =  0,  we  can  put 


(dVy_ 

\  dy 


dx  +  h(y,  z). 


(11.19) 


dV^  dVA  dVx 

dy  dz  )  dx 


into  (11.18),  getting 

Vx  =  J  ^ -dx  +  h(y,  z ). 

This  is  correct  with  proper  choice  of  h(y,  z). 

When  we  know  one  A,  for  which  a  given  V  is  equal  to  curl  A,  all  others  are  of 
the  form 


(11.20)  A  +  V«, 

where  u  is  any  scalar  function.  For  (see  Problem  7.17b),  V  X  V«  =  0,  so  the 
addition  of  Vu  to  A  does  not  affect  V.  Also  we  can  show  that  all  possible  A’s  are 
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of  the  form  (11.20).  For  if  V  =  curl  Ai  and  V  =  curl  A2,  then  curl(Ai  —  A2)  =  0, 
so  Ai  —  A2  is  the  gradient  of  some  scalar  function. 

A  careful  statement  and  proof  that  div  V  =  0  is  a  necessary  and  sufficient  con¬ 
dition  for  V  =  curl  A  requires  that  V  have  continuous  partial  derivatives  at  every 
point  of  a  region  which  is  simply  connected  in  the  sense  that  every  closed  surface 
(rather  than  closed  curve)  can  be  shrunk  to  a  point  in  the  region  (for  example,  the 
region  between  two  concentric  spheres  is  not  simply  connected  in  this  sense). 


PROBLEMS,  SECTION  11 

1.  Do  case  (b)  of  Example  1  above. 

2.  Given  the  vector  A  =  ( x 2  —  y2)i  +  2xyj. 

(a)  Find  V  X  A. 

(b)  Evaluate  //  (V  X  A)  •  da  over  a  rectangle  in  the  ( x,y )  plane  bounded  by  the 
lines  x  =  0,  x  =  a,  y  =  0,  y  =  b. 

(c)  Evaluate  f  A  ■  dr  around  the  boundary  of  the  rectangle  and  thus  verify  Stokes’ 
theorem  for  this  case. 

Use  either  Stokes’  theorem  or  the  divergence  theorem  to  evaluate  each  of  the  following 
integrals  in  the  easiest  possible  way. 

3.  JJ  face  curl(x2i  +  22j  —  y2k)-nda,  where  a  is  the  part  of  the  surface  2  =  4  —  x2  —  y2 
above  the  (x,  y)  plane. 

4.  ff  curl(yi  +  2j  )  •  nda,  where  a  is  the  surface  in  the  first 
octant  made  up  of  part  of  the  plane  2x  +  3y  +  4z  =  12,  and 
triangles  in  the  (x,  2)  and  (y,  2)  planes,  as  indicated  in  the 
figure. 

5.  ff  r  •  nda  over  the  surface  in  Problem  4,  where  r  =  ix  + 
jy  +  k2.  Hint:  See  Problem  10.9. 


6.  ffV-n  da  over  the  closed  surface  of  the  tin  can  bounded  by  x2  +  y2  =  9,  2  =  0, 
2  =  5,  if 

V  =  2xyi  -  y2 j  +  (2  +  xy) k. 

7.  ff  (curl  V)  •  n  da  over  any  surface  whose  bounding  curve  is  in  the  (x,  y)  plane,  where 

V  =  (x  —  x2z)  i  +  (yz3  —  y2)  j  +  (x2y  —  xz)  k. 


8.  ff  curl(  x2 y\  —  xz k)  •  nda  over  the  dosed  surface  of  the  ellipsoid 


Warning:  Stokes’  theorem  applies  only  to  an  open  surface.  Hints:  Could  you  cut 
the  given  surface  into  two  halves?  Also  see  (d)  in  the  table  of  vector  identities  (page 
339). 

9.  ffV-n  da  over  the  entire  surface  of  the  volume  in  the  first  octant  bounded  by 
x2  +  y2  +  z2  =  16  and  the  coordinate  planes,  where 

V  =  (x  +  x2  —  y2)  i  +  (2  xyz  —  2xy)j  —  xz2  k. 


10.  //(curl  V)  •  n  da  over  the  part  of  the  surface  2  =  9  —  x2  —  9 y2  above  the  (x,  y)  plane, 
if  V  =  2xyi  +  (x2  —  2x)j  —  x2z2  k. 
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11.  ffV-n  da  over  the  entire  surface  of  a  cube  in  the  first  octant  with  edges  of  length 
2  along  the  coordinate  axes,  where 

V  =  ( * 2  —  y2)  i  +  3yj  —  2xzk. 

12.  V  •  dr  around  the  circle  ( x  —  2)2  +  (y  —  3)2  =  9,  z  =  0,  where 

V  =  (x2  +  yz2) i  +  (2x  -  y3) j. 


13.  f f  (2xi  —  2yj  +  5k)  •  n  da  over  the  surface  of  a  sphere  of  radius  2  and  center  at  the 
origin. 

14.  <f(y i  —  ,rj  +  zk)  •  dr  around  the  circumference  of  the  circle  of  radius  2,  center  at  the 
origin,  in  the  ( x ,  y)  plane. 

15.  j>cy  dx  +  z  dy  +  x  dz,  where  C  is  the  curve  of  intersection  of  the  surfaces  whose 
equations  are  x  +  y  =  2  and  x2  +  y2  +  z2  =  2(x  +  y). 

16.  What  is  wrong  with  the  following  “proof”  that  there  are  no  magnetic  fields?  By 
electromagnetic  theory,  V  •  B  =  0,  and  B  =  V  X  A.  (The  error  is  not  in  these 
equations.)  Using  them,  we  find 


V  •  B 


dr  =  0 


B  •  n  dcr 


(by  the  divergence  theorem) 


X  A)  •  n  da  = 


A  •  dr  (by  Stokes’  theorem). 


Since  f  A-  dr  =  0,  A  is  conservative,  or  A  =  Vi/>.  Then  B  =  VxA  =  Vx  Vi p  =  0, 
so  B  =  0. 


17.  Derive  the  following  vector  integral  theorems. 


(a) 


V  <p  dr  = 


<j>  n  da. 


surface 
inclosing  r 


Hint:  In  the  divergence  theorem  (10.17),  substitute  V  =  <j> C,  where  C  is  an 
arbitrary  constant  vector,  to  obtain  C  -  fV^dr  =  C  ■  <f  (pn  da.  Since  C  is 
arbitrary,  let  C  =  i  to  show  that  the  x  components  of  the  two  integrals  are 
equal;  similarly,  let  C  =  j  and  C  =  k  to  show  that  the  y  components  are  equal 
and  the  2  components  are  equal. 


(b) 

(c) 

(d) 

(e) 


V  X  V  dr  = 


n  X  V  da. 


surface 
nclosing  ' 


Hint:  Replace  V  in  the  divergence  theorem  by  V  X  C,  where  C  is  an  arbitrary 
constant  vector.  Follow  the  last  part  of  the  hint  in  (a). 


/  / 


curve 
bounding  cr 


>dr=  c t>  (n  X  Vi))dff. 

surface  a 


dr  X  V  = 


curve 

bounding  a 


/  < 


n  X  V )  X  V  da. 


Hints  for  (c)  and  (d):  Use  the  substitutions  suggested  in  (a)  and  (b)  but  in 
Stokes’  theorem  (11.9)  instead  of  the  divergence  theorem. 


/ 


6V  •  V dr  =  j)  <j>V-nda —  J  V-Vipdr. 

volume  r  surface  volume  r 

inclosing  r 

Hint:  Integrate  (7.6)  over  volume  r  and  use  the  divergence  theorem. 
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(f) 


(g) 


V-  (V  X  U)dr  = 


U-  (V  X  V)dr  + 


(U  X  V)  •  n  da. 


surface 
inclosing  r 


Hint:  Integrate  (h)  in  the  Table  of  Vector  Identities  (page  339)  and  use  the 
divergence  theorem. 


/ 


</>(V  X  V)  •  n  da  = 


surface  of  c r 


/ 


(V  X  V0)  •  nda  + 


surface  of  a 


curve 

bounding  cr 


0V  •  dr. 


Hint:  Integrate  (g)  in  the  Table  of  Vector  Identities  (page  339)  and  use  Stokes’ 
Theorem. 


Find  vector  fields  A  such  that  V  =  curl  A  for  each  given  V. 

18.  V  =  ( x 2  —  yz  +  y)  i  +  {x  —  2yz)i  +  ( z 2  —  2zx  +  x  +  y)k 

19.  V  =  i(x2  —  2  xz)  +  j  (y2  —  2xy)  +  k  (z2  —  2yz  +  xy) 

20.  V  =  i (zezy  +  x  sin  zx )  +  jx  cos  xz  —  kz  sin  zx 

21.  V  =  -k 

22.  V  =  (y  +  z)i  +  (x  -  «)j  +  (x2  +  y2)k 


12.  MISCELLANEOUS  PROBLEMS 


1.  If  A  and  B  are  unit  vectors  with  an  angle  9  between  them,  and  C  is  a  unit  vector 
perpendicular  to  both  A  and  B,  evaluate  [(A  X  B)  X  (B  X  C)]  X  (C  X  A). 

2.  If  A  and  B  are  the  diagonals  of  a  parallelogram,  find  a  vector  formula  for  the  area 
of  the  parallelogram. 

3.  The  force  on  a  charge  q  moving  with  velocity  v  =  dr/dt  in  a  magnetic  Held  B  is 
F  =  q(v  X  B).  We  can  write  B  as  B  =  V  X  A  where  A  (called  the  vector  potential) 
is  a  vector  function  of  x,  y,  z,  t.  If  the  position  vector  r  =  i:r  +  jy  +  kz  of  the  charge 
q  is  a  function  of  time  t,  show  that 


dA 

dt 


9A  „  . 

~&t  +  v  VA 


Thus  show  that 


F  =  qv  X  (V  X  A)  =  q 


V(v  •  A) 


dA  dA 

dt  dt 


4.  Show  that  V  •  (U  X  r)  =  r  •  (V  X  U)  where  U  is  a  vector  function  of  x,  y,  z,  and 
r  =  a:i  +  yj  +  zk. 

5.  Use  Green’s  theorem  (Section  9)  to  do  Problem  8.2. 

6.  Find  the  torque  about  the  point  (1,  —2, 1)  due  to  the  force  F  =  2i  —  j  +  3k  acting 
at  the  point  (1, 1,  —3). 

7.  Let  F  =  2i  —  3j  +  k  act  at  the  point  (5, 1,  3). 

(a)  Find  the  torque  of  F  about  the  point  (4, 1,0). 

(b)  Find  the  torque  of  F  about  the  line  r  =  4i  +  j  +  (2i  +  j  —  2k)t. 

8.  The  force  F  =  i  —  2j  —  2k  acts  at  the  point  (0, 1,  2).  Find  the  torque  of  F  about  the 

line  r  =  (2i  —  j)t. 

Let  F  =  i  —  5j  +  2k  act  at  the  point  (2, 1,  0).  Find  the  torque  of  F  about  the  line 
r  =  (3j  +  4k)  —  2it. 


9. 
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10.  Given  u  =  xy  +  sin  z,  find 

(a)  the  gradient  of  u  at  (1,  2,7r/2); 

(b)  how  fast  u  is  increasing,  in  the  direction  4i  +  3j,  at  (1,  2, 7r/2); 

(c)  the  equation  of  the  tangent  plane  to  the  surface  u  =  3  at  (1,  2, 7r/2). 

11.  Given  <f>  =  z2  —  3 xy,  find 

(a)  grad  </>; 

(b)  the  directional  derivative  of  cp  at  the  point  (1,  2,  3)  in  the  direction  i  +  j  +  k; 

(c)  the  equations  of  the  tangent  plane  and  of  the  normal  line  to  tp  =  3  at  the  point 

(1,2,3). 

12.  Given  u  =  xy  +  yz  +  z sin®,  find 

(a)  Vtt  at  (0, 1,2); 

(b)  the  directional  derivative  of  u  at  (0, 1,  2)  in  the  direction  2i  +  2 j  —  k; 

(c)  the  equations  of  the  tangent  plane  and  of  the  normal  line  to  the  level  surface 
u  =  2  at  (0, 1,2); 

(d)  a  unit  vector  in  the  direction  of  most  rapid  increase  of  u  at  (0, 1,2). 

13.  Given  <p  =  x2  —  yz  and  the  point  P( 3, 4, 1),  find 

(a)  Vcp  at  P; 

(b)  a  unit  vector  normal  to  the  surface  <p  =  5  at  P; 

(c)  a  vector  in  the  direction  of  most  rapid  increase  of  <p  at  P; 

(d)  the  magnitude  of  the  vector  in  (c); 

(e)  the  derivative  of  <p  at  P  in  a  direction  parallel  to  the  line  r  =  i  —  j  +  2k  +  (6i  — 
j  -  4k)  t. 

14.  If  the  temperature  is  T  =  x2  —  xy  +  z2,  find 

(a)  the  direction  of  heat  flow  at  (2, 1,  —1); 

(b)  the  rate  of  change  of  temperature  in  the  direction  j  —  k  at  (2, 1,  —1). 

15.  Show  that 

F  =  y2zsinh{2xz)\  +  2y  cosh2(*«)j  +  y2x  sinh(2a;^)k 
is  conservative,  and  find  a  scalar  potential  <j>  such  that  F  =  —  V0. 

16.  Given  Fi  =  2xzi  +  yj  +  x2k  and  F2  =  yi  —  a-j: 

(a)  Which  F,  if  either,  is  conservative? 

(b)  If  one  of  the  given  F’s  is  conservative,  find  a  function  W  so  that  F  =  VW. 

(c)  If  one  of  the  F’s  is  nonconservative,  use  it  to  evaluate  f  F-dr  along  the  straight 
line  from  (0, 1)  to  (1,  0). 

(d)  Do  part  (c)  by  applying  Green’s  theorem  to  the  triangle  with  vertices  (0,0), 
(0,1),  (1,0). 

17.  Find  the  value  of  J  F  •  dr  along  the  circle  x2  +  y2  =2  from  (1, 1)  to  (1,  —1)  if 

F  =  (2x  —  3y)i  -  (3x  -  2y)j. 

18.  Is  F  =  yi  +  xzj  +  zk  conservative?  Evaluate  f  F  ■  dr  from  (0,  0,  0)  to  (1, 1, 1)  along 
the  paths 
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(a)  broken  line  (0,  0,  0)  to  (1,  0,  0)  to  (1, 1,  0)  to  (1, 1, 1), 

(b)  straight  line  connecting  the  points. 

19.  Given  Fi  =  -2yi  +  (z  -  2x)j  +  (y  +  z) k,  F2  =  yi  +  2xj: 

(a)  Is  Fi  conservative?  Is  F2  conservative? 

(b)  Find  the  work  done  by  F2  on  a  particle  that  moves  around  the  ellipse  x  =  cos  9 , 
y  =  2  sin  9  from  9  =  0  to  9  =  2n. 

(c)  For  any  conservative  force  in  this  problem  find  a  potential  function  V  such 
that  F  =  -  W. 

(d)  Find  the  work  done  by  Fi  on  a  particle  that  moves  along  the  straight  line  from 
(0,1,0)  to  (0,2,5). 

(e)  Use  Green’s  theorem  and  the  result  of  Problem  9.7  to  do  Part  (b)  above. 

In  Problems  20  to  31,  evaluate  each  integral  in  the  simplest  way  possible. 

20.  f  f  P  •  nda  over  the  upper  half  of  the  sphere  r  =  1  if  P  =  curl(jx  —  kz). 

21.  JJ(VXV).  n  da  over  the  surface  consisting  of  the  four  slanting  faces  of  a  pyramid 
whose  base  is  the  square  in  the  (x,  y)  plane  with  corners  at  (0,  0),  (0,  2),  (2,  0),  (2,  2) 
and  whose  top  vertex  is  at  (1, 1,  2),  where 

V  =  (x2  z  —  2)i  +  (x  +  y  —  z)  j  —  xj/zk. 

22.  ff  V  ■  nda  over  the  entire  surface  of  the  sphere  ( x  —  2)2  +  (y  +  3)2  +  z2  =9,  if 

V  =  (3*  —  yz)i  +  (z2  -  y2) j  +  (2 yz  +  x2)k. 

23.  ffF-n  da  where  F  =  ( y 2  —  x2)i  +  ( 2xy  —  y)j  +  3zk  and  a  is  the  entire  surface  of 
the  tin  can  bounded  by  the  cylinder  x2  +  y2  =  16,  z  =  3,  z  =  —3. 

24.  ffr-n  da  over  the  entire  surface  of  the  hemisphere  x2  +  y2  +  z2  =  9,  z  >  0,  where 
r  =  xi  +  yj  +  z  k. 

25.  ff  V-n  da  over  the  curved  part  of  the  hemisphere  in  Problem  24,  ifV  =  curl(yi— xj). 

26.  JJ(curlV)  -  nda  over  the  entire  surface  of  the  cube  in  the  first  octant  with  three 
faces  in  the  three  coordinate  planes  and  the  other  three  faces  intersecting  at  (2,  2,  2), 
where 

V  =  (2  —  y)  i  +  xzj  +  xyzk. 

27.  Problem  26,  but  integrate  over  the  open  surface  obtained  by  leaving  out  the  face  of 
the  cube  in  the  ( x,y )  plane. 

28.  f  F  -  dr  around  the  circle  x2  +  y2  +  2x  =  0,  where  F  =  j/i  —  xj. 

29.  f  V-dr  around  the  boundary  of  the  square  with  vertices  (1,  0),  (0, 1),  (—1, 0),  (0,  —1), 
if  V  =  x2i  +  5xj. 

30-  fc  (x2  —  y)dx+  (x  +  y3)dy,  where  C  is  the  parallelogram  with  vertices  at  (0,  0),  (2,  0), 

(1.1).  (3,1). 

f  (y2  —  x2)  dx  +  (2 xy  +  3 )dy  along  the  x  axis  from  (0, 0)  to  (\/5, 0)  and  then  along 
a  circular  arc  from  (\/5, 0)  to  (1,2).  Hint:  Use  Green’s  theorem. 
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Table  of  Vector  Identities  Involving  V 


Note  carefully  that  <p  and  ip  are  scalar  functions;  U  and  V  are  vector  functions. 
Formulas  are  given  in  rectangular  coordinates;  for  other  coordinate  systems,  see 
Chapter  10,  Section  9. 


(a)  V  •  V0  =  div  grad  =  X72cp  =  Laplacian  cp 


d2(p  d2(p  d2(p 
dx2  dy 2  dz2 


(b)  V  X  Vc/>  =  curl  grad  <p  =  0 


(c)  V(V-V) 


grad  div  V 

d2Vy  d2Vz\  fdH^  dH 4 
\  dx 2  dxdy  dxdz )  J  \  dxdy  dy 2 

(d2Vx  d2Vy  d2Vz\ 

\dxdz  dydz  dz2  ) 


d2Vz\ 

dydz  J 


(d)  V  •  (V  X  V)  =  div  curl  V  =  0 

(e)  V  X  (V  X  V)  =  curl  curl  V  =  V(V  •  V)  —  V2V  =  grad  div  V  —  Laplacian  V 

(f)  V  •  (<pv)  =  (P{V  ■  V)  +  V  .  (V0) 

(g)  V  x  (<pV)  =  (p(V  x  V)  -  V  x  (V0) 

(h)  V  •  (U  x  V)  =  V  •  (V  x  U)  -  U  •  (V  x  V) 

(i)  V  x  (U  x  V)  =  (V  •  V)U  -  (U  •  V)V  -  V(V  •  U)  +  U(V  •  V) 

(j)  V(U  •  V)  =  U  x  (V  x  V)  +  (U  •  V)V  +  V  x  (V  x  U)  +  (V  •  V)U 

(k)  V  •  (V(p  x  Vip)  =  0 
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Fourier  Series  and  Transforms 


1.  INTRODUCTION 

Problems  involving  vibrations  or  oscillations  occur  frequently  in  physics  and  engi¬ 
neering.  You  can  think  of  examples  you  have  already  met:  a  vibrating  tuning  fork, 
a  pendulum,  a  weight  attached  to  a  spring,  water  waves,  sound  waves,  alternating 
electric  currents,  etc.  In  addition,  there  are  many  more  examples  which  you  will 
meet  as  you  continue  to  study  physics.  Some  of  them — for  example,  heat  conduc¬ 
tion,  electric  and  magnetic  fields,  light — do  not  appear  in  elementary  work  to  have 
anything  oscillatory  about  them,  but  will  turn  out  in  your  more  advanced  work  to 
involve  the  sines  and  cosines  which  are  used  in  describing  simple  harmonic  motion 
and  wave  motion. 

In  Chapter  1  we  discussed  the  use  of  power  series  to  approximate  complicated 
functions.  In  many  problems,  series  called  Fourier  series,  whose  terms  are  sines  and 
cosines,  are  more  useful  than  power  series.  In  this  chapter  we  shall  see  how  to  find 
and  use  Fourier  series.  Then,  in  Chapter  13  (Sections  2  to  4),  we  shall  consider 
several  of  the  physics  problems  which  Fourier  was  trying  to  solve  when  he  invented 
Fourier  series. 

Since  sines  and  cosines  are  periodic  functions,  Fourier  series  can  represent  only 
periodic  functions.  We  will  see  in  Section  12  how  to  represent  a  non-periodic  func¬ 
tion  by  a  Fourier  integral  (Fourier  transform). 

2.  SIMPLE  HARMONIC  MOTION  AND  WAVE  MOTION; 

PERIODIC  FUNCTIONS 

We  shall  need  much  of  the  notation  and  terminology 
used  in  discussing  simple  harmonic  motion  and  wave 
motion.  Let’s  discuss  these  two  topics  briefly. 

Let  particle  P  (Figure  2.1)  move  at  constant 
speed  around  a  circle  of  radius  A.  At  the  same 
time,  let  particle  Q  move  up  and  down  along  the 
straight  line  segment  RS  in  such  a  way  that  the 
y  coordinates  of  P  and  Q  are  always  equal.  If  u>  is 
the  angular  velocity  of  P  in  radians  per  second,  and 
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(Figure  2.1)  6  =  0  when  t  =  0,  then  at  a  later  time  t 

(2.1)  d  =  tot. 

The  y  coordinate  of  Q  (which  is  equal  to  the  y  coordinate  of  P)  is 

(2.2)  y  =  Asind  =  Asimot. 

The  back  and  forth  motion  of  Q  is  called  simple  harmonic  motion.  By  definition,  an 
object  is  executing  simple  harmonic  motion  if  its  displacement  from  equilibrium  can 
be  written  as  A  sin  cot  [or  A  cos  cot  or  A  sin(u;f +</>),  but  these  two  functions  differ  from 
Asinwf  only  in  choice  of  origin;  such  functions  are  called  sinusoidal  functions ].  You 
can  think  of  many  physical  examples  of  this  sort  of  simple  vibration:  a  pendulum, 
a  tuning  fork,  a  weight  bobbing  up  and  down  at  the  end  of  a  spring. 

The  x  and  y  coordinates  of  particle  P  in  Figure  2.1  are 

(2.3)  x  =  Acosiot,  y  =  Asincut. 

If  we  think  of  P  as  the  point  z  =  x  +  iy  in  the  complex  plane,  we  could  replace  (2.3) 
by  a  single  equation  to  describe  the  motion  of  P: 

(2.4)  z  =  x  +  iy  =  A(cosLot  +  i  sin  cot) 

=  Aeiut. 


It  is  often  worth  while  to  use  this  complex  notation  even  to  describe  the  motion 
of  Q;  we  then  understand  that  the  actual  position  of  Q  is  equal  to  the  imaginary 
part  of  z  (or  with  different  starting  conditions  the  real  part  of  z).  For  example,  the 
velocity  of  Q  is  the  imaginary  part  of 

(2.5)  —  =  —  (Aelwt)  =  Aiujelult  =  Aiuo{cosLot  +  ism  cut). 

Lit  L lb 

[The  imaginary  part  of  (2.5)  is  Auocosuot,  which  is  dy/dt  from  (2.2).] 


It  is  useful  to  draw  a  graph  of  x  and  y  in  (2.2)  and  (2.3)  as  a  function  of  t. 
Figure  2.2  represents  any  of  the  functions  sin  u>t,  cos  cot,  sin(o;£  +  cf>)  if  we  choose 
the  origin  correctly.  The  number  A  is  called  the  amplitude  of  the  vibration  or  the 
amplitude  of  the  function.  Physically  it  is  the  maximum  displacement  of  Q  from 
its  equilibrium  position.  The  period  of  the  simple  harmonic  motion  or  the  period  of 
the  function  is  the  time  for  one  complete  oscillation,  that  is,  2i t/lo  (See  Figure  2.2). 

We  could  write  the  velocity  of  Q  from  (2.5)  as 


(2.6) 


—  =  Aco  cos  tot  =  B  cos  Lot. 
dt 
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Here  B  is  the  maximum  value  of  the  velocity  and  is  called  the  velocity  amplitude. 
Note  that  the  velocity  has  the  same  period  as  the  displacement.  If  the  mass  of  the 
particle  Q  is  m ,  its  kinetic  energy  is: 


(2.7) 


Kinetic  energy 


1  ( dy\  1  2  2 

-to  (  —j—  )  =  -mB  cos  uf. 

2  \dt  2 


We  are  considering  an  idealized  harmonic  oscillator  which  does  not  lose  energy. 
Then  the  total  energy  (kinetic  plus  potential)  must  be  equal  to  the  largest  value  of 
the  kinetic  energy,  that  is,  \ mB 2 .  Thus  we  have: 


(2.8) 


Total  energy  =  —to B 


2 


Notice  that  the  energy  is  proportional  to  the  square  of  the  (velocity)  amplitude;  we 
shall  be  interested  in  this  result  later  when  we  discuss  sound. 

Waves  are  another  important  example  of  an  oscillatory  phenomenon.  The  math¬ 
ematical  ideas  of  wave  motion  are  useful  in  many  fields;  for  example,  we  talk  about 
water  waves,  sound  waves,  and  radio  waves. 


Example  1.  Consider  water  waves  in  which  the  shape  of  the  water  surface  is  (unrealis¬ 
tically!)  a  sine  curve.  If  we  take  a  photograph  (at  the  instant  t  =  0)  of  the  water 
surface,  the  equation  of  this  picture  could  be  written  (relative  to  appropriate  axes) 

,  .  27t:e 

(2.9)  y  =  A  sin—, 

where  x  represents  horizontal  distance  and  A  is  the  distance  between  wave  crests. 
Usually  A  is  called  the  wavelength ,  but  mathematically  it  is  the  same  as  the  period 
of  this  function  of  x.  Now  suppose  we  take  another  photograph  when  the  waves  have 
moved  forward  a  distance  vt  (v  is  the  velocity  of  the  waves  and  t  is  the  time  between 
photographs).  Figure  2.3  shows  the  two  photographs  superimposed.  Observe  that 
the  value  of  y  at  the  point  x  on  the  graph  labeled  t,  is  just  the  same  as  the  value  of  y 
at  the  point  (x  —  vt)  on  the  graph  labeled  t  =  0.  If  (2.9)  is  the  equation  representing 
the  waves  at  t  =  0,  then 


Figure  2.3 

27T 

(2.10)  y  =  A  sin  —  (x  —  vt) 

A 

represents  the  waves  at  time  t.  We  can  interpret  (2.10)  in  another  way.  Suppose 
you  stand  at  one  point  in  the  water  [fixed  x  in  (2.10)]  and  observe  the  up  and  down 
motion  of  the  water,  that  is,  y  in  (2.10)  as  a  function  of  t  (for  fixed  x).  This  is  a 
simple  harmonic  motion  of  amplitude  A  and  period  X/v.  You  are  doing  something 
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analogous  to  this  when  you  stand  still  and  listen  to  a  sound  (sound  waves  pass  your 
ear  and  you  observe  their  frequency)  or  when  you  listen  to  the  radio  (radio  waves 
pass  the  receiver  and  it  reacts  to  their  frequency) . 

We  see  that  y  in  (2.10)  is  a  periodic  function  either  of  x  ( t  fixed)  or  of  t  ( x  fixed); 
both  interpretations  are  useful.  It  makes  no  difference  in  the  basic  mathematics, 
however,  what  letter  we  use  for  the  independent  variable.  To  simplify  our  notation 
we  shall  ordinarily  use  x  as  the  variable,  but  if  the  physical  problem  calls  for  it,  you 
can  replace  x  by  t. 


Figure  2.4 

Sines  and  cosines  are  periodic  functions;  once  you  have  drawn  sin  a;  from  x  =  0 
to  x  =  2-7T,  the  rest  of  the  graph  from  x  =  —  oo  to  x  =  +oo  is  just  a  repetition  over 
and  over  of  the  0  to  27r  graph.  The  number  27t  is  the  period  of  sins.  A  periodic 
function  need  not  be  a  simple  sine  or  cosine,  but  may  be  any  sort  of  complicated 
graph  that  repeats  itself  (Figure  2.4).  The  interval  of  repetition  is  the  period. 

Example  2.  If  we  are  describing  the  vibration  of  a  seconds  pendulum,  the  period  is  2  sec 
(time  for  one  complete  back-and-forth  oscillation).  The  reciprocal  of  the  period  is 
the  frequency ,  the  number  of  oscillations  per  second;  for  the  seconds  pendulum,  the 
frequency  is  ^  sec-1.  When  radio  announcers  say,  “operating  on  a  frequency  of  780 
kilohertz,”  they  mean  that  780,000  radio  waves  reach  you  per  second,  or  that  the 
period  of  one  wave  is  (1/780,000)  sec. 

By  definition,  the  function  /(x)  is  periodic  if  f(x  +  p)  =  /(x)  for  every  x;  the 
number  p  is  the  period.  The  period  of  sinx  is  2n  since  sin(x  +  27r)  =  sinx;  similarly, 
the  period  of  sin  2nx  is  1  since  sin  27r(x  +  1)  =  sin(27rx  +  27r)  =  sin  2-7tx,  and  the 
period  of  sin(7rx/^)  is  21  since  sin(7r//)(x  +  21)  =  sin(7rx/i).  In  general,  the  period 
of  sin  27 ix /T  is  T. 

PROBLEMS,  SECTION  2 

In  Problems  1  to  6  find  the  amplitude,  period,  frequency,  and  velocity  amplitude  for  the 
motion  of  a  particle  whose  distance  s  from  the  origin  is  the  given  function. 


1. 

s  =  3  cos  5 1 

2. 

s  =  2  sin(4£  —  1) 

3. 

S  =  |  COs(7Tf  —  8) 

4. 

s  =  5  sin(£  —  7r ) 

5. 

s  =  2  sin  3£  cos  3 1 

6. 

s  =  3  sin(2£  +  7t/8)  +  3  sin(2£ 

In  Problems  7  to  10  you  are  given  a  complex  function  z  =  /(f).  In  each  case,  show  that 
a  particle  whose  coordinate  is  (a)  x  =  Rez,  (b)  y  =  Im  z  is  undergoing  simple  harmonic 
motion,  and  find  the  amplitude,  period,  frequency,  and  velocity  amplitude  of  the  motion. 

7.  z  =  5e“  8.  z  =  2e~it/2  9.  z  =  2eint  10.  z  =  -4ei(2t+37r) 
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11.  The  charge  q  on  a  capacitor  in  a  simple  a-c  circuit  varies  with  time  according  to  the 
equation  q  =  3sin(1207rf  +  7t/4).  Find  the  amplitude,  period,  and  frequency  of  this 
oscillation.  By  definition,  the  current  flowing  in  the  circuit  at  time  t  is  I  =  dq/dt. 
Show  that  I  is  also  a  sinusoidal  function  of  t,  and  find  its  amplitude,  period,  and 
frequency. 


12.  Repeat  Problem  11:  (a)  if  q  =  Re4e30i,rt;  (b)  if  q  =  Im4e30i,rt. 

13.  A  simple  pendulum  consists  of  a  point  mass  m  suspended  by  a 
(weightless)  cord  or  rod  of  length  Z,  as  shown,  and  swinging  in 
a  vertical  plane  under  the  action  of  gravity.  Show  that  for  small 
oscillations  (small  9),  both  9  and  x  are  sinusoidal  functions  of  time, 
that  is,  the  motion  is  simple  harmonic.  Hint:  Write  the  differential 
equation  F  =  raa  for  the  particle  m.  Use  the  approximation 
sin#  =  9  for  small  9,  and  show  that  9  =  AsinaA  is  a  solution  of 
your  equation.  What  are  A  and  u>? 


-///////////A 
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14.  The  displacements  x  of  two  simple  pendulums  (see  Problem  13)  are  4sin(7rf/3)  and 
3sin(7rt/4).  They  start  together  at  x  =  0.  How  long  will  it  be  before  they  are 
together  again  at  x  =  0?  Hint:  Sketch  or  computer  plot  the  graphs. 


15.  As  in  Problem  14,  the  displacements  x  of  two  simple  pendulums  are  x  =  —  2cos(f/2) 
and  3sin(Z/3).  They  are  not  together  at  t  =  0;  plot  graphs  to  see  when  they  are 
first  together. 

16.  As  in  Problem  14,  let  the  displacements  be  y\  =  3sin(t/\/2)  and  j/2  =  sinf.  The 
pendulums  start  together  at  t  =  0.  Make  computer  plots  to  estimate  when  they  will 
be  together  again  and  then,  by  computer,  solve  the  equation  yi  =  j/2  for  the  root 
near  your  estimate. 


17.  Show  that  equation  (2.10)  for  a  wave  can  be  written  in  all  these  forms: 


y  =  A  sin  —  ( x  —  vt )  =  A  sin  2-7T 
=  Asino;  -  tj  =  Asin  _  2tt f?j  m  Asin  ^  -  tj  . 


Here  A  is  the  wavelength,  /  is  the  frequency,  v  is  the  wave  velocity,  T  is  the  period, 
and  lo  =  2nf  is  called  the  angular  frequency.  Hint:  Show  that  v  =  A/. 


In  Problems  18  to  20,  find  the  amplitude,  period,  frequency,  wave  velocity,  and  wavelength 
of  the  given  wave.  By  computer,  plot  on  the  same  axes,  y  as  a  function  of  x  for  the  given 
values  of  t,  and  label  each  graph  with  its  value  of  t.  Similarly,  plot  on  the  same  axes,  y  as 
a  function  of  t  for  the  given  values  of  x,  and  label  each  curve  with  its  value  of  x. 

18.  y  =  2sin  ^n(x  —  3 f);  t  =  0,  |,  |,  §;  x  =  0,  1,  2,  3. 

19.  y  =  cos2?r(x  -  ±t);  t  =  0,  1,  2,  3;  x  =  Q,  §. 

20.  y  =  3sin7r(a;  —  |i);  t  =  0,  1,  2,  3;  x  =  0,  1,  |,  2. 

21.  Write  the  equation  for  a  sinusoidal  wave  of  wavelength  4,  amplitude  20,  and  veloc¬ 

ity  6.  (See  Problem  17.)  Make  computer  plots  of  y  as  a  function  of  t  for  x  =  0,  1, 
2,  3,  and  of  y  as  a  function  of  x  for  t  =  0,  i,  1.  If  this  wave  represents  the  shape 
of  a  long  rope  which  is  being  shaken  back  and  forth  at  one  end,  find  the  velocity 
dy/dt  of  particles  of  the  rope  as  a  function  of  x  and  t.  (Note  that  this  velocity  has 
nothing  to  do  with  the  wave  velocity  v,  which  is  the  rate  at  which  crests  of  the  wave 
move  forward.) 
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22.  Do  Problem  21  for  a  wave  of  amplitude  4,  period  6,  and  wavelength  3.  Make 

computer  plots  of  y  as  a  function  of  x  when  t  =  0,  1,  2,  3,  and  of  y  as  a  function  of 
t  when  x  =  1,  § ,  2. 

23.  Write  an  equation  for  a  sinusoidal  sound  wave  of  amplitude  1  and  frequency  440  hertz 
(1  hertz  means  1  cycle  per  second).  (Take  the  velocity  of  sound  to  be  350  m/sec.) 

24.  The  velocity  of  sound  in  sea  water  is  about  1530  m/sec.  Write  an  equation  for  a 
sinusoidal  sound  wave  in  the  ocean,  of  amplitude  1  and  frequency  1000  hertz. 

25.  Write  an  equation  for  a  sinusoidal  radio  wave  of  amplitude  10  and  frequency  600 
kilohertz.  Hint:  The  velocity  of  a  radio  wave  is  the  velocity  of  light,  c  =  3- 10s  m/sec. 

3.  APPLICATIONS  OF  FOURIER  SERIES 

We  have  said  that  the  vibration  of  a  tuning  fork  is  an  example  of  simple  harmonic 
motion.  When  we  hear  the  musical  note  produced,  we  say  that  a  sound  wave  has 
passed  through  the  air  from  the  tuning  fork  to  our  ears.  As  the  tuning  fork  vibrates 
it  pushes  against  the  air  molecules,  creating  alternately  regions  of  high  and  low 


Low  High 

pressure  pressure 


Figure  3.1 

pressure  (Figure  3.1).  If  we  measure  the  pressure  as  a  function  of  x  and  t  from  the 
tuning  fork  to  us,  we  find  that  the  pressure  is  of  the  form  of  (2.10);  if  we  measure 
the  pressure  where  we  are  as  a  function  of  t  as  the  wave  passes,  we  find  that  the 
pressure  is  a  periodic  function  of  t.  The  sound  wave  is  a  pure  sine  wave  of  a  definite 
frequency  (in  the  language  of  music,  a  pure  tone).  Now  suppose  that  several  pure 
tones  are  heard  simultaneously.  In  the  resultant  sound  wave,  the  pressure  will  not 
be  a  single  sine  function  but  a  sum  of  several  sine  functions.  If  you  strike  a  piano 
key  you  do  not  get  a  sound  wave  of  just  one  frequency.  Instead,  you  get  a  fundamen¬ 
tal  accompanied  by  a  number  of  overtones  (harmonics)  of  frequencies  2,  3,  4,  •  •  • , 
times  the  frequency  of  the  fundamental.  Higher  frequencies  mean  shorter  periods.  If 
sin  ut  and  cos  u>t  correspond  to  the  fundamental  frequency,  then  sin  nut  and  cos  nut 
correspond  to  the  higher  harmonics.  The  combination  of  the  fundamental  and  the 
harmonics  is  a  complicated  periodic  function  with  the  period  of  the  fundamental 
(Problem  5).  Given  the  complicated  function,  we  could  ask  how  to  write  it  as  a 
sum  of  terms  corresponding  to  the  various  harmonics.  In  general  it  might  require 
all  the  harmonics,  that  is,  an  infinite  series  of  terms.  This  is  called  a  Fourier  series. 
Expanding  a  function  in  a  Fourier  series  then  amounts  to  breaking  it  down  into 
its  various  harmonics.  In  fact,  this  process  is  sometimes  called  harmonic  analysis. 
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There  are  applications  to  other  fields  besides  sound.  Radio  waves,  visible  light, 
and  x  rays  are  all  examples  of  a  kind  of  wave  motion  in  which  the  “waves”  correspond 
to  varying  electric  and  magnetic  fields.  Exactly  the  same  mathematical  equations 
apply  as  for  water  waves  and  sound  waves.  We  could  then  ask  what  light  frequencies 
(these  correspond  to  the  color)  are  in  a  given  light  beam  and  in  what  proportions. 
To  find  the  answer,  we  would  expand  the  given  function  describing  the  wave  in  a 
Fourier  series. 

You  have  probably  seen  a  sine  curve  used  to  represent  an  alternating  current 
(a-c)  or  voltage  in  electricity.  This  is  a  periodic  function,  but  so  are  the  functions 
shown  in  Figure  3.2.  Any  of  these  and  many  others  might  represent  signals  (volt¬ 
ages  or  currents)  which  are  to  be  applied  to  an  electric  circuit.  Then  we  could  ask 


(a)  Square  wave 


(b)  Sawtooth 


(c)  Rectified  half-wave 


Figure  3.2 


what  a-c  frequencies  (harmonics)  make  up  a  given  signal  and  in  what  proportions. 
When  an  electric  signal  is  passed  through  a  network  (say  a  radio),  some  of  the 
harmonics  may  be  lost.  If  most  of  the  important  ones  get  through  with  their 
relative  intensities  preserved,  we  say  that  the  radio  possesses  “high  fidelity.”  To 
find  out  which  harmonics  are  the  important  ones  in  a  given  signal,  we  expand  it  in 
a  Fourier  series.  The  terms  of  the  series  with  large  coefficients  then  represent  the 
important  harmonics  (frequencies). 

Since  sines  and  cosines  are  themselves  periodic,  it  seems  rather  natural  to  use 
series  of  them,  rather  than  power  series,  to  represent  periodic  functions.  There  is 
another  important  reason.  The  coefficients  of  a  power  series  are  obtained,  you  will 
recall  (Chapter  1,  Section  12),  by  finding  successive  derivatives  of  the  function  being 
expanded;  consequently,  only  continuous  functions  with  derivatives  of  all  orders  can 
be  expanded  in  power  series.  Many  periodic  functions  in  practice  are  not  continuous 
or  not  differentiable  (Figure  3.2).  Fortunately,  Fourier  series  (unlike  power  series) 
can  represent  discontinuous  functions  or  functions  whose  graphs  have  corners.  On 
the  other  hand,  Fourier  series  do  not  usually  converge  as  rapidly  as  power  series 
and  much  more  care  is  needed  in  manipulating  them.  For  example,  a  power  series 
can  be  differentiated  term  by  term  (Chapter  1,  Section  11),  but  differentiating  a 
Fourier  series  term  by  term  sometimes  produces  a  series  which  doesn’t  converge. 
(See  end  of  Section  9.) 

Our  problem  then  is  to  expand  a  given  periodic  function  in  a  series  of  sines  and 
cosines.  We  shall  take  this  up  in  Section  5  after  doing  some  preliminary  work. 
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PROBLEMS,  SECTION  3 


For  each  of  the  following  combinations  of  a  fundamental  musical  tone  and  some  of  its 
overtones,  make  a  computer  plot  of  individual  harmonics  (all  on  the  same  axes)  and  then 
a  plot  of  the  sum.  Note  that  the  sum  has  the  period  of  the  fundamental  (Problem  5). 


1. 

sin  t  —  |  sin  3t 

2. 

2  cos  t  +  cos  2 1 

3. 

sin  7r t  +  sin  2nt  +  |  sin  3nt 

4. 

cos  2nt  +  cos  4-7rf  +  |  cos  67rt 

5.  Using  the  definition  (end  of  Section  2)  of  a  periodic  function,  show  that  a  sum  of 
terms  corresponding  to  a  fundamental  musical  tone  and  its  overtones  has  the  period 
of  the  fundamental. 

In  Problems  6  and  7,  use  a  trigonometry  formula  to  write  the  two  terms  as  a  single 
harmonic.  Find  the  period  and  amplitude.  Compare  computer  plots  of  your  result  and 
the  given  problem. 

6.  sin  2x  +  sin  2[x  +  7r/3)  7.  cos  ttx  —  cos  7r(x  —  1/2) 

8.  A  periodic  modulated  (AM)  radio  signal  has  the  form 

y  =  [A  +  B  sin  2nft)  sin  2nfc  (t  —  — ^  . 

The  factor  sin27r/c(t  —  x/v)  is  called  the  carrier  wave;  it  has  a  very  high  frequency 
(called  radio  frequency;  fc  is  of  the  order  of  106  cycles  per  second).  The  amplitude 
of  the  carrier  wave  is  {A  +  B  sin  2-7r/f).  This  amplitude  varies  with  time  -hence  the 
term  “amplitude  modulation” — with  the  much  smaller  frequency  of  the  sound  being 
transmitted  (called  audio  frequency;  /  is  of  the  order  of  102  cycles  per  second).  In 
order  to  see  the  general  appearance  of  such  a  wave,  use  the  following  simple  but 
unrealistic  data  to  sketch  a  graph  of  y  as  a  function  of  t  for  x  =  0  over  two  periods 
of  the  amplitude  function:  A  =  3,  B  =  1,  /  =  1,  fc  =  20.  Using  trigonometric 
formulas,  show  that  y  can  be  written  as  a  sum  of  three  waves  of  frequencies  /c, 
fc  +  /,  and  fc  —  /;  the  first  of  these  is  the  carrier  wave  and  the  other  two  are  called 
side  bands. 

4.  AVERAGE  VALUE  OF  A  FUNCTION 

The  concept  of  the  average  value  of  a  function  is  often  useful.  You  know  how  to  find 
the  average  of  a  set  of  numbers:  you  add  them  and  divide  by  the  number  of  numbers. 


Figure  4.1 
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This  process  suggests  that  we  ought  to  get  an  approximation  to  the  average  value 
of  a  function  /(x)  on  the  interval  (a,  b)  by  averaging  a  number  of  values  of  /(x) 
(Figure  4.1): 

(4.1)  Average  of  f(x)  on  (a,  b)  is  approximately  equal  to 

i)  +  /(x2)  H - F  f{xn) 

n 

This  should  become  a  better  approximation  as  n  increases.  Let  the  points  X\,  x2, 
be  Ax  apart.  Multiply  the  numerator  and  the  denominator  of  the  approximate 
average  by  Ax.  Then  (4.1)  becomes: 

(4.2)  Average  of  /(x)  on  (a,  b)  is  approximately  equal  to 

[/(x i)  H - F  /(xn)]Ax 

n  Ax 

Now  n  Ax  =  b  —  a,  the  length  of  the  interval  over  which  we  are  averaging,  no  matter 
what  n  and  Ax  are.  If  we  let  n  — >  oo  and  Ax  — >  0,  the  numerator  approaches 
/Qb  /(x)  dx,  and  we  have 


(4.3) 


Average  of  f(x)  on  (a,  b)  = 


fa  /O)  dx 

b  —  a 


In  applications,  it  may  happen  that  the  average  value  of  a  given  function  is  zero. 

Example  1.  The  average  of  sinx  over  any  number  of  periods  is  zero.  The  average  value  of 
the  velocity  of  a  simple  harmonic  oscillator  over  any  number  of  vibrations  is  zero. 
In  such  cases  the  average  of  the  square  of  the  function  may  be  of  interest. 

Example  2.  If  the  alternating  electric  current  flowing  through  a  wire  is  described  by  a 
sine  function,  the  square  root  of  the  average  of  the  sine  squared  is  known  as  the 
root-mean-square  or  effective  value  of  the  current,  and  is  what  you  would  measure 
with  an  a-c  ammeter.  In  the  example  of  the  simple  harmonic  oscillator,  the  average 
kinetic  energy  (average  of  ^mv2)  is  times  the  average  of  v2. 


Now  you  can,  of  course,  find  the  average  value  of  sin2  x  over  a  period  (say  —ir 
to  7 r)  by  evaluating  the  integral  in  (4.3).  There  is  an  easier  way.  Look  at  the  graphs 
of  cos2  x  and  sin2  x  (Figure  4.2).  You  can  probably  convince  yourself  that  the  area 
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under  them  is  the  same  for  any  quarter-period  from  0  to  7r/2,  7t/2  to  7r,  etc.  (Also 
see  Problems  2  and  13.)  Then 


Similarly  (for  integral  n/  0), 


But  since  sin2  nx  +  cos2  nx  =  1, 


(4.6) 

Using  (4.5),  we  get 

(4.7) 


/7T  /»7T 

(sin2  nx  +  cos2  ns)  dx=  dx  =  27t. 

-7T  J  —TV 


sin"  nxdx  =  /  cos2  nx  dx  =  n. 


L 


Then  using  (4.3)  we  see  that: 


(4.8) 


The  average  value  (over  a  period)  of  sin2  nx 
=  the  average  value  (over  a  period)  of  cos2  nx 


h 


sin2  nx  dx  = 


27 r 


cos2  nx  dx  = 


2n 


1 

2' 


We  can  say  all  this  more  simply  in  words.  By  (4.5),  the  average  value  of  sin2  nx 
equals  the  average  value  of  cos2  nx.  The  average  value  of  sin2  nx  +  cos2  nx  =  1  is  1. 
Therefore  the  average  value  of  sin2  nx  or  of  cos2  nx  is  ^ .  (In  each  case  the  average 
value  is  taken  over  one  or  more  periods.) 

PROBLEMS,  SECTION  4 

1.  Show  that  if  f(x)  has  period  p,  the  average  value  of  /  is  the  same  over  any  interval 
of  length  p.  Hint:  Write  Ja+P  /(x)  dx  as  the  sum  of  two  integrals  (a  to  p,  and  p  to 
a  +  p)  and  make  the  change  of  variable  x  =  t  +  p  in  the  second  integral. 

2.  (a)  Prove  that  f^2  sin2  x  dx  =  f^2  cos2  x  dx  by  making  the  change  of  variable 

x  =  7r  —  t  in  one  of  the  integrals. 

(b)  Use  the  same  method  to  prove  that  the  averages  of  sin2  (mrx/l)  and  cos2  (mrx/l) 
are  the  same  over  a  period. 

In  Problems  3  to  12,  find  the  average  value  of  the  function  on  the  given  interval.  Use 
equation  (4.8)  if  it  applies.  If  an  average  value  is  zero,  you  may  be  able  to  decide  this  from 
a  quick  sketch  which  shows  you  that  the  areas  above  and  below  the  x  axis  are  the  same. 


3.  sinx  +  2sin2x  +  3sin3x  on  (0,  27r)  4.  1  — e  x  on  (0, 1) 
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K  2  X 

5.  cos  —  on 


K) 


7.  x  —  cos  6x  on 


(*i) 


9.  sin2  3a;  on  (0,  47t) 


11.  sin  x  +  sin2  x  on  (0,  2n) 


6. 

sin  x  on  (0, 7r) 

8. 

.  /  7T  ’ 

sm  2x  on  — ,  - 

V6 

10. 

cos  x  on  (0,  37t) 

12. 

2  7tvx  ( 

cos  — —  on  ( 

2  V 

13.  Using  (4.3)  and  equations  similar  to  (4.5)  to  (4.7),  show  that 


fb  .  2.  ,  fb 

/  sin  kxdx  =  / 

J  a  J  a 


sin  kxdx  =  J  cos  kxdx  =  —  (b  —  a) 


if  k(b  —  a)  is  an  integral  multiple  of  n,  or  if  kb  and  ka  are  both  integral 
multiples  of  7r/2. 


Use  the  results  of  Problem  13  to  evaluate  the  following  integrals  without  calculation. 


14. 

(a) 

(b) 

f^/2  2  (x 

L, 2 cos  U 

15. 

(a) 

f11'4  ^  . 

/  cos  nx  ax 

J  - 1/4 

(b) 

/x?) 

16. 

(a) 

f27r/u  2 
/  sin  ut  dt 

Jo 

(b) 

f  cos2  2nt  dt 
Jo 

5.  FOURIER  COEFFICIENTS 

We  want  to  expand  a  given  periodic  function  in  a  series  of  sines  and  cosines.  To 
simplify  our  formulas  at  first,  we  start  with  functions  of  period  27r;  that  is,  we  shall 
expand  periodic  functions  of  period  2tt  in  terms  of  the  functions  sin  nx  and  cos  nx. 
(Later  we  shall  see  how  we  can  change  the  formulas  to  fit  a  different  period — see 
Section  8.)  The  functions  sinx  and  cosx  have  period  27r;  so  do  sinnx  and  cos  nx 
for  any  integral  n  since  sinn(x  +  2-7t)  =  sin(nx  +  2mr)  =  sinnx.  (It  is  true  that 
sinnx  and  cos  nx  also  have  shorter  periods,  namely  27r/n,  but  the  fact  that  they 
repeat  every  2-7t  is  what  we  are  interested  in  here,  for  this  makes  them  reasonable 
functions  to  use  in  an  expansion  of  a  function  of  period  2n.)  Then,  given  a  function 
f(x)  of  period  27r,  we  write 


(5.1) 


/(x )  =  —ao  +  ai  cos  x  T  Gt2  cos  2x  +  a 3  cos  3x  +  •  •  • 
+  61  sin  x  +  62  sin  2x  +  63  sin  3x  +  •  •  •  , 


and  derive  formulas  for  the  coefficients  an  and  bn.  (The  reason  for  writing  ^ ao  as 
the  constant  term  will  be  clear  later — it  makes  the  formulas  for  the  coefficients 
simpler  to  remember — but  you  must  not  forget  the  ^  in  the  series!) 

In  finding  formulas  for  an  and  bn  in  (5.1)  we  need  the  following  integrals: 
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(5.2) 


The  average  value  of  sin  mx  cos  nx  (over  a  period) 

i  r 

=  —  /  sin  mx  cos  nxdx  =  0. 

2tt 

The  average  value  of  sin  mx  sin  nx  (over  a  period) 
_  [  0,  m  +  n, 

l[ 

=  —  /  sin  mx  sin  nxdx  =  <  i  m  =  n  ^  0, 

2?r  ■'-*  n  n 

^0,  m  =  n  =  0. 

The  average  value  of  cos  mx  cos  nx  (over  a  period) 

_  [  0,  to  ^  n, 

=  —  /  cos  mx  cos  nxdx  =  <  m  =  n  ^  0, 

i  n 

II,  m  =  n  =  0. 


We  have  already  shown  that  the  average  values  of  sin2  nx  and  cos2  nx  are  ^ .  The 
last  integral  in  (5.2)  is  the  average  value  of  1  which  is  1.  To  show  that  the  other 
average  values  in  (5.2)  are  zero  (unless  m  =  n  ^  0),  we  could  use  the  trigonometry 
formulas  for  products  like  sin  6  cos  </>  and  then  integrate.  An  easier  way  is  to  use  the 
formulas  for  the  sines  and  cosines  in  terms  of  complex  exponentials.  [See  (7.1)  or 
Chapter  2,  Section  11.]  We  shall  show  this  method  for  one  integral 


(5.3) 


/»7r  g imx  g—imx  ^ inx  _|_  ^—inx 

sin  mx  cos  nx  dx  =  /  - • - dx. 

2  i  2 


We  can  see  the  result  without  actually  multiplying  these  out.  All  terms  in  the 
product  are  of  the  form  elkx,  where  k  is  an  integer  ^  0  (except  for  the  cross- 
product  terms  when  n  =  m,  and  these  cancel).  We  can  show  that  the  integral  of 
each  such  term  is  zero: 


(5.4) 


/7T  ikx 

eikx  dx  =  — 
ik 


gik-K  _  ^—ik'K 


ik 


=  0 


because  e*fc7r  =  e~lkn  =  coskn  (since  sinfc7r  =  0).  The  other  integrals  in  (5.2)  may 
be  evaluated  similarly  (Problem  12). 

We  now  show  how  to  find  an  and  bn  in  (5.1).  To  find  ao,  we  find  the  average 
value  on  (— 7r,7r)  of  each  term  of  (5.1). 

rmdx^  r  dx+ai±-  r  cos  xdx 

,  ,  2^  J 2  2-7T  J_v  2 7T  J_„ 

2  z*71-  2  r7* 

+  02  —  /  cos  2x  dx  +  •  •  •  +  b\  —  /  sin  xdx  +  ■■  ■  . 
271"  J-n  2tt  J 

By  (5.2),  all  the  integrals  on  the  right-hand  side  of  (5.5)  are  zero  except  the  first, 
because  they  are  integrals  of  sin  mx  cos  nx  or  of  cos  mx  cos  nx  with  n  =  0  and  m/0 
(that  is,  m  ^  n).  Then  we  have 

2 'n!j{x)dx=^hL,ix  =  ai' 


(5.6) 


a  o  — 


f{x)  dx. 
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Given  /(x)  to  be  expanded  in  a  Fourier  series,  we  can  now  evaluate  ao  by  calculating 
the  integral  in  (5.6). 

To  find  a i,  multiply  both  sides  of  (5.1)  by  cos  a;  and  again  find  the  average  value 
of  each  term: 


1  rfn  ,  a0 1  r  i  r  2 

—  /  /(x)cosxdx=  — - /  cosxdx  +  ai  —  /  cos  xdx 

j- 7T  2  27r  J- 7T  27r  d-TT 


(5.7) 


+  a2  —  /  cos  2x  cos  a;  dx  + 
27r  d-TT 

1  t77 

+  &i  —  /  sin  a;  cos  x  dx  +  •  • 
27T  . 


This  time,  by  (5.2),  all  terms  on  the  right  are  zero  except  the  a\  term  and  we  have 
1 


„  ,  ,  cosxdx  =  oi—  f  cos2  x  dx  =  —a\. 

'  2 7T ./ 2 


Solving  for  ai,  we  have 


ai 


/(x)  cos  x  dx. 


The  method  should  be  clear  by  now,  so  we  shall  next  find  a  general  formula  for  an. 
Multiply  both  sides  of  (5.1)  by  cos  nx  and  find  the  average  value  of  each  term: 


—  [  f(x) cos nxdx  =  — —  [  cosnxdx  +  ai—  f  cos x  cos nxdx 
27r  J- 7T  2  27r  J- 7T  2?r  a  —  7T 


(5.8) 


r  —  7T 

1  f77 

a2  —  /  cos  2x  cos  nx  dx 
27T 


+  6i  —  /  sin  x  cos  nx  dx  +  •  •  •  . 
27T 


By  (5.2),  all  terms  on  the  right  are  zero  except  the  an  term  and  we  have 

1  fn  If™  1 

—  /  fix)  cos  nx  dx  =  an —  /  cos2  nx  dx  =  —an. 


27T 

Solving  for  an,  we  have 


Notice  that  this  includes  the  n  =  0  formula,  but  only  because  we  called  the  constant 
term  ^(Iq. 

To  obtain  a  formula  for  bn,  we  multiply  both  sides  of  (5.1)  by  sinnx  and  take 
average  values  just  as  we  did  in  deriving  (5.9).  We  find  (Problem  13) 


1  f™ 

(5.10)  bn  —  —  f{x)  sin  nxdx. 

*  J  —  7T 


The  formulas  (5.9)  and  (5.10)  will  be  used  repeatedly  in  problems  and  should  be 
memorized. 
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Example  1.  Expand  in  a  Fourier  series  the  function  /(s)  sketched  in  Figure  5.1.  This 
function  might  represent,  for  example,  a  periodic  voltage  pulse.  The  terms  of  our 
Fourier  series  would  then  correspond  to  the  different  a-c  frequencies  which  are  com¬ 
bined  in  this  “square  wave”  voltage,  and  the  magnitude  of  the  Fourier  coefficients 
would  indicate  the  relative  importance  of  the  various  frequencies. 

f(*> 


— 2ir  —  it  [0  7r  27 r  37r 

Figure  5.1 

Note  that  /(s)  is  a  function  of  period  2ir.  Often  in  problems  you  will  be  given 
/(s)  for  only  one  period;  you  should  always  sketch  several  periods  so  that  you  see 
clearly  the  periodic  function  you  are  expanding.  For  example,  in  this  problem, 
instead  of  a  sketch,  you  might  have  been  given 


(5.11) 


/(a 0  = 


0,  —7 T  <  X  <  0, 

1,  0  <  X  <  IT. 


It  is  then  understood  that  f(x)  is  to  be  continued  periodically  with  period  27r  outside 
the  interval  (— 7r,7r). 

We  use  equations  (5.9)  and  (5.10)  to  find  an  and  bn : 


\  r  i  r  r°  r 

—  /  f(x)  cos  nx  dx  =  —  /  0  •  cos  nxdx+  /  1  •  cos  nxdx 

*  J — 7r  tt  U-k  Jo 


1 


1  1  . 

—  •  —  sin  nx 


=  —  cos  nx  dx  = 


—  )  n  n 


=  0  for  n  7^  0, 


7T  —  1 


for  n=  0. 

Thus  ao  =  1,  and  all  other  an  =  0. 

l  rn  i  r  r°  r 

bn  =  —  f(x)  sin  nx  dx  =  —  /  0  •  sin  nx  dx  +  /  1-  sin  nxdx 

^  J-TT  K  J-TT  JO 


sin  nx  dx  = 


>0 


—  cos  nx 


'  —TV 

1  ft 


=  K-ir-u 


0  for  even  n, 
—  for  odd  n. 


Putting  these  values  for  the  coefficients  into  (5.1),  we  have 


...  1  2  /  sins  sin  3s  sin  5s 

/(a 0  =  o  +  T 


(5.12) 


1 


3 


5 
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Example  2.  We  can  now  find  the  Fourier  series  for  some  other  functions  without  more 
evaluation  of  coefficients.  For  example,  consider 


(5.13) 


g{x)  = 


—  1,  —7 r  <  x  <  0, 

1,  0  <  X  <  7T. 


Sketch  this  and  verify  that  g(x)  =  2f(x)  —  l,  where  f(x)  is  the  function  in  Example  1. 
Then  from  (5.12),  the  Fourier  series  for  g(x)  is 


(5.14) 


.  .  4  /  sin  x  sin  3x  sin  5x 

9W  =  ^(— +  — +  — 


Similarly,  verify  that  h{x)  =  f(x  +  tt/2)  is  Fig.  5.1  shifted  n/2  to  the  left  (sketch 
it),  and  its  Fourier  series  is  (replace  x  in  (5.12)  by  x  +  7t/2) 

,  ,  .  1  2  /  cos  x  cos  3x  cos  5x 

M*)  =  o  +  - 


2  7T  V  1  3  5 

since  sin(x  +  7t/2)  =  cos  a;,  sin(a;  +  37r/2)  =  —  cos  3a;,  etc. 

PROBLEMS,  SECTION  5 

In  each  of  the  following  problems  you  are  given  a  function  on  the  interval  — 7r  <  x  <  tt. 
Sketch  several  periods  of  the  corresponding  periodic  function  of  period  2n.  Expand  the 
periodic  function  in  a  sine-cosine  Fourier  series. 


!•  /(*)  = 


1,  — 7T  <  X  <  0, 

0,  0  <  X  <  7T. 

In  this  case  the  sketch  is: 


y 

i 

i . i  r . 

i  i  i 

i_ i_ i 

1 

1  t 

1  1 

i 

i 

0 

Tt 

2 TT 
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.  ,  .  .  ...  1  2  /sin*  sin  3®  sin  5a; 

Your  answer  tor  the  senes  is:  j[x)  =  — - — - - 1 - - - 1 - - — 

2  tv  \  1  3  5 

Can  you  use  the  ideas  of  Example  2  to  find  this  result  without  computation? 
f{x)  =  < 


0,  — 7T  <  X  <  0, 

1,  0  <  x  < 

n  7r 

0,  —  <  X  <  7T. 


,  .  1  If  cos  a; 

Answer-.  j(x)  =  —  H —  \  - 

4  7T  \  1 

1  ^  sini 

7T 


cos  3a;  cos  5a; 


2  sin  2a:  sin  3a;  sin  5x 

+  — x - 1 - ^ - 1 - — 


3.  f(x)  = 


7 r 

-7T  <  X  <  -, 
7 r 

—  <  X  <  7T. 


Answer :  f(x)  =  — - 

4  7T 


+ 


cos  a; 

1 

sin  x 


cos  3a;  cos  5a; 

2  sin  2a: 


sin  3a;  sin  5a; 

H - - - t - 


2  sin  6a; 


5 


6 
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4.  f{x)  = 


—  1,  — 7T  <  X  < 


<  x  <  n. 


Could  you  use  Problem  3  to  solve  Problem  4  without  computation? 

0,  — 7T  <  x  <  0, 

5.  f(x)  =  -1.  0  <  x  < 

1,  -  <x  <n. 


6.  /(x)  = 


7.  /(x)  = 


'  7T  .  7T 

1,  —  7r  <  x  <  —  — ,  and  0  <  x  <  — ; 

7T  VF 

0,  —  —  <  x  <  0,  and  —  <  x  <  n. 

<  A  A 


0,  — 7r  <  x  <  0; 

X,  0  <  X  <  7T. 


7T 


Answer :  /(x)  = - —  I  cosx  + 


cos  3x  cos  5x 


32 


+ 


52 


+  ■ 


,  .  sin  2x  sin  3x 

+  smi - - - 1 - - — 


8.  f(x)  =  1  +  X,  — 7T  <  X  <  7T. 

Answer:  f(x)  =  1  +  2  f  sin  x  —  —  sin  2x  +  —  sin  3x  —  —  sin  4x  + 

\  A  o  4 


9.  f(x)  = 


—  X,  —TV  <  X  <  0, 
x,  0  <  x  <  7 r. 


Answer:  f{x)  =  — - (  cos  x  +  —  cos  3x  +  —  cos  5x  + 


9 


25 


10.  f(x)  = 


11-  f{x)  = 


7r  +  x,  —n  <  x  <  0, 
7T  —  X,  0  <  X  <  7T. 

0,  — 7T  <  X  <  0, 

sinx,  0  <  x  <  7r. 


.  ,,,11.  2 

Answer:  j(x)  = - b  —  smx - 

7T  2  7T 


cos  2x 


+ 


cos  4x  cos  6x 

+  - 7  +  ' 


22  —  1  42  -  1  62  -  1 


12.  Show  that  in  (5.2)  the  average  values  of  sinmxsinnx  and  of  cosmxcosnx,  m  n, 
are  zero  (over  a  period),  by  using  the  complex  exponential  forms  for  the  sines  and 
cosines  as  in  (5.3). 


13.  Write  out  the  details  of  the  derivation  of  equation  (5.10). 


6.  DIRICHLET  CONDITIONS 

Now  we  have  a  series,  but  there  are  still  some  questions  that  we  ought  to  get 
answered.  Does  it  converge,  and  if  so,  does  it  converge  to  the  values  of  /(x)?  You 
will  find,  if  you  try,  that  for  most  values  of  x  the  series  in  (5.12)  does  not  respond 
to  any  of  the  tests  for  convergence  that  we  discussed  in  Chapter  1.  What  is  the 
sum  of  the  series  at  x  =  0  where  f(x)  jumps  from  0  to  1?  You  can  see  from  the 
series  (5.12)  that  the  sum  at  x  =  0  is  i,  but  what  does  this  have  to  do  with  f(x)7 
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These  questions  would  not  be  easy  for  us  to  answer  for  ourselves,  but  they  are 
answered  for  us  for  most  practical  purposes  by  the  theorem  of  Dirichlet: 


If  f{x)  is  periodic  of  period  2n,  and  if  between  —  it  and  n  it  is  single- valued, 
has  a  finite  number  of  maximum  and  minimum  values,  and  a  finite  number  of 
discontinuities,  and  if  f_n\f{x)\dx  is  finite,  then  the  Fourier  series  (5.1)  [with 
coefficients  given  by  (5.9)  and  (5.10)]  converges  to  f(x)  at  all  the  points  where 
f(x)  is  continuous;  at  jumps  the  Fourier  series  converges  to  the  midpoint  of  the 
jump.  (This  includes  jumps  that  occur  at  ±7r  for  the  periodic  function.) 


To  see  what  all  this  means,  we  shall  consider  some  special  functions.  We  have 
already  discussed  what  a  periodic  function  means.  A  function  fix)  is  single-valued 
if  there  is  just  one  value  of  fix)  for  each  x.  For  example,  if  x2  +  y2  =  1,  y 
is  not  a  single- valued  function  of  x,  unless  we  select  just  y  =  +\/l  —  x2  or  just 
V  =  -a/1  —  x2 .  An  example  of  a  function  with  an  infinite  number  of  maxima  and 
minima  is  sin(l/x),  which  oscillates  infinitely  many  times  as  x  — >  0.  If  we  imagine 
a  function  constructed  from  sin(l/x)  by  making  fix)  =  1  for  every  x  for  which 
sin(l/a:)  >  0,  and  fix)  =  —1  for  every  x  for  which  sin(l/x)  <  0,  this  function 
would  have  an  infinite  number  of  discontinuities.  Now  most  functions  in  applied 
work  do  not  behave  like  these,  but  will  satisfy  the  Dirichlet  conditions. 

Finally,  if  y  =  1  /x,  we  find 


1 

r  i  7 

dx  =  2 

/  —  dx  =  2  In  x 

X 

Jo  x 

=  oo, 


so  the  function  1/x  is  ruled  out  by  the  Dirichlet  conditions.  On  the  other  hand,  if 
fix)  =  1/VH  then 


2 


rn  dx 

Jo  Vx 


=  4%/tt, 


so  the  periodic  function  which  is  1  / \/\x\  between  —  n  and  n  can  be  expanded  in  a 
Fourier  series.  In  most  problems  it  is  not  necessary  to  find  the  value  of  [___  |/(a;)|  dx ; 
let  us  see  why.  If  fix)  is  bounded  (that  is,  all  its  values  lie  between  ±M  for  some 
positive  constant  M),  then 

/7T  /*  7T 

\fix)\dx  <  /  M dx  =  M  ■  2tt 

-7T  J  —TV 


Figure  6.1 
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and  so  is  finite.  Thus  you  can  simply  verify  that  the  function  you  are  considering  is 
bounded  (if  it  is)  instead  of  evaluating  the  integral.  Figure  6.1  is  an  (exaggerated!) 
example  of  a  function  which  satisfies  the  Dirichlet  conditions  on  (— 7r,7r). 

We  see,  then,  that  rather  than  testing  Fourier  series  for  convergence  as  we  did 
power  series,  we  instead  check  the  given  function;  if  it  satisfies  the  Dirichlet  con¬ 
ditions  we  are  then  sure  that  the  Fourier  series,  when  we  get  it,  will  converge  to 
the  function  at  points  of  continuity  and  to  the  midpoint  of  a  jump.  For  exam¬ 
ple,  consider  the  function  /( x)  in  Figure  5.1.  Between  — 7r  and  n  the  given  f{x) 
is  single- valued  (one  value  for  each  x),  bounded  (between  +1  and  0),  has  a  finite 
number  of  maximum  and  minimum  values  (one  of  each),  and  a  finite  number  of 
discontinuities  (at  —7 r,  0,  and  7r),  and  therefore  satisfies  the  Dirichlet  conditions. 
Dirichlet’s  theorem  then  assures  us  that  the  series  (5.12)  actually  converges  to  the 
function  /( x)  in  Figure  5.1  at  all  points  except  x  =  mr  where  it  converges  to  1/2. 

In  Chapter  3,  Sections  10  and  14,  we  defined  a  basis  for  ordinary  3-dimensional 
space  as  a  set  of  linearly  independent  vectors  (like  i,  j  ,  k)  in  terms  of  which  we  could 
write  every  vector  in  the  space.  We  then  extended  this  idea  to  an  n-dimensional 
space  and  to  a  space  in  which  the  basis  vectors  were  functions.  By  analogy,  we  say 
here  that  the  functions  sin  nx ,  cos  nx  are  a  set  of  basis  functions  for  the  (infinite  di¬ 
mensional)  space  of  all  functions  (satisfying  Dirichlet  conditions)  defined  on  (—7 r,  n) 
or  any  27t  interval.  (Also  see  “completeness  relation”  in  Section  11.  And  for  more 
examples  of  such  sets  of  basis  functions,  see  Chapters  12  and  13.) 


Figure  6.2 

It  is  interesting  to  see  a  graph  of  the  sum  of  a  large  number  of  terms  of  a  Fourier 
series.  Figure  6.2  shows  several  different  partial  sums  of  the  series  in  (5.12)  for  the 
function  in  Figure  5.1.  We  can  see  that  the  sum  of  many  terms  of  the  series  closely 
approximates  the  function  away  from  the  jumps  and  goes  through  the  midpoint 
of  the  jump.  The  “overshoot”  on  either  side  of  a  jump  bears  comment.  It  does 
not  disappear  as  we  add  more  and  more  terms  of  the  series.  It  simply  becomes  a 
narrower  and  narrower  spike  of  height  equal  to  about  9%  of  the  jump.  This  fact  is 
called  the  Gibbs  phenomenon. 
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We  ought  to  say  here  that  the  converse  of  Dirichlet’s  theorem  is  not  true — if 
a  function  fails  to  satisfy  the  Dirichlet  conditions,  it  still  may  be  expandable  in  a 
Fourier  series.  The  periodic  function  which  is  sin(l/a;)  on  (— 7r,7r)  is  an  example  of 
such  a  function.  However,  such  functions  are  rarely  met  with  in  practice. 


Example.  Fourier  series  can  be  useful  in  summing  numerical  series.  Look  at  Problem  5.2 
(sketch  it).  From  Dirichlet’s  theorem,  we  see  that  the  Fourier  series  converges  to 
1/2  at  x  =  0.  Let  x  =  0  in  the  Fourier  series  to  get 


since  sinO  =  0  and  cosO  =  1.  Thus 


PROBLEMS,  SECTION  6 

1  to  11.  For  each  of  the  periodic  functions  in  Problems  5.1  to  5.11,  use  Dirichlet’s  theorem 

to  find  the  value  to  which  the  Fourier  series  converges  at  x  =  0,  ±7t/2,  ±7t,  ±27t. 

12.  Use  a  computer  to  produce  graphs  like  Fig.  6.2  showing  Fourier  series  approximations 
to  the  functions  in  Problems  5.1  to  5.3,  and  5.7  to  5.11.  You  might  like  to  set 
up  a  computer  animation  showing  the  Gibbs  phenomenon  as  the  number  of  terms 
increases. 

13.  Repeat  the  example  using  the  same  Fourier  series  but  at  x  =  7r/2. 

14.  Use  Problem  5.7  to  show  that  X)odd  n  V7*2  =  7t2/8.  Try  x  =  0,  and  x  =  n.  What 

do  you  find  at  x  =  7t/2? 

15.  Use  Problem  5.11  to  show  that  1  +  1  +  1  H - =  i. 

2^  —  1  4^  —  1  6^  —  1  2 


7.  COMPLEX  FORM  OF  FOURIER  SERIES 


Recall  that  real  sines  and  cosines  can  be  expressed  in  terms  of  complex  exponentials 
by  the  formulas  [Chapter  2,  (11.3)] 


(7.1) 


sin  nx  = 


2  i 


cos  nx  = 


If  we  substitute  equations  (7.1)  into  a  Fourier  series  like  (5.12),  we  get  a  series  of 
terms  of  the  forms  emx  and  e~mx .  This  is  the  complex  form  of  a  Fourier  series.  We 
can  also  find  the  complex  form  directly;  this  is  often  easier  than  finding  the  sine- 
cosine  form.  We  can  then,  if  we  like,  work  back  the  other  way  and  [using  Euler’s 
formula,  Chapter  2,  (9.3)]  get  the  sine-cosine  form  from  the  exponential  form. 

We  want  to  see  how  to  find  the  coefficients  in  the  complex  form  directly.  We 
assume  a  series 


(7.2)  f{x)  =  c0  +  c\eix  +  c_ie-“  +  c2e2“  +  c_2e-2“  +  •  •  • 

n=+oo 

=  c«ei"x 
n=—  oo 
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and  try  to  find  the  c„’s.  From  (5.4)  we  know  that  the  average  value  of  elkx  on  (— tt,  it) 
is  zero  when  k  is  an  integer  not  equal  to  zero.  To  find  cq,  we  find  the  average  values 
of  the  terms  in  (7.2): 


(7.3) 


1 

27 r 


f(x)  dx 


average  values  of  terms  of  the 
form  elkx  with  k  an  integer  ^  0 


—  Cq  +  0, 


(7.4) 


i  r 

c0  =  —  y  f(x )  dx. 


To  find  cn,  we  multiply  (7.2)  by  e~mx  and  again  find  the  average  value  of  each 
term.  Note  the  minus  sign  in  the  exponent.  In  finding  an,  the  coefficient  of  cosnx 
in  equation  (5.1),  we  multiplied  by  cos  nx;  but  here  in  finding  the  coefficient  cn  of 
emx,  we  multiply  by  the  complex  conjugate  e~lnx . 


(7.5) 


27T 


f(x)e  inx  dx  =  cq 


27T 


:  dx  +  Ci 


—mx^ix 


27T 


elxdx 


■ c_i 

2tt 


—mx  „—ix 


dx  - 


The  terms  on  the  right  are  the  average  values  of  exponentials  elkx ,  where  the  k 
values  are  integers.  Therefore  all  these  terms  are  zero  except  the  one  where  k  =  0; 
this  is  the  term  containing  cn.  We  then  have 


i  r 

(7.6) 

C„  =  —  y  f{x)e~mx  dx. 

Note  that  this  formula  contains  the  one  for  cq  (no  \  to  worry  about  here!).  Also, 
since  (7.6)  is  valid  for  negative  as  well  as  positive  n,  you  have  only  one  formula  to 
memorize  here!  You  can  easily  show  that  for  real  f(x),  c_„  =  cn  (Problem  12). 


Example.  Let  us  expand  the  same  f(x)  we  did  before,  namely  (5.11).  We  have  from  (7.6) 


(7.7) 


Cn 


Co 


J-  [  e-’-nx  .  o  .  dx  +  J_  f  e-,.nx  ,x,dx 

2?r  ./_,  27T  JQ 


^  g—inx  71 

27 r  — in  0 


—2'Kin 


-( e~inn  -  1) 


n  odd, 
n  even  ^  0, 


1 

27T 
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Then 

oo 

(7.8)  /(z)=5>„e^=-  +  - 

z '  2  77T 


It  is  interesting  to  verify  that  this  is  the  same  as  the  sine-cosine  series  we  had  before. 
We  could  use  Euler’s  formula  for  each  exponential,  but  it  is  easier  to  collect  terms 
like  this: 


(7.9) 


1  2 

/(*)=o  +  - 
Z  7 r 


e™  —  e 
2i~ 


^  g3 ix  g— 3 ix 

3 


2  i 


sin  cc  - 


■  sin  3x  - 


which  is  the  same  as  (5.12). 


PROBLEMS,  SECTION  7 

1  to  11.  Expand  the  same  functions  as  in  Problems  5.1  to  5.11  in  Fourier  series  of 
complex  exponentials  elnx  on  the  interval  (— 7r,  7r)  and  verify  in  each  case  that  the  answer 
is  equivalent  to  the  one  found  in  Section  5. 

12.  Show  that  if  a  real  f(x)  is  expanded  in  a  complex  exponential  Fourier  series 

CnSlTLX ,  then  c-n  =  c„,  where  c„  means  the  complex  conjugate  of  c„. 

13.  If  f(x)  =  \ao  +  an  cos nx  +  bn  sinn*  =  cneznx ,  use  Euler’s  formula 

to  find  an  and  bn  in  terms  of  cn  and  C-„,  and  to  find  cn  and  c~„  in  terms  of  an 
and  bn- 


8.  OTHER  INTERVALS 


The  functions  sinner  and  cos  nx  and  emx  have  period  2n.  We  have  been  considering 
(— 7r,7r)  as  the  basic  interval  of  length  27r.  Given  f(x)  on  ( — 7r,  7r) ,  we  have  first 
sketched  it  for  this  interval,  and  then  repeated  our  sketch  for  the  intervals  (7 r,  37r), 
(37t,57t),  (— 37t,  — 7r),  etc.  There  are  (infinitely)  many  other  intervals  of  length  27T, 
any  one  of  which  could  serve  as  the  basic  interval.  If  we  are  given  f(x)  on  any 
interval  of  length  27t,  we  can  sketch  f(x )  for  that  given  basic  interval  and  then 
repeat  it  periodically  with  period  2-7t.  We  then  want  to  expand  the  periodic  function 
so  obtained,  in  a  Fourier  series.  Recall  that  in  evaluating  the  Fourier  coefficients, 
we  used  average  values  over  a  period.  The  formulas  for  the  coefficients  are  then 
unchanged  (except  for  the  limits  of  integration)  if  we  use  other  basic  intervals  of 
length  27t.  In  practice,  the  intervals  (— n,  7 r)  and  (0,  27r)  are  the  ones  most  frequently 
used.  For  f(x)  defined  on  (0,27t)  and  then  repeated  periodically,  (5.9),  (5.10),  and 
(7.6)  would  read 


r2  n 


r2-x 


Q"n.  — 


f(x)  cos  nxdx , 


bn.  — 


f(x)  sin  nxdx, 


(8.1) 


C-n  — 


27 r 


f{x)e~inxdx , 
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and  (5.1)  and  (7.2)  are  unchanged. 

f(x) 


X 


Figure  8.1 

Notice  how  important  it  is  to  sketch  a  graph  to  see  clearly  what  function  you 
are  talking  about.  For  example,  given  /(x)  =  x 2  on  (— n,  n),  the  extended  function 
of  period  27t  is  shown  in  Figure  8.1.  But  given  /(x)  =  x2  on  (0,27t),  the  extended 
periodic  function  is  different  (see  Figure  8.2).  On  the  other  hand,  given  /(x)  as  in 
our  example  (5.11),  or  given  f(x)  =  1  on  (0, 7r),  f(x)  =  0  on  (-7T,  2i r),  you  can  easily 
verify  by  sketching  that  the  graphs  of  the  extended  functions  are  identical.  In  this 
case  you  would  get  the  same  answer  from  either  formulas  (5.9),  (5.10),  and  (7.6)  or 
formulas  (8.1). 


Physics  problems  do  not  always  come  to  us  with  intervals  of  length  27t.  Fortu¬ 
nately,  it  is  easy  now  to  change  to  other  intervals.  Consider  intervals  of  length  2 1, 
say  (—1,1)  or  (0, 2Z) .  The  function  sin(mrx/Z)  has  period  21,  since 

•  n7T  /  •  f™ TX  „  \  ■  n7TX 

sin  —  (x  +  21)  =  sin  y  — h  2mrj  =  sm  —j—. 

Similarly,  cos(mrx/l)  and  em'KX'1  have  period  21.  Equations  (5.1)  and  (7.2)  are  now 
replaced  by 


(8.2) 


/(a 0 


/(a 0 


ao  7rx  27 rx 

—  +  oi  cos  —  +  a2  cos  — — I - 

.  7rx  .  2ttx 
+  b\  sm  —  +  o2  sm  — - - f-  •  •  • 


a o  /  n7r 

Y  +  Z^\an  cos  ~ 


+  bn  sin 


mrx\ 

~r )  ’ 


oo 

J2cneinvx/l. 

— OO 
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We  have  already  found  the  average  values  over  a  period  of  all  the  functions  we 
need  to  use  to  find  an,  bn,  and  cn  here.  The  period  is  now  of  length  21,  say  —l  to  l, 
so  in  finding  average  values  of  the  terms  we  replace 


1 

27T 


Recall  that  the  average  of  the  square  of  either  the  sine  or  the  cosine  over  a  period 
is  |  and  the  average  of  em7rx/i  •  e-mnx/1  =  1  is  1.  Then  the  formulas  (5.9),  (5.10), 
and  (7.6)  for  the  coefficients  become 


For  the  basic  interval  (0,  21)  we  need  only  change  the  integration  limits  to  0  to  21. 
The  Dirichlet  theorem  just  needs  7 r  replaced  by  l  in  order  to  apply  here. 


Example.  Given  f(x) 


0,  0  <  x  <  l, 

1,  l  <  x  <21. 


Expand  f{x)  in  an  exponential  Fourier  series  of  period  21.  [The  function  is  given 
by  the  same  formulas  as  (5.11)  but  on  a  different  interval.] 

m 


X 


Figure  8.3 

First  we  sketch  a  graph  of  /( x)  repeated  with  period  21  (Figure  8.3).  By  equa¬ 
tions  (8.3),  we  find 


(8.4) 


Cn 


CO 


k 


l 


r2l 


0  -dx+-  I  l-e~in™/ldx 

Jl 


^  g—iniTx/l 


21  — imr/l 


1 


—2imr 


(e 


—‘lin'K  _ 


—2imr 


, —  (1  —  ein7r)  = 


0,  even  n  yf  0, 

- ,  odd  n, 

imr 


1 

21 


dx  = 


1 

2' 


•  21 
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Then, 


(8.5) 


f(x) 


1  _ L| (pirrx/l 

2  in{ 


1  2 
2  ”  7T  '  Sm 


—  e 


—iirx/l 


7TX 

T 


l  . 

—  sm 

3 


3 

37 TX 


1 

~~  36 


PROBLEMS,  SECTION  8 

1  to  9.  In  Problems  5.1  to  5.9,  define  each  function  by  the  formulas  given  but  on  the 
interval  (—1,1).  [That  is,  replace  ±7r  by  del  and  ±7t/2  by  ±Z/2.]  Expand  each  function  in 
a  sine-cosine  Fourier  series  and  in  a  complex  exponential  Fourier  series. 


10.  (a) 


(b) 


Sketch  several  periods  of  the  function  f(x)  of  period  27r  which  is  equal  to  x 
on  — 7r  <  x  <  7T.  Expand  f(x)  in  a  sine-cosine  Fourier  series  and  in  a  complex 
exponential  Fourier  series. 

Answer:  f(x)  =  2(sin*  —  |  sin  2*  +  |  sin  3a;  —  |  sin 4a;  +  ■••). 


Sketch  several  periods  of  the  function  f(x)  of  period  27r  which  is  equal  to  x 
on  0  <  x  <  27 r.  Expand  /( x)  in  a  sine-cosine  Fourier  series  and  in  a  complex 
exponential  Fourier  series.  Note  that  this  is  not  the  same  function  or  the  same 
series  as  (a). 


Answer:  f(x)  =  n  — 


i 


sin  nx 
n 


In  Problems  11  to  14,  parts  (a)  and  (b),  you  are  given  in  each  case  one  period  of  a  function. 
Sketch  several  periods  of  the  function  and  expand  it  in  a  sine-cosine  Fourier  series,  and  in 
a  complex  exponential  Fourier  series. 


11. 

(a) 

f(x)  =  x2,  — 7T  <  X  <  7T; 

(b) 

s 

II 

x2 ,  0  <  x  <  2n. 

12. 

(a) 

f(x)  =  ex,  — 7r  <x  <i r; 

(b) 

s 

II 

ex ,  0  <  x  <  2n. 

13. 

(a) 

f(x)  =  2  —  x,  —2<x<2; 

(b) 

f(x)  = 

2  —  x,  0  <  x  <  4. 

14. 

(a) 

f(x)  =  sin7ra;,  — |  <  x  < 

(b) 

s 

II 

sin7ra;,  0  <  x  <  1. 

15. 

Sketch  (or  computer  plot)  each  of  the  following  functions 

on  the  interval  (—1,1)  and 

expand  it  in  a  complex  exponential  series  and  in  a  sine-cosine  series, 
(a)  f(x)  =  x,  —  1  <  x  <  1. 


*2  _ ^ 

Answer:  f(x)  =  —  Yf— 1) 

-7T  f  ^ 


n_l_i  sm  nnx 
n 


(b)  f(x)  = 


1  +  2x,  —  1  <  x  <  0, 

1  —  2x,  0  <  x  <  1. 

8 


Answer:  f(x)  =  —  V 

7 T2,  ' 


(c)  f(x )  = 


x  +  x2,  —  1  <  a:  <  0, 

x  —  x2 ,  0  <  x  <  1. 


Answer:  f(x)  =  V 
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Each  of  the  following  functions  is  given  over  one  period.  Sketch  several  periods  of  the 
corresponding  periodic  function  and  expand  it  in  an  appropriate  Fourier  series. 


.  „  „  .  .  _  2  smni^x 

16.  /( x)  =  x,  0  <  x  <  2.  Answer:  fix)  =  1 - }  - . 

7 r  ^ '  rt. 


17.  f[x)  = 


0,  —  1  <  x  <  0, 

1,  0  <  x  <  3. 


18.  f{x)  =  x2,  0  <  x  <  10. 


19.  f{x)  = 


0,  <  x  <  0, 

x,  0  <  x  <  \. 


20.  fix)  = 


x/2,  0  <  x  <  2, 

1,  2  <  x  <  3. 


21.  Write  out  the  details  of  the  derivation  of  the  formulas  (8.3). 


9.  EVEN  AND  ODD  FUNCTIONS 

An  even  function  is  one  like  x2  or  cos  a:  (Figure  9.1)  whose  graph  for  negative  x  is 
just  a  reflection  in  the  y  axis  of  its  graph  for  positive  x.  In  formulas,  the  value  of 
fix)  is  the  same  for  a  given  x  and  its  negative;  that  is 


(9.1)  fix)  is  even  if  f{-x)  =  f{x). 


Figure  9.2 

An  odd  function  is  one  like  x  or  sin  a;  (Figure  9.2)  for  which  the  values  of  f{x) 
and  /(— x)  are  negatives  of  each  other.  By  definition 


(9.2)  fix)  is  odd  if  /(- x)  =  -/( x). 


Notice  that  even  powers  of  x  are  even,  and  odd  powers  of  x  are  odd;  in  fact,  this 
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is  the  reason  for  the  names.  You  should  verify  (Problem  14)  the  following  rules  for 
the  product  of  two  functions:  An  even  function  times  an  even  function,  or  an  odd 
function  times  an  odd  function,  gives  an  even  function;  an  odd  function  times  an 
even  function  gives  an  odd  function.  Some  functions  are  even,  some  are  odd,  and 
some  (for  example,  ex)  are  neither.  However,  any  function  can  be  written  as  the 
sum  of  an  even  function  and  an  odd  function,  like  this: 

f(x)  =  \if(x)  +  f(~x )]  +  ~  f(~x)b 

the  first  part  is  even  and  the  second  part  is  odd.  For  example, 

ex  =  \(ex+  e~x)  +  ^(ex  -  e~x)  =  cosh x  +  sinhx; 

cosh  x  is  even  and  sinh  x  is  odd  (look  at  the  graphs) . 

Integrals  of  even  functions  or  of  odd  functions,  over  symmetric  intervals  like 
(— 7r,7r)  or  (—1,1),  can  be  simplified.  Look  at  the  graph  of  sinx  and  think  about 
f_  sin  x  dx.  The  negative  area  from  —n  to  0  cancels  the  positive  area  from  0  to  tt, 
so  the  integral  is  zero.  This  integral  is  still  zero  for  any  interval  (— l,  l)  which  is 
symmetric  about  the  origin,  as  you  can  see  from  the  graph.  The  same  is  true  for 
any  odd  f(x);  the  areas  to  the  left  and  to  the  right  cancel.  Next  look  at  the  cosine 
graph  and  the  integral  cos  x  dx.  You  see  that  the  area  from  —  -k/2  to  0  is  the 
same  as  the  area  from  0  to  tt/2.  We  could  then  just  as  well  find  the  integral  from 
0  to  7t/2  and  multiply  it  by  2.  In  general,  if  f(x)  is  even,  the  integral  of  f(x)  from 
— I  to  l  is  twice  the  integral  from  0  to  l.  Then  we  have 

0  if  f(x)  is  odd, 

2  /  f(x)  dx  if  f(x)  is  even. 

Jo 


Suppose  now  that  we  are  given  a  function  on  the  interval  (0,1).  If  we  want  to 
represent  it  by  a  Fourier  series  of  period  21,  we  must  have  f(x)  defined  on  (—1,0) 
too.  There  are  several  things  we  could  do.  We  could  define  it  to  be  zero  (or,  indeed, 
anything  else)  on  (—1,  0)  and  go  ahead  as  we  have  done  previously  to  find  either  an 
exponential  or  a  sine-cosine  series  of  period  21.  However,  it  often  happens  in  prac¬ 
tice  that  we  need  (for  physical  reasons — see  Chapter  13)  to  have  an  even  function 
(or,  in  a  different  problem,  an  odd  function).  We  first  sketch  the  given  function  on 
(0,/)  (heavy  lines  in  Figures  9.3  and  9.4).  Then  we  extend  the  function  on  (—1,0) 
to  be  even  or  to  be  odd  as  required.  To  sketch  more  periods,  just  repeat  the  (—1,  l) 
sketch.  (If  the  graph  is  complicated,  it  is  helpful  to  trace  it  with  a  finger  of  one 


Figure  9.3 
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hand  while  you  use  the  other  hand  to  copy  exactly  what  you  are  tracing.  Turn  the 
paper  upside  down  to  avoid  crossing  hands.) 


For  even  or  odd  functions,  the  coefficient  formulas  for  an  and  bn  simplify.  First 
suppose  f{x)  is  odd.  Since  sines  are  odd  and  cosines  are  even,  f{x)  sin(mra;/Z)  is 
even  and  f{x)  cos(mrx/l)  is  odd.  Then  an  is  the  integral,  over  a  symmetric  interval 
( — Z,  Z) ,  of  an  odd  function,  namely  f{x)cos{mrx/l)\  an  is  therefore  zero.  But  bn  is 
the  integral  of  an  even  function  over  a  symmetric  interval  and  is  therefore  twice  the 
0  to  l  integral.  We  have: 


(9.4) 


If  f(x)  is  odd, 


fix)  sin 


mrx 

~T 


dx , 


We  say  that  we  have  expanded  fix)  in  a  sine  series  (an  =  0  so  there  are  no  cosine 
terms).  Similarly,  if  fix)  is  even,  all  the  bn' s  are  zero,  and  the  an’s  are  integrals  of 
even  functions.  We  have: 


(9.5) 


If  fix)  is  even, 


tt  x  mrx 
fix)  cos  — —  dx, 


We  say  that  fix)  is  expanded  in  a  cosine  series.  (Remember  that  the  constant  term 
is  a0/ 2.) 

You  have  now  learned  to  find  several  different  kinds  of  Fourier  series  that  rep¬ 
resent  a  given  function  fix)  on,  let  us  say,  the  interval  (0,1).  How  do  you  know 
which  to  use  in  a  given  problem?  You  have  to  decide  this  from  the  physical  problem 
when  you  are  using  Fourier  series.  There  are  two  things  to  check:  (1)  the  basic  pe¬ 
riod  involved  in  the  physical  problem;  the  functions  in  your  series  should  have  this 
period;  and  (2)  the  physical  problem  may  require  either  an  even  function  or  an  odd 
function  for  its  solution;  in  these  cases  you  must  find  the  appropriate  series.  Now 
consider  fix)  defined  on  (0, 1).  We  could  find  for  it  a  sine-cosine  or  an  exponential 
series  of  period  1  (that  is,  l  =  i): 

oo 

fix)  =  ^c„e2imrx  where  cn  =  /  fix)e~2innxdx. 


—  OO 


0 
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(The  choice  between  sine-cosine  and  exponential  series  is  just  one  of  convenience  in 
evaluating  the  coefficients — the  series  are  really  identical.)  But  we  could  also  find 
two  other  Fourier  series  representing  the  same  f(x)  on  (0,1).  These  series  would 
have  period  2  (that  is,  1  =  1).  One  would  be  a  cosine  series 


OO 

/(*)  =  an  cosnnx, 

n— 0 


an 


f{x)  cosnirx  dx, 


bn  =  0, 


and  represent  an  even  function;  the  other  would  be  a  sine  series  and  represent  an 
odd  function.  In  the  problems,  you  may  just  be  told  to  expand  a  function  in  a 
cosine  series,  say.  You  must  then  see  for  yourself  what  the  period  is  when  you  have 
sketched  an  even  function,  and  so  choose  the  proper  l  in  cos(mrx/l)  and  in  the 
formula  for  an. 


Example.  Represent  f(x)  =  '  ^ 

by  (a)  a  Fourier  sine  series,  (b)  a  Fourier  cosine  series,  (c)  a  Fourier  series  (the  last 
ordinarily  means  a  sine-cosine  or  exponential  series  whose  period  is  the  interval  over 
which  the  function  is  given;  in  this  case  the  period  is  1). 


-3 


ir 


-1 


3 


Figure  9.5 


(a)  Sketch  the  given  function  between  0  and  1.  Extend  it  to  the  interval  (—1,0) 
making  it  odd.  The  period  is  now  2,  that  is,  l  =  1.  Continue  the  function  with 
period  2  (Figure  9.5).  Since  we  now  have  an  odd  function,  an  =  0  and 


b 


n 


bi 


2  f1  r1/2 

—  /  f(x)  sin  nnxdx  =  2  /  sin  nnxdx 

1  J  o  Jo 


2 


- COS  TITTX 

nn 


1/2 


0 


2 

nn 


(“T-0 


2 

3tt’ 


64  =  0, 


Thus  we  obtain  the  Fourier  sine  series  for  f(x): 


f(x )  =  —  (  sin7nr;  + 

7T 


2  sin  2irx  sin  37 rx  sin  57tx  2  sin  67nr 
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-3 


l 


1 

3 


Figure  9.6 


(b)  Sketch  an  even  function  of  period  2  (Figure  9.6). 
Here  l  =  1,  bn  =  0,  and 


a0 


r  i  r 1/2 

2  /  /(a;)  dx  =  2  dx 

Jo  Jo 


=  1, 


=  2 


/(x)  cos  mrx  dx 


2 


1/2 


—  sin  mrx 
mr 


o 


2  mr 

—  sin  — . 

mr  2 


Then  the  Fourier  cosine  series  for  f{x)  is 


f(x) 


COS  7 TX  COS  37 TX  COS  57TX 


3 


Figure  9.7 


(c)  Sketch  the  given  function  on  (0, 1)  and  continue  it  with  period  1  (Figure  9.7). 
Here  21  =  1,  and  we  find  cn  as  we  did  in  the  example  of  Section  8.  As  in  that 
example,  the  exponential  series  here  can  then  be  put  in  sine-cosine  form. 


Cn 


CO 


f(x) 


/0)e 
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dx  = 


dx 
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1 -(-!)"  I  — 
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n  odd, 

imr 

0,  n  even  ^  0. 
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Alternatively  we  can  find  both  an  and  bn  directly. 


«o  —  2 


,1  ,1/2 
/  f(x)  dx  =  2  /  dx  =  1. 
./o  Ao 


nl/2 


—  2 


cos  2mrx  dx  =  0. 


r1/2  i  i 

bn  =  2  sin2mrx  dx  =  — (1  —  cosri7r)  =  — [1  —  (—1)"]. 
J  o  7171  7171 


>0 

2  2 

b  i  =  — ,  62  =  0,  bs  =  —,  64  =  0, 

71  071 


There  is  one  other  very  useful  point  to  notice  about  even  and  odd  functions.  If 
you  are  given  a  function  on  (— l,  l )  to  expand  in  a  sine-cosine  series  (of  period  21) 
and  happen  to  notice  that  it  is  an  even  function,  you  should  realize  that  the  s 
are  all  going  to  be  zero  and  you  do  not  have  to  work  them  out.  Also  the  an’s  can 
be  written  as  twice  an  integral  from  0  to  l  just  as  in  (9.5).  Similarly,  if  the  given 
function  is  odd,  you  can  use  (9.4).  Recognizing  this  may  save  you  a  good  deal  of 
algebra. 


Differentiating  Fourier  Series  Now  that  we  have  a  supply  of  Fourier  series  for 
reference,  let’s  discuss  the  question  of  differentiating  a  Fourier  series  term  by  term. 
First  consider  a  Fourier  series  in  which  an  and  bn  are  proportional  to  1/n.  Since 
the  derivative  of  i  sin  nx  is  cos  nx  (and  a  similar  result  for  the  cosine  terms) ,  we  see 
that  the  differentiated  series  has  no  1/n  factors  to  make  it  converge.  Now  you  might 
suspect  (correctly)  that  if  you  can’t  differentiate  the  Fourier  series,  then  the  function 
f(x )  which  it  represents  can’t  be  differentiated  either,  at  least  not  at  all  points. 
Turn  back  to  examples  and  problems  for  which  the  Fourier  series  have  coefficients 
proportional  to  1/n  and  look  at  the  graphs  (or  sketch  them).  Note  in  every  case 
that  f(x)  is  discontinuous  (that  is,  has  jumps)  at  some  points,  and  so  can’t  be 
differentiated  there.  Next  consider  Fourier  series  with  an  and  bn  proportional  to 
1/n2.  If  we  differentiate  such  a  series  once,  there  are  still  1/n  factors  left  but  we  can’t 
differentiate  it  twice.  In  that  case  we  would  (correctly)  expect  the  function  to  be 
continuous  with  a  discontinuous  first  derivative.  (Look  for  examples.)  Continuing, 
if  an  and  bn  are  proportional  to  1/n3,  we  can  find  two  derivatives,  but  the  second 
derivative  is  discontinuous,  and  so  on  for  Fourier  coefficients  proportional  to  higher 
powers  of  1/n.  (See  Problems  26  and  27.) 

It  is  interesting  to  plot  (by  computer)  a  given  function  together  with  enough 
terms  of  its  Fourier  series  to  give  a  reasonable  fit.  In  Section  5  we  did  this  for 
discontinuous  functions  and  it  took  many  terms  of  the  series.  You  will  find  (see 
Problems  26  and  27)  that  the  more  continuous  derivatives  a  function  has,  the  fewer 
terms  of  its  Fourier  series  are  required  to  approximate  it.  We  can  understand  this: 
The  higher  order  terms  oscillate  more  rapidly  (compare  sin  x7  sin  2x ,  sin  10a:) ,  and 
this  rapid  oscillation  is  what  is  needed  to  fit  a  curve  which  is  changing  rapidly 
(for  example,  a  jump).  But  if  f(x)  has  several  continuous  derivatives,  then  it  is 
quite  “smooth”  and  doesn’t  require  so  much  of  the  rapid  oscillation  of  the  higher 
order  terms.  This  is  reflected  in  the  dependence  of  the  Fourier  coefficients  on  a 
power  of  1  /n. 
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PROBLEMS,  SECTION  9 


The  functions  in  Problems  1  to  3  are  neither  even  nor  odd.  Write  each  of  them  as  the  sum 
of  an  even  function  and  an  odd  function. 


1. 

(a)  einx 

(b)  xex 

2. 

(a)  In  1 1  —  a:  | 

(b)  (1  +  a;)  (sin  x  +  cos  x) 

3. 

(a)  x5  —  x4  +  x3  —  1 

(b)  l  +  ex 

4. 

Using  what  you  know  about 

even  and  odd  functions,  prove  the  first  part  of  (5.2) 

Each  of  the  functions  in  Problems  5  to  12  is  given  over  one  period.  For  each  function, 
sketch  several  periods  and  decide  whether  it  is  even  or  odd.  Then  use  (9.4)  or  (9.5)  to 
expand  it  in  an  appropriate  Fourier  series. 


5.  /( x)  = 


6.  f{x)  = 


-  1,  —  7T  <  X  <  0, 

1,  0  <  X  <  7T. 

—  1,  —1<X<  0, 

1.  0  <  X  <1. 


37rrc 


57ra; 


Answer:  f(x)  =  —  (  sin  —  +  —  sin  — —  +  -  sin  — - h 


7.  f(x)  = 


1,  —  1  <  x  <  1, 

0,  —  2  <  x  <  —  1  and  1  <  x  <  2. 


r ,  N  7 T  TV 

8.  f(x)  =x,  ——  <  x  <  —. 

9.  f(x)  =  x2,  —  |<a;<§. 

Answer:  f(x)  =  \ 

12  TVZ 

10.  f(x)  =  \x\,  -^<3J<|. 

11.  /(*)  =  cosha;,  —n<x<iv. 


cos  27ra;  —  —  cos  47ra:  +  —  cos  67ro:  —  • 
2 1  3J 


Answer:  f(x)  = 


2  sinh  7 r 


—  —  —  cos  x  +  —  cos  2a; - —  cos  3a;  +  —  cos  4a;  — 

2  2  5  10  17 


12.  f(x)  = 


x  +  1,  —  1  <  x  <  0, 
x  —  1,  0  <  x  <  1. 


13.  Give  algebraic  proofs  of  (9.3).  Hint:  Write  =  /°;  +  J0\  make  the  change  of 
variable  x  =  —  t  in  and  use  the  definition  of  even  or  odd  function. 

14.  Give  algebraic  proofs  that  for  even  and  odd  functions: 

(a)  even  times  even  =  even;  odd  times  odd  =  even;  even  times  odd  =  odd; 

(b)  the  derivative  of  an  even  function  is  odd;  the  derivative  of  an  odd  function  is 


15.  Given  /( x)  =  x  for  0  <  x  <  1,  sketch  the  even  function  fc  of  period  2  and  the  odd 
function  fa  of  period  2,  each  of  which  equals  f(x)  on  0  <  x  <  1.  Expand  fc  in  a 
cosine  series  and  fa  in  a  sine  series. 

14/  1 

Answer:  fJx)  = - ^  cos  7ra;  H — r-  cos  37ra;  +  • 

o  i 


1  . 


1  . 


fs( x)  =  —  sin  7ra;  —  —  sin  27ra;  +  -  sin  37ra:  — 
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16.  Let  f{x)  =  sin2  x,  0  <  x  <  n.  Sketch  (or  computer  plot)  the  even  function  fc  of 
period  2n,  the  odd  function  fs  of  period  2n,  and  the  function  fp  of  period  n,  each  of 
which  is  equal  to  f(x)  on  (0,  n).  Expand  each  of  these  functions  in  an  appropriate 
Fourier  series. 

In  Problems  17  to  22  you  are  given  f(x)  on  an  interval,  say  0  <  x  <  b.  Sketch  several 
periods  of  the  even  function  fc  of  period  2b,  the  odd  function  fa  of  period  2b,  and  the 
function  fp  of  period  b,  each  of  which  equals  f(x)  on  0  <  x  <  b.  Expand  each  of  the  three 
functions  in  an  appropriate  Fourier  series. 


f(x)  =  | 

f  1, 

0  <  x  <  5, 

/(*)  =  | 

fl, 

0  <  x  <  1, 

17. 

18. 

l"1’ 

I  <  X  <  1. 

l°> 

1  <  x  <  3. 

19. 

f(x)  =  | 

COS  *1, 

0  <  X  <  TV. 

20. 

/Or)  =  x2, 

0  <  x  <  1. 

f(x)  =  | 

I  X, 

0  <  x  <  1, 

f(x)  =  | 

r  io. 

0  <  x  <  10, 

21. 

22. 

[2-x, 

,  1  <  x  <  2. 

[20, 

10  <  2;  <  20. 

23.  If  a  violin  string  is  plucked  (pulled  aside  and  let 
go),  it  is  possible  to  find  a  formula  f(x,t)  for 
the  displacement  at  time  t  of  any  point  x  of  the 
vibrating  string  from  its  equilibrium  position.  It 
turns  out  that  in  solving  this  problem  we  need 
to  expand  the  function  f(x,  0),  whose  graph  is  the  initial  shape  of  the  string,  in  a 
Fourier  sine  series.  Find  this  series  if  a  string  of  length  l  is  pulled  aside  a  small 
distance  h  at  its  center,  as  shown. 

24.  If,  in  Problem  23,  the  string  is  stopped  at  the  f(x,0) 

center  and  half  of  it  is  plucked,  then  the  func¬ 
tion  to  be  expanded  in  a  sine  series  is  shown  - 

here.  Find  the  series.  Caution:  Note  that 
f(x,  0)  =3  0  for  1/2  <  x  <  l. 

25.  Suppose  that  f(x)  and  its  derivative  f'(x)  are  both  expanded  in  Fourier  series  on 
(— 7r,7r).  Call  the  coefficients  in  the  f(x)  series  an  and  bn  and  the  coefficients  in  the 
f'(x)  series  a'n  and  b'n.  Write  the  integral  for  an  [equation  (5.9)]  and  integrate  it 
by  parts  to  get  an  integral  of  f'(x)smnx.  Recognize  this  integral  in  terms  of  b'n 
[equation  (5.10)  for  f'(x)]  and  so  show  that  b'n  =  —nan.  (In  the  integration  by 
parts,  the  integrated  term  is  zero  because  /(7r)  =  /(— n)  since  /  is  continuous — 
sketch  several  periods.).  Find  a  similar  relation  for  a'n  and  bn.  Now  show  that  this 
is  the  result  you  get  by  differentiating  the  f(x)  series  term  by  term.  Thus  you  have 
shown  that  the  Fourier  series  for  f'(x)  is  correctly  given  by  differentiating  the  f(x) 
series  term  by  term  (assuming  that  f'(x)  is  expandable  in  a  Fourier  series). 


In  Problems  26  and  27,  find  the  indicated  Fourier  series.  Then  differentiate  your  result 
repeatedly  (both  the  function  and  the  series)  until  you  get  a  discontinuous  function.  Use  a 
computer  to  plot  f(x)  and  the  derivative  functions.  For  each  graph,  plot  on  the  same  axes 
one  or  more  terms  of  the  corresponding  Fourier  series.  Note  the  number  of  terms  needed 
for  a  good  fit  (see  comment  at  the  end  of  the  section). 


26.  /Or) 

27.  f(x) 


j  3x2  +  2x3, 
[3x2  -2x3, 

(X  -7T  )  , 


— 1  <  x  <  0, 
0  <  x  <  1. 


— 7T  <  X  <  7T. 
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10.  AN  APPLICATION  TO  SOUND 


We  have  said  that  when  a  sound  wave  passes  through  the  air  and  we  hear  it,  the 
air  pressure  where  we  are  varies  with  time.  Suppose  the  excess  pressure  above  (and 
below)  atmospheric  pressure  in  a  sound  wave  is  given  by  the  graph  in  Figure  10.1. 
(We  shall  not  be  concerned  here  with  the  units  of  p ;  however,  reasonable  units  in 
Figure  10.1  would  be  p  in  10-6  atmospheres.)  Let  us  ask  what  frequencies  we  hear 
when  we  listen  to  this  sound.  To  find  out,  we  expand  p(t)  in  a  Fourier  series.  The 
period  of  p(t)  is  ^5;  that  is,  the  sound  wave  repeats  itself  262  times  per  second. 
We  have  called  the  period  21  in  our  formulas,  so  here  l  =  The  functions  we 
have  called  sin(mrx/l)  here  become  sin  524n7rf.  We  can  save  some  work  by  observing 


p(t) 

1 

1 

1 

1 

1 

1 

1 

0  1 

1048 

i 

104) 

| 

1 

?! 

-il - 

8 

7 


1 

524 

I 

I 


1 

262 

I 

I 

J 


t  in 

seconds 


Figure  10.1 

that  p(t)  is  an  odd  function;  there  are  then  only  sine  terms  in  its  Fourier  series  and 
we  need  to  compute  only  bn.  Using  (9.4),  we  have 


(10.1) 


/■1/524 

bn  =  2(524)  /  p(t)  sin  524n7rf  dt 

Jo 

/*  1/1048  ly  /*l/OZ4 

=  1048  /  sin  524n7rf  dt - (1048)  /  sin  524n7rf  dt 

JO  §  J 1/1048 


f 1/524 


=  1048  - 


mr  mr  - 

cos - 1  7  cos  nir  —  cos  — 

2  ,  i _ 2_ 

524n7r  8  524n7r 


2/15  mr  7 

— - cos  — -  +  1  +  -  cos  mr  . 

mr  \  8  2  8 


From  this  we  can  compute  the  values  of  bn  for  the  first  few  values  of  n: 


(10.2) 


^5 

b6 


67 


&8 


1  1 

5-7T  4 


1  1 
77r  '  4 


0,  etc. 
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Then  we  have 
(10.3)  p(t)  = 


1  /  sin  5247 rt  30  sin(524  •  2nt)  sin(524  •  3nt) 

47t  ^  T  +  2  +  3 

sin(524  •  5nt)  ^  30sin(524  •  67ri)  ^  sin(524  •  7nt) 


We  can  see  just  by  looking  at  the  coefficients  that  the  most  important  term 
is  the  second  one.  The  first  term  corresponds  to  the  fundamental  with  frequency 
262  vibrations  per  second  (this  is  approximately  middle  C  on  a  piano).  But  it  is 
much  weaker  in  this  case  than  the  first  overtone  (second  harmonic)  corresponding  to 
the  second  term;  this  tone  has  frequency  524  vibrations  per  second  (approximately 
high  C).  (You  might  like  to  use  a  computer  to  play  one  or  several  terms  of  the 
series.)  The  sixth  harmonic  (corresponding  to  n  =  6)  and  also  the  harmonics  for 
n  =  10,  14,  18,  22,  and  26  are  all  more  prominent  (that  is,  have  larger  coefficients) 
than  the  fundamental.  We  can  be  even  more  specific  about  the  relative  importance 
of  the  various  frequencies.  Recall  that  in  discussing  a  simple  harmonic  oscillator, 
we  showed  that  its  average  energy  was  proportional  to  the  square  of  its  velocity 
amplitude.  It  can  be  proved  that  the  intensity  of  a  sound  wave  (average  energy 
striking  unit  area  of  your  ear  per  second)  is  proportional  to  the  average  of  the 
square  of  the  excess  pressure.  Thus  for  a  sinusoidal  pressure  variation  Asin2nft, 
the  intensity  is  proportional  to  A2.  In  the  Fourier  series  for  p(t),  the  intensities  of  the 
various  harmonics  are  then  proportional  to  the  squares  of  the  corresponding  Fourier 
coefficients.  (The  intensity  corresponds  roughly  to  the  loudness  of  the  tone — not 
exactly  because  the  ear  is  not  uniformly  sensitive  to  all  frequencies.)  The  relative 
intensities  of  the  harmonics  in  our  example  are  then: 


n  =1  2  345  6  7  8  9  10  •  •  • 

Relative  intensity  =  1  225  ^  0  ^  25  ^  0  ^  9 

From  this  we  see  even  more  clearly  that  we  would  hear  principally  the  second 
harmonic  with  frequency  524  (high  C). 
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In  Problems  1  to  3,  the  graphs  sketched  represent  one  period  of  the  excess  pressure  p(t)  in 
a  sound  wave.  Find  the  important  harmonics  and  their  relative  intensities.  Use  a  computer 
to  play  individual  terms  or  a  sum  of  several  terms  of  the  series. 


1. 


p(t) 

l 

1 

l 

i 

1 

1 

! 

i 

i 

220 

1 

i 

330 

t 

1 

i 

660 

-i 

i 

660 

i 

330 

I 

L t  in 

220  seconds 

1 

nU) 

3 

2 

1 


i - 1 


i  i 

_j _ i — 

1  _L 

393  262 


2. 
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In  Problems  4  to  10,  the  sketches  show  several  practical  examples  of  electrical  signals 
(voltages  or  currents).  In  each  case  we  want  to  know  the  harmonic  content  of  the  signal, 
that  is,  what  frequencies  it  contains  and  in  what  proportions.  To  find  this,  expand  each 
function  in  an  appropriate  Fourier  series.  Assume  in  each  case  that  the  part  of  the  graph 
shown  is  repeated  sixty  times  per  second. 


4. 


Output  of  a  simple  d-c  generator;  the  shape  of 
the  curve  is  the  absolute  value  of  a  sine  func¬ 
tion.  Let  the  maximum  voltage  be  100  v. 


5.  Rectified  half-wave;  the  curve  is  a  sine  function 
for  half  the  cycle  and  zero  for  the  other  half. 
Let  the  maximum  current  be  5  amp.  Hint:  Be 
careful!  The  value  of  l  here  is  1/60,  but  I(t)  = 
sint  only  from  t  =  0  to  t  =  1/120. 


6.  Triangular  wave;  the  graph  consists  of  two 
straight  lines  whose  equations  you  must  write! 
The  maximum  voltage  of  100  v  occurs  at  the 
middle  of  the  cycle. 


7.  Sawtooth 


8.  Rectified  sawtooth 


m 

10 

>1 

0 

1 

1 

120 

60 

9.  Square  wave 


10.  Periodic  ramp  function 
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11.  PARSEVAL'S  THEOREM 

We  shall  now  find  a  relation  between  the  average  of  the  square  (or  absolute  square)  of 
f(x)  and  the  coefficients  in  the  Fourier  series  for  /(x),  assuming  that  J_  |/(x)|2  dx 
is  finite.  The  result  is  known  as  Parseval’s  theorem  or  the  completeness  relation. 
You  should  understand  that  the  point  of  the  theorem  is  not  to  get  the  average  of 
the  square  of  a  given  /(x)  by  using  its  Fourier  series.  [Given  /(x),  it  is  easy  to  get 
its  average  square  just  by  doing  the  integration  in  (11.2)  below.]  The  point  of  the 
theorem  is  to  show  the  relation  between  the  average  of  the  square  of  /(x)  and  the 
Fourier  coefficients.  We  can  derive  a  form  of  Parseval’s  theorem  from  any  of  the 
various  Fourier  expansions  we  have  made;  let  us  use  (5.1). 


(H.l) 


f(x)  =  -ao  +  ^  an  cos  nx  +  ^  bn  si 


sm  nx. 


l  l 

We  square  /(x)  and  then  average  the  square  over  (— 7r,7r): 

(11.2)  The  average  of  [/(x)]2  is  f  [f(x)]2dx. 

2lT  J- tt 


When  we  square  the  Fourier  series  in  (11.1)  we  get  many  terms.  To  avoid  writing 
out  a  large  number  of  them,  consider  instead  what  types  of  terms  there  are  and 
what  the  averages  of  the  different  kinds  of  terms  are.  First,  there  are  the  squares 
of  the  individual  terms.  Using  the  fact  that  the  average  of  the  square  of  a  sine  or 
cosine  over  a  period  is  i,  we  have: 

The  average  of  (^«o)2  is  (^ao)2. 

(11.3)  The  average  of  (ancosnx)2  is  a2  • 

The  average  of  (6„sinux)2  is  b2  ■ 

Then  there  are  cross-product  terms  of  the  forms  2  •  ^aoan  cos  nx,  2  ■  ^ cioAiSinnx , 
and  2anbm  cos  nxsinmx  with  m  yf  n  (we  write  n  in  the  cosine  factor  and  m  in  the 
sine  factor  since  every  sine  term  must  be  multiplied  times  every  cosine  term).  By 
(5.2),  the  average  values  of  terms  of  all  these  types  are  zero.  Then  we  have 


(11.4)  The  average  of  [/(x)]2  (over  a  period) 


1 

2 


This  is  one  form  of  Parseval’s  theorem.  You  can  easily  verify  (Problem  1)  that  the 
theorem  is  unchanged  if  /(x)  has  period  21  instead  of  2-7T  and  its  square  is  averaged 
over  any  period  of  length  21.  You  can  also  verify  (Problem  3)  that  if  /(x)  is  written 
as  a  complex  exponential  Fourier  series,  and  if  in  addition  we  include  the  possibility 
that  /(x)  itself  may  be  complex,  then  we  find: 


OO 

(11.5)  The  average  of  |/(x)|2  (over  a  period)  =  ^  |cn|2. 

—  OO 


Parseval’s  theorem  is  also  called  the  completeness  relation.  In  the  problem  of 
representing  a  given  sound  wave  as  a  sum  of  harmonics,  suppose  we  had  left  one  of 
the  harmonics  out  of  the  series.  It  seems  plausible  physically,  and  it  can  be  proved 
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mathematically,  that  with  one  or  more  harmonics  left  out,  we  would  not  be  able  to 
represent  sound  waves  containing  the  omitted  harmonics.  We  say  that  the  set  of 
functions  sin  nx,  cos  nx  is  a  complete  set  of  functions  on  any  interval  of  length  2i r; 
that  is,  any  function  (satisfying  Dirichlet  conditions)  can  be  expanded  in  a  Fourier 
series  whose  terms  are  constants  times  sin  nx  and  cos  nx.  If  we  left  out  some  values 
of  n,  we  would  have  an  incomplete  set  of  basis  functions  (see  basis,  page  357)  and 
could  not  use  it  to  expand  some  given  functions.  For  example,  suppose  that  you 
made  a  mistake  in  finding  the  period  (that  is,  the  value  of  l)  of  your  given  function 
and  tried  to  use  the  set  of  functions  sin  2 nx,  cos  2 nx  in  expanding  a  given  function 
of  period  27r.  You  would  get  a  wrong  answer  because  you  used  an  incomplete  set 
of  basis  functions  (with  the  sin  x,  cos  x,  sin  3x,  cos  3x,  •  •  • ,  terms  missing) .  If  your 
Fourier  series  is  wrong  because  the  set  of  basis  functions  you  use  is  incomplete,  then 
the  results  you  get  from  Parseval’s  theorem  (11.4)  or  (11.5)  will  be  wrong  too.  In 
fact,  if  we  use  an  incomplete  basis  set  in,  say,  (11.5),  then  there  are  missing  (non¬ 
negative)  terms  on  the  right-hand  side,  so  the  equation  becomes  the  inequality: 
Average  of  \f(x)\2  >  ]T)Xco  lcn|2-  This  is  known  as  Bessel’s  inequality.  Conversely, 
if  (11.4)  and  (11.5)  are  correct  for  all  f(x),  then  the  set  of  basis  functions  used 
is  a  complete  set.  This  is  why  Parseval’s  theorem  is  often  called  the  completeness 
relation.  (Also  see  page  377  and  Chapter  12,  Section  6.) 

Let  us  look  at  some  examples  of  the  physical  meaning  and  the  use  of  Parseval’s 
theorem. 

Example  1.  In  Section  10  we  said  that  the  intensity  (energy  per  square  centimeter  per 
second)  of  a  sound  wave  is  proportional  to  the  average  value  of  the  square  of  the 
excess  pressure.  If  for  simplicity  we  write  (10.3)  with  letters  instead  of  numerical 
values,  we  have 


OO 

(H.6)  p(t)  =  Y  bn  sin  27 xnft. 

l 

For  this  case,  Parseval’s  theorem  (11.4)  says  that: 

OO  ^  OO 

(11.7)  The  average  of  [p(t)]2  =  the  average  of  b 2  sin2  2m: ft  . 

l  l 

Now  the  intensity  or  energy  (per  square  centimeter  per  second)  of  the  sound  wave 
is  proportional  to  the  average  of  [p(f)]2,  and  the  energy  associated  with  the  nth 
harmonic  is  proportional  to  the  average  of  62  sin2  2m: ft.  Thus  Parseval’s  theorem 
says  that  the  total  energy  of  the  sound  wave  is  equal  to  the  sum  of  the  energies 
associated  with  the  various  harmonics. 


Example  2.  Let  us  use  Parseval’s  theorem  to  find  the  sum  of  an  infinite  series.  From 
Problem  8.15(a)  written  in  complex  exponential  form  we  get: 

The  function  f(x)  of  period  2  which  is  equal  to  x  on  (—1, 1) 


= - (emx-e 

7 T 


— iirx  _  _  2iirx 

2 


26 


—  2i-nx 


3iirx  _  }_p-3iTrx 

3  3 


+  •••)• 
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Let  us  find  the  average  of  [f(x)}2  on  (—1, 1). 


The  average  of  [f(x)]2 


1 

[  x2dx=- 

X3 

1 

2„ 

l-i  2 

3 

-1_ 

1 

3' 


By  Parseval’s  theorem  (11.5),  this  is  equal  to  J2-oo  lcn| 2,  so  we  have 


—  OO 


1 


n 


2  ' 


Then  we  get  the  sum  of  the  series 

I  I  -V—  — 

+  4  +  9  +  '"  ~  n2  ~  2  3  6' 

We  have  seen  that  a  function  given  on  (0,1)  can  be  expanded  in  a  sine  series  by 
defining  it  on  (— 2,  0)  to  make  it  odd,  or  in  a  cosine  series  by  defining  it  on  (— 2 ,  0) 
to  make  it  even.  Here  is  another  useful  example  of  defining  a  function  to  suit  our 
purposes.  (We  will  need  this  in  Chapter  13.)  Suppose  we  want  to  expand  a  function 
defined  on  (0,  2 )  in  terms  of  the  basis  functions  sin(n  +  |)^  =  sin  (2n+^7rx .  Can 
we  do  it,  that  is,  do  these  functions  make  up  a  complete  set  for  this  problem?  Note 
that  our  proposed  basis  functions  have  period  42,  say  (—21,  21)  (observe  the  21  in 
the  denominator  where  you  are  used  to  2).  So  given  f(x)  on  (0,1),  we  can  define  it 
as  we  like  on  (1,21)  and  on  (—22,0).  We  know  (by  the  Dirichlet  theorem)  that  the 
functions  sin  and  cos  'n^r ,  all  n,  make  up  a  complete  set  on  (—21, 21).  We  need 
to  see  how,  on  (0,  l)  we  can  use  just  the  sines  (that’s  easy — make  the  function  odd) 
and  only  the  odd  values  of  n.  It  turns  out  (see  Problem  11)  that  if  we  define  f(x) 
on  (1,21)  to  make  it  symmetric  around  x  =  l,  then  all  the  s  for  even  n  are  equal 
to  zero.  So  our  desired  basis  set  is  indeed  a  complete  set  on  (0,2).  Similarly  we  can 
show  (Problem  11)  that  the  functions  cos  ^2n~^>nx  make  up  a  complete  set  on  (0, 2). 


PROBLEMS,  SECTION  11 

1.  Prove  (11.4)  for  a  function  of  period  22  expanded  in  a  sine-cosine  series. 

2.  Prove  that  if  f(x)  =  Cnemx ,  then  the  average  value  of  [f(x)]2  is  cnC-n. 

Show  by  Problem  7.12  that  for  real  f(x)  this  becomes  (11.5). 

3.  If  f(x)  is  complex,  we  usually  want  the  average  of  the  square  of  the  absolute  value 

of  f(x).  Recall  that  \f(x)\2  =  f(x)  ■  f(x),  where  f(x)  means  the  complex  conjugate 
of  f(x).  Show  that  if  a  complex  f(x)  =  then  (11.5)  holds. 

4.  When  a  current  I  flows  through  a  resistance  R,  the  heat  energy  dissipated  per  second 
is  the  average  value  of  RI2.  Let  a  periodic  (not  sinusoidal)  current  I(t)  be  expanded 
in  a  Fourier  series  I(t)  =  ,c„e120tnnt.  Give  a  physical  meaning  to  Parseval’s 
theorem  for  this  problem. 

Use  Parseval’s  theorem  and  the  results  of  the  indicated  problems  to  find  the  sum  of  the 
series  in  Problems  5  to  9. 

5.  The  series  1  +  7^7  +  i  d —  •,  using  Problem  9.6. 

3-*  5^ 

6.  The  series  — ,  using  Problem  9.9. 

71=1  ^ 
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7.  The  series  ,  using  Problem  5.8. 

n=  1  ^ 

8.  The  series  -i-,  using  Problem  9.10. 


9.  The  series  +  — Lj-  +  H - ,  using  Problem  5.11. 

3^  15^  35^ 

10.  A  general  form  of  Parseval’s  theorem  says  that  if  two  functions  are  expanded  in 
Fourier  series 


/(*) 

3(*) 


1 

2 

1 

2 


OO 

do  +  ^  ^  CLn 
1 

oo 

a'o  + 

1 


oo 

cos  nx  +  bn  sin  nx, 
i 

oo 

cos  nr  +  b’n  sin  nx, 
i 


then  the  average  value  of  f(x)g(x)  is  \aoa'0  +  |  a„a'n  +  §  bnb'n.  Prove  this. 

11.  (a)  Let  f(x)  on  (0,  21)  satisfy  f(2l  —  x)  =  f(x),  that  is,  f(x)  is  symmetric  about 

x  =  l.  If  you  expand  f(x)  on  (0,  21)  in  a  sine  series  bn  sin  show  that 
for  even  n,  bn  =  0.  Hint:  Note  that  the  period  of  the  sines  is  47.  Sketch  an 
f(x)  which  is  symmetric  about  x  =  l,  and  on  the  same  axes  sketch  a  few  sines 
to  see  that  the  even  ones  are  antisymmetric  about  x  =  l.  Alternatively,  write 
the  integral  for  bn  as  an  integral  from  0  to  l  plus  an  integral  from  l  to  21,  and 
replace  x  by  21  —  x  in  the  second  integral. 

(b)  Similarly,  show  that  if  we  define  f(2l  —  x)  =  —f(x),  the  cosine  series  has  an  =  0 
for  even  n. 


12.  FOURIER  TRANSFORMS 

We  have  been  expanding  periodic  functions  in  series  of  sines,  cosines,  and  complex 
exponentials.  Physically,  we  could  think  of  the  terms  of  these  Fourier  series  as 
representing  a  set  of  harmonics.  In  music  these  would  be  an  infinite  set  of  frequen¬ 
cies  nf,  n  =  1,  2,  3,  •  •  • ;  notice  that  this  set,  although  infinite,  does  not  by  any 
means  include  all  possible  frequencies.  In  electricity,  a  Fourier  series  could  repre¬ 
sent  a  periodic  voltage;  again  we  could  think  of  this  as  made  up  of  an  infinite  but 
discrete  (that  is,  not  continuous)  set  of  a-c  voltages  of  frequencies  m o.  Similarly, 
in  discussing  light,  a  Fourier  series  could  represent  light  consisting  of  a  discrete  set 
of  wavelengths  X/n,  n  =  1,  2,  •••,  that  is,  a  discrete  set  of  colors.  Two  related 
questions  might  occur  to  us  here.  First,  is  it  possible  to  represent  a  function  which 
is  not  periodic  by  something  analogous  to  a  Fourier  series?  Second,  can  we  some¬ 
how  extend  or  modify  Fourier  series  to  cover  the  case  of  a  continuous  spectrum  of 
wavelengths  of  light,  or  a  sound  wave  containing  a  continuous  set  of  frequencies? 

If  you  recall  that  an  integral  is  a  limit  of  a  sum,  it  may  not  surprise  you  very 
much  to  learn  that  the  Fourier  series  (that  is,  a  sum  of  terms)  is  replaced  by  a 
Fourier  integral  in  the  above  cases.  The  Fourier  integral  can  be  used  to  represent 
nonperiodic  functions,  for  example  a  single  voltage  pulse  not  repeated,  or  a  flash 
of  light,  or  a  sound  which  is  not  repeated.  The  Fourier  integral  also  represents 
a  continuous  set  (spectrum)  of  frequencies,  for  example  a  whole  range  of  musical 
tones  or  colors  of  light  rather  than  a  discrete  set. 
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Recall  from  equations  (8.2)  and  (8.3),  these  complex  Fourier  series  formulas: 


f(x)  =  J2 ' 


x/l 


(12.1) 


s/_/w< 


^  dx. 


The  period  of  fix)  is  21  and  the  frequencies  of  the  terms  in  the  series  are  n/(2l). 
We  now  want  to  consider  the  case  of  continuous  frequencies. 


Definition  of  Fourier  Transforms  We  state  without  proof  (see  plausibility  ar¬ 
guments  below)  the  formulas  corresponding  to  (12.1)  for  a  continuous  range  of 
frequencies. 


/•OO 

fix )  =  /  gia)elax  da, 

(12.2) 

J — OO 

9 (a)  =  J  fix)e~iax  dx. 

Compare  (12.2)  and  (12.1);  g(a)  corresponds  to  cn,  a  corresponds  to  n,  and  f^° 
corresponds  to  ^1°^.  This  agrees  with  our  discussion  of  the  physical  meaning  and 
use  of  Fourier  integrals.  The  quantity  a  is  a  continuous  analog  of  the  integral- valued 
variable  n ,  and  so  the  set  of  coefficients  cn  has  become  a  function  g{a)\  the  sum 
over  n  has  become  an  integral  over  a.  The  two  functions  fix)  and  g{a)  are  called 
a  pair  of  Fourier  transforms.  Usually,  g(a)  is  called  the  Fourier  transform  of  /( x), 
and  f{x)  is  called  the  inverse  Fourier  transform  of  g(a),  but  since  the  two  integrals 
differ  in  form  only  in  the  sign  in  the  exponent,  it  is  rather  common  simply  to  call 
either  a  Fourier  transform  of  the  other.  You  should  check  the  notation  of  any  book 
or  computer  program  you  are  using.  Another  point  on  which  various  references 
differ  is  the  position  of  the  factor  l/(2n)  in  (12.2);  it  is  possible  to  have  it  multiply 
the  f{x)  integral  instead  of  the  g{a)  integral,  or  to  have  the  factor  l/y/2f r  multiply 
each  of  the  integrals. 

The  Fourier  integral  theorem  says  that,  if  a  function  /( x)  satisfies  the  Dirichlet 
conditions  (Section  6)  on  every  finite  interval,  and  if  f_oa  \fix)\dx  is  finite,  then 
(12.2)  is  correct.  That  is,  if  g(a)  is  computed  and  substituted  into  the  integral 
for  f{x)  [compare  the  procedure  of  computing  the  cn’s  for  a  Fourier  series  and 
substituting  them  into  the  series  for  fix)],  then  the  integral  gives  the  value  of  fix) 
anywhere  that  fix)  is  continuous;  at  jumps  of  fix),  the  integral  gives  the  midpoint 
of  the  jump  (again  compare  Fourier  series,  Section  6).  The  following  discussion 
is  not  a  mathematical  proof  of  this  theorem  but  is  intended  to  help  you  see  more 
clearly  how  Fourier  integrals  are  related  to  Fourier  series. 
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It  might  seem  reasonable  to  think  of  trying  to  represent  a  function  which  is 
not  periodic  by  letting  the  period  (—1,1)  increase  to  (—00,00).  Let  us  try  to  do 
this,  starting  with  (12.1).  If  we  call  mr/l  =  an  and  an+i  —  an  =  tt /l  =  A  a,  then 
1/(2Z)  =  Aa/(27r)  and  (12.1)  can  be  rewritten  as 

OO 

(12.3)  f(x)=Y,CneanX, 

—  OO 

(12.4)  c„  =  1  f  f(x)e~ianX  dx=^  j'  f(u)e~ia"u  du. 

(We  have  changed  the  dummy  integration  variable  in  cn  from  x  to  u  to  avoid  later 
confusion.)  Substituting  (12.4)  into  (12.3),  we  have 


(12.5) 


where 

(12.6) 


/(*)  =  J2 


Aa 
27 r 


f-i 


f(u)e~ 


'  du 


A  a 


E  17  /  .  /Me**-'-"’  du  =  -  Y,  n«»)  Aa, 


f-l 


27 r 


F(an ) 


f(u)eia^x~u)  du. 


Now  )C!°oo  I’KJAa  looks  rather  like  the  formula  in  calculus  for  the  sum  whose 
limit,  as  Acr  tends  to  zero,  is  an  integral.  If  we  let  l  tend  to  infinity  [that  is,  let  the 
period  of  f(x)  tend  to  infinity],  then  Aa  =  7 t/1  — >  0,  and  the  sum  F(an)Aa 

goes  over  formally  to  F(a)da ;  we  have  dropped  the  subscript  n  on  a  now  that 
it  is  a  continuous  variable.  We  also  let  l  tend  to  infinity  and  an  =  a  in  (12.6)  to  get 


(12.7) 


F(a) 


f(u)eia(x~u) 


du. 


Replacing  x  F(an)Aa  in  (12.5)  by  j°°oQF(a)  da  and  substituting  from  (12.7) 
for  F(a)  gives 


(12.8) 


1  f  OO  1  P  OO  POO 

f(x )  =  ^zj  F(a )  da=—J  J  f(u)eia ^  duda 

/OO  POO 

eiax  da  /  f(u)e~iau  du. 

-00  J  —  OO 


1 

27T 


If  we  define  g(a)  by 

(12.9)  g(a)  =  f(x)e~iax  dx  =  i-  f(u)e~iau  du, 


then  (12.8)  gives 


f(x)  =  /  g(u)elax  da. 


(12.10) 


— OO 
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These  equations  are  the  same  as  (12.2).  Notice  that  the  actual  requirement  for  the 
factor  1  /  (27t)  is  that  the  product  of  the  constants  multiplying  the  two  integrals  for 
g(a)  and  /(x)  should  be  l/(2n);  this  accounts  for  the  various  notations  we  have 
discussed  before. 

Just  as  we  have  sine  series  representing  odd  functions  and  cosine  series  rep¬ 
resenting  even  functions  (Section  9),  so  we  have  sine  and  cosine  Fourier  integrals 
which  represent  odd  or  even  functions  respectively.  Let  us  prove  that  if  /(x)  is 
odd,  then  g(a)  is  odd  too,  and  show  that  in  this  case  (12.2)  reduces  to  a  pair  of 
sine  transforms.  The  corresponding  proof  for  even  /(x)  is  similar  (Problem  1).  We 
substitute 

e~zax  _  cog  ax  _  ^  gjn  ax 

into  (12.9)  to  get 


1 

(12.11)  g(a)  =  —  /  /(x)(cos  ax  —  i  sin  ax)  dx. 

271"  J- oo 


Since  cosax  is  even  and  we  are  assuming  that  /(x)  is  odd,  the  product  /(x)  cosax 
is  odd.  Recall  that  the  integral  of  an  odd  function  over  a  symmetric  interval  about 
the  origin  (here,  — oo  to  +00)  is  zero,  so  the  term  f_  /(x)  cos  ax  dx  in  (12.11)  is 
zero.  The  product  /(x)  sin  ax  is  even  (product  of  two  odd  functions);  recall  that 
the  integral  of  an  even  function  over  a  symmetric  interval  is  twice  the  integral  over 
positive  x.  Substituting  these  results  into  (12.11),  we  have 

1  i  f°° 

(12.12)  g(a)  =  —  /  /(x)(— isinax)  dx  = - /  /(x)  sin  ax  dx. 

2tt  J_00  7t  J o 


From  (12.12),  we  can  see  that  replacing  a  by  —a  changes  the  sign  of  sin  ax  and  so 
changes  the  sign  of  g(a).  That  is,  g{— a)  =  — g(a ),  so  g(a)  is  an  odd  function  as 
we  claimed.  Then  expanding  the  exponential  in  (12.10)  and  arguing  as  we  did  to 
obtain  (12.12),  we  find 

/OO  n  OO 

g(a)elax  dx  =  2i  /  g(a)  sin  ax  da. 

-00  J  0 

If  we  substitute  g(a)  from  (12.12)  into  (12.13)  to  obtain  an  equation  like  (12.8),  the 
numerical  factor  is  (—i/n)(2i)  =  2/7 r;  thus  the  imaginary  factors  are  not  needed. 
The  factor  2  jit  may  multiply  either  of  the  two  integrals  or  each  integral  may  be 
multiplied  by  ^J2  jit.  Let  us  make  the  latter  choice  in  giving  the  following  definition. 


Fourier  Sine  Transforms  We  define  /s(x)  and  <7s(a),  a  pair  of  Fourier  sine 
transforms  representing  odd  functions,  by  the  equations 


(12.14) 


fs{x)  = 


9s{x)  = 


<7s(a)  sin  ax  da, 


fs  (x)  sin  ax  dx. 


We  discuss  even  functions  in  a  similar  way  (Problem  1). 


382  Fourier  Series  and  Transforms 


Chapter  7 


Fourier  Cosine  Transforms  We  define  fc(x)  and  gc(a),  a  pair  of  Fourier 
cosine  transforms  representing  even  functions ,  by  the  equations 


(12.15) 


2  f°° 

fc(x)  =  \  —  /  gc (x)  cos  ax  da, 

V  7T  Jo 

/2  f°° 

—  /  fc{x)  cos  ax  dx. 

*  Jo 


Example  1 .  Let  us  represent  a  nonperiodic  function  as  a  Fourier 
integral.  The  function 


/(a 0  = 


f(x) 

1 

1 

1 

1 

1 

1 

1 

1 

1 

-1  0 

1 

Figure  12.1 


shown  in  Figure  12.1  might  represent  an  impulse  in  mechanics  (that  is,  a  force 
applied  only  over  a  short  time  such  as  a  bat  hitting  a  baseball),  or  a  sudden  short 
surge  of  current  in  electricity,  or  a  short  pulse  of  sound  or  light  which  is  not  repeated. 
Since  the  given  function  is  not  periodic,  it  cannot  be  expanded  in  a  Fourier  series, 
since  a  Fourier  series  always  represents  a  periodic  function.  Instead,  we  write  f(x) 
as  a  Fourier  integral  as  follows.  Using  (12.9),  we  calculate  g(a)\  this  process  is  like 
finding  the  c„’s  for  a  Fourier  series.  We  find 

/•OO  1  />1 


(12.16) 


9{a}  =  h  S-J(x)e~^ ix  =  h  l 

1  e —iax  1  |  e~ia  —  “ia 


e~iax  dx 
sin  a 


27 r  —  ia  _1  7ra  —2 i 


We  substitute  g(a)  from  (12.16)  into  the  formula  (12.10)  for  f(x)  (this  is  like  sub¬ 
stituting  the  evaluated  coefficients  into  a  Fourier  series).  We  get 

(  00  sin  rv 

(12.17)  f(x)=  /  - -elaxdx 


1  sin  a  (cos  ax  +  i  sin  ax  ,  2 

—  /  - da  =  — 


sin  a  cos  ax 


da 


since  (sin  a) /a  is  an  even  function.  We  thus  have  an  integral  representing  the 
function  f{x)  shown  in  Figure  12.1. 


Example  2.  We  can  use  (12.17)  to  evaluate  a  definite  integral.  Using  f{x)  in  Figure  12.1, 
we  find 


(12.18) 


sin  a  cos  a*  7r  I  f  for  d  <1,  \  for  \x\  =  1, 

- da=  —  fix)  =  <  ,, 

a  2  ]  0  for  \x\  >  1. 


Notice  that  we  have  used  the  fact  that  the  Fourier  integral  represents  the  midpoint 
of  the  jump  in  f(x)  at  \x\  =  1.  If  we  let  x  =  0,  we  get 


(12.19) 


sin  a  7T 

- da  = 

a  2 
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We  could  have  done  this  problem  by  observing  that  f{x)  is  an  even  function  and  so 
can  be  represented  by  a  cosine  transform.  The  final  results  (12.17)  to  (12.19)  would 
be  just  the  same  (Problem  2). 

In  Section  9,  we  sometimes  started  with  a  function  defined  only  for  x  >  0  and 
extended  it  to  be  even  or  odd  so  that  we  could  represent  it  by  a  cosine  series  or  by 
a  sine  series.  Similarly,  for  Fourier  transforms,  we  can  represent  a  function  defined 
for  x  >  0  by  either  a  Fourier  cosine  integral  (by  defining  it  for  x  <  0  so  that  it  is 
even) ,  or  by  a  Fourier  sine  integral  (by  defining  it  for  x  <  0  so  that  it  is  odd) .  (See 
Problem  2  and  Problems  27  to  30.) 


Parseval’s  Theorem  for  Fourier  Integrals  Recall  (Section  11)  that  Parse- 
val’s  theorem  for  a  Fourier  series  f{x)  =  Y^oaCnem^X^  relates  J^Af^dx  and 
!c™|2-  In  physical  applications  (see  Section  11),  Parseval’s  theorem  says  that 
the  total  energy  (say  in  a  sound  wave,  or  in  an  electrical  signal)  is  equal  to  the  sum 
of  the  energies  associated  with  the  various  harmonics.  Remember  that  a  Fourier 
integral  represents  a  continuous  spectrum  of  frequencies  and  that  g{a)  corresponds 
to  cn.  Then  we  might  expect  that  J2°°oo  \Cn\ 2  w°nld  be  replaced  by  \g(a)\2da 
(that  is,  a  “sum”  over  a  continuous  rather  than  a  discrete  spectrum)  and  that  Par¬ 
se  val’s  theorem  would  relate  \f\2dx  and  \g\2da.  Let  us  try  to  find  the 
relation. 

We  will  first  find  a  generalized  form  of  Parseval’s  theorem  involving  two  functions 
/i(x),  f2(x)  and  their  Fourier  transforms  gi(a),  32(0:).  Let  gi(a)  be  the  complex 
conjugate  of  gi(a);  from  (12.1),  we  have 

(12.20)  51(a)  =  ^  J°°  h(x)eiaxdx. 


We  now  multiply  (12.20)  by  (72(a)  and  integrate  with  respect  to  a: 


/OO  -t  pOO  pOO 

gi{a)g2{a)da  =  —  /  /  /i(x) 

-00  ^  J  —  OO  IJ-OO 


]  dx 


g2{a)da. 


Let  us  rearrange  (12.21)  so  that  we  integrate  first  with  respect  to  a.  [This  is 
justified  assuming  that  the  absolute  values  of  the  functions  fi  and  f2  are  integrable 
on  (—00,  00).] 


(12.22)  —  /  h(x)dx 


/OO 

52(a) 

-OO 


elax  da 


=  2 ~  J  fi(x)f2(x)dx 


by  (12.2).  Thus 


(12.23) 


1  r°°  _ 

5i(«)52(a)  da  =  —  /i(x)/2(x)  dx. 

)  J  —  OO 


(Compare  this  with  the  corresponding  Fourier  series  theorem  in  Problem  11.10.)  If 
we  set  /1  =  fi  =  f  and  51  =  52  =  5,  we  get  Parseval’s  theorem: 


\g(a)\2da 


1 

27T 


|/(x)|2  dx. 


(12.24) 


—  OO 


—  OO 
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PROBLEMS,  SECTION  12 

1.  Following  a  method  similar  to  that  used  in  obtaining  equations  (12.11)  to  (12.14), 
show  that  if  /(*)  is  even,  then  g(a)  is  even  too.  Show  that  in  this  case  /(*)  and 
g(a)  can  be  written  as  Fourier  cosine  transforms  and  obtain  (12.15). 

2.  Do  Example  1  above  by  using  a  cosine  transform  (12.15).  Obtain  (12.17);  for  x  >  0, 
the  0  to  oo  integral  represents  the  function 


/(*)  = 


0  <  x  <  1, 
x  >  1. 


Represent  this  function  also  by  a  Fourier  sine  integral  (see  the  paragraph  just  before 
Parseval’s  theorem). 


In  Problems  3  to  12,  find  the  exponential  Fourier  transform  of  the  given  /( x)  and  write 
/(*)  as  a  Fourier  integral  [that  is,  find  g(a)  in  equation  (12.2)  and  substitute  your  result 
into  the  first  integral  in  equation  (12.2)]. 


{  —  1,  —  n  <  x  <  0 

1,  0  <  x  <  n 

0,  |a:|  >  7T 


4-  /(*) 


1,  71-/2  <  |*|  <  7T 

0,  otherwise 


5.  /(*) 


1,  0  <  *  <  1 

0,  otherwise 


6.  /(*) 


*,  |*|  <  1 

0,  |*|  >  1 


7-  /(*) 


|*|,  |*|  <  1 

0,  |*|  >  1 


8.  /(*) 


*,  0  <  *  <  1 

0,  otherwise 


H  fM  =  /C0S*5  — 7r/2  <  *  <  7t/2  fsin*,  |*|<tt/2 

'  ^  {  0,  |*|  >  tt/2  •  /(  )  |  Qi  ^1  >  n/2 

Hint:  In  Problems  11  and  12,  use  complex  exponentials. 

In  Problems  13  to  16,  find  the  Fourier  cosine  transform  of  the  function  in  the  indicated 
problem,  and  write  /(*)  as  a  Fourier  integral  [use  equation  (12.15)].  Verify  that  the  cosine 
integral  for  /(*)  is  the  same  as  the  exponential  integral  found  previously. 


13.  Problem  4. 


14.  Problem  7. 


15. 


Problem  9. 


16. 


Problem  11. 
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In  Problems  17  to  20,  find  the  Fourier  sine  transform  of  the  function  in  the  indicated 
problem,  and  write  f(x)  as  a  Fourier  integral  [use  equation  (12.14)].  Verify  that  the  sine 
integral  for  f(x)  is  the  same  as  the  exponential  integral  found  previously. 


17.  Problem  3. 

19.  Problem  10. 


18.  Problem  6. 

20.  Problem  12. 


21.  Find  the  Fourier  transform  of  f{x)  =  e~x  ^2a  \  Hint:  Complete  the  square  in  the 
x  terms  in  the  exponent  and  make  the  change  of  variable  y  =  x  +  a2ia.  Use  tables 
or  computer  to  evaluate  the  definite  integral. 

22.  The  function  ji(a)  =  (acosa  —  sina)/a  is  of  interest  in  quantum  mechanics.  [It 
is  called  a  spherical  Bessel  function;  see  Chapter  12,  equation  (17.4).]  Using  Prob¬ 
lem  18,  show  that 


/” 

Jo 


j i  (a)  sin  ax  da  = 
23.  Using  Problem  17,  show  that 


■xx/2,  —  1  <  x  <  1, 

0,  |a;|  >  1. 


/" 


f 


1  —  COS  7T  a  7T 

- sm  ada  =  —, 

a  2 

1  —  COS  7TQ  .  ,  7T 

- sm  na  da  =  — . 

a  4 


24.  (a)  Find  the  exponential  Fourier  transform  of  f(x)  =  e  ^  and  write  the  inverse 

transform.  You  should  find 


f 


a2  +  1  da  —  2  6~ 


(b)  Obtain  the  result  in  (a)  by  using  the  Fourier  cosine  transform  equations  (12.15). 

(c)  Find  the  Fourier  cosine  transform  of  f{x)  =  1/(1  +  x2).  Hint:  Write  your 
result  in  (b)  with  x  and  a  interchanged. 

25.  (a)  Represent  as  an  exponential  Fourier  transform  the  function 


/(*)  = 


sin  a;,  0  <  x  <  n, 

0,  otherwise. 


Hint:  Write  sin  a:  in  complex  exponential  form. 

(b)  Show  that  your  result  can  be  written  as 

cos  ax  +  cos  a(x  —  n) 


/(X)  =  l  / 

n  Jo 

26.  Using  Problem  15,  show  that 


1  —  a2 


da. 


f 


1  —  COS  a  ,  7T 

- - - da  =  — . 


Represent  each  of  the  following  functions  (a)  by  a  Fourier  cosine  integral;  (b)  by  a  Fourier 
sine  integral.  Hint:  See  the  discussion  just  before  Parseval’s  theorem. 


27.  f(x)  = 


1,  0  <  x  <  7t/2 

0,  x  >  7t/2 


28.  f(x)  = 


1,  2  <  x  <  4 

0,  0<x<2,  x>4 
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r — i,  o  <  a;  <  2 

29.  f{x)  =  i  1,  2<x<3 

y  o,  %  >  3 


30.  f(x) 


1  -  x/2, 
0, 


0  <  x  <  2 
x  >  2 


Verify  Parseval’s  theorem  (12.24)  for  the  special  cases  in  Problems  31  to  33. 

31.  f(x)  as  in  Figure  12.1.  Hint:  Integrate  by  parts  and  use  (12.18)  to  evaluate 

f  lfl(a)|2  da. 

J  —  OO 


32.  f(x)  and  g(a)  as  in  Problem  21. 

33.  f(x)  and  g(a)  as  in  Problem  24a. 

34.  Show  that  if  (12.2)  is  written  with  the  factor  l/%/27r  multiplying  each  integral,  then 
the  corresponding  form  of  Parseval’s  theorem  (12.24)  is 


\g(a)\2da. 


35.  Starting  with  the  symmetrized  integrals  as  in  Problem  34,  make  the  substitutions 
a  =  2np/h  (where  p  is  the  new  variable,  h  is  a  constant),  f(x)  =  i/j(x),  g(a)  = 
^h/2n  4>(p);  show  that  then 

1  r°° 

ip(x)  =  —7f  <j>(p)e2nwx/h  dp, 

v  h  J — oo 

i  r°° 

<fr(p)  =  J-  i/j(x)e~2nipx^h  dx, 

v  h  J  — oo 

/oo  roo 

\ip{x)\2dx  =  /  \(j>{p)\2  dp. 

-oo  J  —  OO 


This  notation  is  often  used  in  quantum  mechanics. 

36.  Normalize  f{x)  in  Problem  21;  that  is  find  the  factor  N  so  that  \N f(x)\2  =  1. 

Let  ip(x)  =  N f(x),  and  find  cj>{p )  as  given  in  Problem  35.  Verify  Parseval’s  theorem, 
that  is,  show  that  f^,  \4>{p)\2  dp  =  1. 


13.  MISCELLANEOUS  PROBLEMS 


1.  The  displacement  (from  equilibrium)  of  a  particle  executing  simple  harmonic  motion 
may  be  either  y  =  A  sin  ut  or  y  =  A  sin(wt  +  <j>)  depending  on  our  choice  of  time 
origin.  Show  that  the  average  of  the  kinetic  energy  of  a  particle  of  mass  m  (over 
a  period  of  the  motion)  is  the  same  for  the  two  formulas  (as  it  must  be  since  both 
describe  the  same  physical  motion).  Find  the  average  value  of  the  kinetic  energy  for 
the  sin(u;t  +  4> )  case  in  two  ways: 


(a)  By  selecting  the  integration  limits  (as  you  may  by  Problem  4.1)  so  that  a 
change  of  variable  reduces  the  integral  to  the  sino>t  case. 

(b)  By  expanding  sin(cuf  +  tj>)  by  the  trigonometric  addition  formulas  and  using 
(5.2)  to  write  the  average  values. 


2. 


The  symbol  [*]  means  the  greatest  integer  less  than  or  equal  to  x  (for  example, 
[3]  =  3,  [2.1]  =  2,  [—4.5]  =  —5).  Expand  x  —  [a;]  —  ^  in  an  exponential  Fourier  series 
of  period  1.  Hint:  Sketch  the  function. 


Answer:  — 
2n 


+  ■ 


■  +  ■ 
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3.  We  have  said  that  Fourier  series  can  represent  discontinuous  functions  although 
power  series  cannot.  It  might  occur  to  you  to  wonder  why  we  could  not  substitute 
the  power  series  for  sin  no;  and  cos  nx  (which  converge  for  all  x )  into  a  Fourier 
series  and  collect  terms  to  obtain  a  power  series  for  a  discontinuous  function.  As  an 
example  of  what  happens  if  we  try  this,  consider  the  series  in  Problem  9.5.  Show  that 
the  coefficients  of  x,  if  collected,  form  a  divergent  series;  similarly,  the  coefficients 
of  x 3  form  a  divergent  series,  and  so  on. 


4.  The  diagram  shows  a  “relaxation”  oscillator.  The  charge  q  on  the  capacitor  builds  up 
until  the  neon  tube  fires  and  discharges  the  capacitor  (we  assume  instantaneously). 
Then  the  cycle  repeats  itself  over  and  over. 


(a)  The  charge  q  on  the  capacitor  satisfies 
the  differential  equation 

«§  +  !!- V, 

at  c 

where  R  is  the  resistance,  C  is  the  ca¬ 
pacitance,  and  V  is  the  constant  d-c 


- vWVW - 

=^v  0= 

_  Neon 

“  9  tube  W 

voltage,  as  shown  in  the  diagram.  Show  that  if  q  =  0  when  t  =  0,  then  at  any 
later  time  t  (during  one  cycle,  that  is,  before  the  neon  tube  fires) 


q  =  CV{  l-e~t/RG). 


(b)  Suppose  the  neon  tube  fires  at  t  =  \RC.  Sketch  q  as  a  function  of  t  for  several 
cycles. 

(c)  Expand  the  periodic  q  in  part  (b)  in  an  appropriate  Fourier  series. 


5.  Consider  one  arch  of  f(x)  =  sin®.  Show  that  the  average  value  of  /( x)  over  the 
middle  third  of  the  arch  is  twice  the  average  value  over  the  end  thirds. 

6.  Let  f(t)  =  elult  on  (—n,n).  Expand  /(f)  in  a  complex  exponential  Fourier  series  of 
period  27t.  (Assume  u  ^  integer.) 

7.  Given  f(x)  =  |*|  on  (— n,n),  expand  f(x)  in  an  appropriate  Fourier  series  of  pe¬ 
riod  2-7T. 


8.  From  facts  you  know,  find  in  your  head  the  average  value  of 

(a)  x3  —  3  sinh  2x  +  sin2 8  nx  +  cos  37ro;  on  (—5,  5). 

(b)  2  sin2  3a:  —  4  cos  a:  +  5a:  cosh  2x  —  x  cos2  x  on  (— n,  n). 


9.  Given  f(x) 


x,  0  <  x  <  1, 
-2,  1  <  a:  <  2. 


(a)  Sketch  at  least  three  periods  of  the  graph  of  the  function  represented  by  the 
sine  series  for  f(x).  Without  finding  any  series,  answer  the  following  questions: 

(b)  To  what  value  does  the  sine  series  in  (a)  converge  at  x  =  1?  At  x  =  2?  At 
x  =  0?  At  x  =  —  1? 

(c)  If  the  given  function  is  continued  with  period  2  and  then  is  represented  by  a 

complex  exponential  series  X^^L^oo  ,  what  is  the  value  of  Yl°°=-oo  I 1 2 ? 
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10.  (a)  Sketch  at  least  three  periods  of  the  graph  of  the  function  represented  by  the 

cosine  series  for  f(x)  in  Problem  9. 

(b)  Sketch  at  least  three  periods  of  the  graph  of  the  exponential  Fourier  series  of 
period  2  for  /( x)  in  Problem  9. 

(c)  To  what  value  does  the  cosine  series  in  (a)  converge  at  x  =  0?  At  x  =  1?  At 
x  =  2?  At  x  =  -2 ? 

(d)  To  what  value  does  the  exponential  series  in  (b)  converge  at  x  =  0?  At  x  =  1? 
At  x  —  |?  At  x  =  —2. 

11.  Find  the  three  Fourier  series  in  Problems  9  and  10. 

12.  What  would  be  the  apparent  frequency  of  a  sound  wave  represented  by 

.  cos  60mvt 

P(*)  =  l,1oo(ra_3)a  +  i? 

n=  1 

13.  (a)  Given  f(x)  =  (n  —  x)/2  on  (0,  n),  find  the  sine  series  of  period  2n  for  f(x). 

(b)  Use  your  result  in  (a)  to  evaluate  J/)  1  /n2 . 

14.  (a)  Find  the  Fourier  series  of  period  2  for  f(x)  =  (x  —  l)2  on  (0,  2). 

(b)  Use  your  result  in  (a)  to  evaluate  J/)  1/n4- 

15.  Given 

1,  -2  <  x  <  0, 


/(*)  = 


-1, 


0  <  x  <  2, 


find  the  exponential  Fourier  transform  g(a)  and  the  sine  transform  g3(a).  Write 
f(x)  as  an  integral  and  use  your  result  to  evaluate 


f 

Jo 


(cos  2a  —  1)  sin  2a 


da. 


16.  Given 


x,  0  <  x  <  1, 
f(x)={  2-x,  1  <  x  <  2, 

0,  x  >  2, 

find  the  cosine  transform  of  f(x)  and  use  it  to  write  /(*)  as  an  integral.  Use  your 
result  to  evaluate 

cos2  a  sin2  a/2 


/" 

Jo 


■  da. 


17.  Show  that  the  Fourier  sine  transform  of  x~x^2  is  a-1//2.  Hint:  Make  the  change 
of  variable  z  =  ax.  The  integral  J0°°  z smzdz  can  be  found  by  computer  or  in 
tables. 

18.  Let  f(x)  and  g(a)  be  a  pair  of  Fourier  transforms.  Show  that  df  /dx  and  iag(a)  are 
a  pair  of  Fourier  transforms.  Hint:  Differentiate  the  first  integral  in  (12.2)  under 
the  integral  sign  with  respect  to  x.  Use  (12.23)  to  show  that 

/°°  1  r°°  -  <7 

^a\g(a)\2da=—  J_J(x)-f(x)dx. 

Comment:  This  result  is  of  interest  in  quantum  mechanics  where  it  would  read,  in 
the  notation  of  Problem  12.35: 


J  p\Hp)\2  dP  =  I  tp*(x)  ^{x)dx- 
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19.  Find  the  form  of  Parseval’s  theorem  (12.24)  for  sine  transforms  (12.14)  and  for  cosine 
transforms  (12.15). 

20.  Find  the  exponential  Fourier  transform  of 


/(*) 


2 a  —  \x\,  |*|  <  2a, 

0,  |*|  >  2a, 


and  use  your  result  with  Parseval’s  theorem  to  evaluate 


Jo 


■  da. 


21.  Define  a  function  h(x)  =  YlkL- oo  f(x  +  2/cvr) ,  assuming  that  the  series  converges 
to  a  function  satisfying  Dirichlet  conditions  (Section  6).  Verify  that  h(x)  does  have 
period  27r. 

(a)  Expand  h(x)  in  an  exponential  Fourier  series  h(x)  =  X)Xoo  cne™x',  show  that 
Cn  =  g(n)  where  g(a)  is  the  Fourier  transform  of  /(*).  Hint:  Write  cn  as  an 
integral  from  0  to  2-7r  and  make  the  change  of  variable  u  =  x  +  2kn.  Note  that 
e~2inkn  _  ^  and  ^e  gum  on  gjves  a  single  integral  from  — oo  to  oo. 

(b)  Let  *  =  0  in  (a)  to  get  Poisson’s  summation  formula  X^oo  /(2fc n)  =  X)^°  g(n). 
This  result  has  many  applications;  for  example:  statistical  mechanics,  commu¬ 
nication  theory,  theory  of  optical  instruments,  scattering  of  light  in  a  liquid, 
and  so  on.  (See  Problem  22.) 

22.  Use  Poisson’s  formula  (Problem  21b)  and  Problem  20  to  show  that 


—  OO 


0  <  6  <  7T. 


(This  sum  is  needed  in  the  theory  of  scattering  of  light  in  a  liquid.)  Hint:  Consider 
/(*)  and  g(a)  as  in  Problem  20.  Note  that  f(2kn)  =  0  except  for  k  =  0  if  a  <  n. 
Put  a  =  n,  a  =  6 . 

23.  Use  Parseval’s  theorem  and  Problem  12.11  to  evaluate 


cos2(a7r/2) 
(1  -a2)2 


da. 


Ordinary  Differential  Equations 


1.  INTRODUCTION 

A  great  many  applied  problems  involve  rates,  that  is,  derivatives.  An  equation 
containing  derivatives  is  called  a  differential  equation.  If  it  contains  partial  deriva¬ 
tives,  it  is  called  a  partial  differential  equation ;  otherwise  it  is  called  an  ordinary 
differential  equation.  In  this  chapter  we  shall  consider  methods  of  solving  ordinary 
differential  equations  which  occur  frequently  in  applications.  Let  us  look  at  a  few 
examples. 

Newton’s  second  law  in  vector  form  is  F  =  ma.  If  we  write  the  acceleration  as 
dv/dt ,  where  v  is  the  velocity,  or  as  d2r/dt2,  where  r  is  the  displacement,  we  have 
a  differential  equation  (or  a  set  of  differential  equations,  one  for  each  component). 
Thus  any  mechanics  problem  in  which  we  want  to  describe  the  motion  of  a  body 
(automobile,  electron,  or  satellite)  under  the  action  of  a  given  force,  involves  the 
solution  of  a  differential  equation  or  a  set  of  differential  equations. 

The  rate  at  which  heat  Q  escapes  through  a  window  or  from  a  hot  water  pipe  is 
proportional  to  the  area  A  and  to  the  rate  of  change  of  temperature  with  distance 
in  the  direction  of  flow  of  heat.  Thus  we  have 


(1.1) 


dt  dx 


(, k  is  called  the  thermal  conductivity  and  depends  on  the  material  through  which  the 
heat  is  flowing).  Here  we  have  two  different  derivatives  in  the  differential  equation. 
In  such  a  problem  we  might  know  either  dT/dx  or  dQ/dt  and  solve  the  differential 
equation  to  find  either  T  as  a  function  of  x,  or  Q  as  a 
function  of  t.  (See  Problems  2.23  to  2.25.) 

Consider  a  simple  series  circuit  (Figure  1.1)  contain¬ 
ing  a  resistance  R,  a  capacitance  C .  an  inductance  L, 
and  a  source  of  emf  V.  If  the  current  flowing  around  the 
circuit  at  time  t  is  I{t)  and  the  charge  on  the  capacitor 
is  q(t),  then  /  =  dq/dt.  The  voltage  across  R  is  RI , 
the  voltage  across  C  is  q/C,  and  the  voltage  across  L  is  Figure  1.1 
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L{dl/dt).  Then  at  any  time  we  must  have 

(L2)  Lil  +  RI+l  =  V. 

If  we  differentiate  this  equation  with  respect  to  t  and  substitute  dq/dt  =  /,  we  have 

d2I  dl  I  dV 
^  L~dt2+Rdt  +  C  =  ~dt 

as  the  differential  equation  satisfied  by  the  current  /  in  a  simple  series  circuit  with 
given  L,  R ,  and  C ,  and  a  given  V(t). 

There  are  many  more  examples  of  physical  problems  leading  to  differential  equa¬ 
tions;  we  shall  consider  some  of  them  later  in  the  text  and  problems.  You  might  find 
it  interesting  at  this  point  to  browse  through  the  problems  to  see  the  wide  range  of 
topics  giving  rise  to  differential  equations. 

The  order  of  a  differential  equation  is  the  order  of  the  highest  derivative  in  the 
equation.  Thus  the  equations 

y'  +  xy 2  =  1, 
xy'  +  y  =  ex, 

(1-4)  ± 

dt  9' 

L-j-  +  RI  =  V, 

dt 

are  first-order  equations,  while  (1.3)  and 

d2r 

m-rpr  =  -hr 

dtz 

are  second-order  equations.  A  linear  differential  equation  (with  x  as  independent 
and  y  as  dependent  variable)  is  one  of  the  form 

aoy  +  ary1  +  a2y”  +  o3 y"’  -I - =  6, 

where  the  a’s  and  b  are  either  constants  or  functions  of  x.  The  first  equation  in 
(1.4)  is  not  linear  because  of  the  y 2  term;  all  the  other  equations  we  have  mentioned 
so  far  are  linear.  Some  other  examples  of  nonlinear  equations  are: 


y1  =  cot  y 
yy'  =  1 
/2 

V  =xy 


(not  linear  because  of  the  term  coty); 
(not  linear  because  of  the  product  yy') ; 
(not  linear  because  of  the  term  y'2). 


Many  of  the  differential  equations  which  occur  in  applied  problems  are  linear  and 
of  the  first  or  second  order;  we  shall  be  particularly  interested  in  these. 


A  solution  of  a  differential  equation  (in  the  variables  x  and  y)  is  a  relation  between 
x  and  y  which,  if  substituted  into  the  differential  equation,  gives  an  identity. 
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Example  1.  The  relation 

(1.5)  y  =  smx  +  C 
is  a  solution  of  the  differential  equation 

(1.6)  y' =  cosx 

because  if  we  substitute  (1.5)  into  (1.6)  we  get  the  identity  cos  a;  =  cos  a:. 

Example  2.  The  equation  y"  =  y  has  solutions  y  =  ex  or  y  =  e~x  or  y  =  Aex  +  Be~x  as 
you  can  verify  by  substitution. 

If  we  integrate  y '  =  f(x),  the  expression  for  y,  namely  y  =  f  f(x)dx  +  C, 
contains  one  arbitrary  constant  of  integration.  If  we  integrate  y"  =  g{ x)  twice  to 
get  y(x),  then  y  contains  two  independent  integration  constants.  We  might  expect 
that  in  general  the  solution  of  a  differential  equation  of  the  nth  order  would  contain 
n  independent  arbitrary  constants.  Note  that  in  Example  1  above,  the  solution 
of  the  first-order  equation  y'  =  coscc  contained  one  arbitrary  constant  C,  and  in 
Example  2  the  solution  y  =  Aex  +  Be~x  of  the  second-order  equation  y"  =  y 
contained  two  arbitrary  constants  A  and  B. 


Any  linear  differential  equation  of  order  n  has  a  solution  containing  n  independent 
arbitrary  constants,  from  which  all  solutions  of  the  differential  equation  can  be 
obtained  by  letting  the  constants  have  particular  values.  This  solution  is  called 
the  general  solution  of  the  linear  differential  equation. 


(This  may  not  be  true  for  nonlinear  equations;  see  Section  2.) 

In  applications,  we  usually  want  a  particular  solution,  that  is,  one  which  satis¬ 
fies  the  differential  equation  and  some  other  requirements  as  well.  Here  are  some 
examples  of  this. 


Example  3.  Find  the  distance  which  an  object  falls  under  gravity  in  t  seconds  if  it  starts 
from  rest. 

Let  x  be  the  distance  the  object  has  fallen  in  time  t.  The  acceleration  of  the 
object  is  g,  the  acceleration  of  gravity.  Then  we  have 


(1.7) 


d2x 

~dP=9' 


Integrating,  we  get 

dx 

(1.8)  A—  =  gt  +  const.  =  gt  +  vq, 


(1.9)  x=\gt2+v0t  +  x  o, 

where  vq  and  Xo  are  the  values  of  v  and  x  at  t  =  0.  Now  (1.9)  is  the  general  solution 
of  (1.7)  (because  it  is  a  solution  of  a  second-order  linear  differential  equation  and 
contains  two  independent  arbitrary  constants) .  We  want  the  particular  solution  for 
which  vq  =  0  (since  the  object  starts  from  rest),  and  xo  =  0  (since  the  distance  the 
object  has  fallen  is  zero  at  t  =  0).  Then  the  desired  particular  solution  is 

(1.10)  x  =  \gt2. 
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Example  4.  Find  the  solution  of  y"  =  y  which  passes  through  the  origin  and  through  the 
point  (In  2,  |). 

The  general  solution  of  the  differential  equation  is 


y  =  Aex  +  Be~x 


(see  Example  2).  If  the  given  points  satisfy  the  equation  of  the  curve,  we  must  have 


0  =  A  +  B  or  A  =  —B, 

|  =  Aeln2  +  Be~ln2  =  A  ■  2  +  B  ■  \  =  2A  —  =  \A. 


Thus  we  get 


A  =  —B  = 


and  the  desired  particular  solution  is 


y  =  \(ex  —  e  x)  =  sinli x. 


The  given  conditions  which  are  to  be  satisfied  by  the  particular  solution  are 
called  boundary  conditions,  or  when  they  are  conditions  at  t  =  0  they  may  be  called 
initial  conditions.  For  linear  equations,  the  desired  particular  solution  can  be  found 
from  the  general  solution  by  determining  the  values  of  the  constants  as  we  did  in 
Example  4.  (For  nonlinear  equations,  see  Section  2.) 

As  you  study  methods  of  solving  various  types  of  differential  equations  in  the 
following  sections,  you  may  wonder  whether  you  can  use  computer  solutions  and 
not  bother  to  learn  these  techniques.  Just  as  for  indefinite  integrals  (see  Chapter  5, 
Section  1),  there  may  be  various  forms  for  the  solution  of  a  differential  equation, 
and  your  computer  may  not  give  the  one  you  need.  In  order  to  make  intelligent  use 
of  computer  solutions,  you  need  to  know  something  about  what  to  expect,  and  an 
effective  way  of  gaining  this  knowledge  is  to  solve  some  equations  by  hand.  (See 
Example  1,  Section  3.)  By  comparing  your  solutions  with  computer  solutions,  you 
will  learn  what  you  can  (and  cannot)  expect  from  your  computer. 

The  graphing  capabilities  of  your  computer  are  very  useful  in  differential  equa¬ 
tions.  Consider  a  first-order  equation,  say  y'  =  f(x,y).  If  the  solution  of  this 
differential  equation  is  y  =  y(x),  the  differential  equation  gives  the  slope  y'  of  the 
solution  curve  at  each  point  (x,y).  Suppose,  for  a  large  number  of  points,  we  draw 
a  short  line  (or  vector)  centered  on  each  point  and  with  the  slope  y'  at  that  point. 
(This  would  be  a  big  job  by  hand,  but  your  computer  does  it  easily.)  This  plot  is 
called  a  slope  field ,  or  a  direction  field ,  or  a  vector  field.  From  such  a  diagram  we 
can  see  the  general  trend  of  the  solution  curves  even  without  solving  the  equation. 
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Example  5.  In  Figure  1.2,  we  have  plotted  a  “slope  field”  for  the  differential  equation 
y'  =  cos  x.  Note  how  you  can  trace  the  general  shape  of  the  solution  curves,  even 
without  knowing  from  Example  1  that  their  equations  are  y  =  sin  x  +  C. 

PROBLEMS,  SECTION  1 

1.  Verify  the  statement  of  Example  2.  Also  verify  that  y  =  cosh®  and  y  =  sinh®  are 
solutions  of  y"  =  y. 

2.  Solve  Example  4  using  the  general  solution  y  =  a  sinh®  +  &cosh®. 

3.  Verify  that  y  =  sin®,  y  =  cos®,  y  =  exx ,  and  y  =  e~xx  are  all  solutions  of  y"  =  —y. 

4.  Find  the  distance  which  an  object  moves  in  time  t  if  it  starts  from  rest  and  has 
an  acceleration  d2x/dt2  =  ge~kt.  Show  that  for  small  t  the  result  is  approximately 
(1.10),  and  for  very  large  t,  the  speed  dx/dt  is  approximately  constant.  The  constant 
is  called  the  terminal  speed.  (This  problem  corresponds  roughly  to  the  motion  of  a 
parachutist.) 

5.  Find  the  position  x  of  a  particle  at  time  t  if  its  acceleration  is  d2x/dt2  =  A  sin  tut. 

6.  A  substance  evaporates  at  a  rate  proportional  to  the  exposed  surface.  If  a  spherical 
mothball  of  radius  I  cm  has  radius  0.4  cm  after  6  months,  how  long  will  it  take: 

(a)  For  the  radius  to  be  i  cm? 

(b)  For  the  volume  of  the  mothball  to  be  half  of  what  it  was  originally? 

7.  The  momentum  p  of  an  electron  at  speed  v  near  the  speed  c  of  light  increases 
according  to  the  formula  p  =  mv/y/ 1  —  v2 /c2  ,  where  m  is  a  constant  (mass  of 
the  electron).  If  an  electron  is  subject  to  a  constant  force  F,  Newton’s  second  law 
describing  its  motion  is 

dp  d  mv  p 

dt  dt  yj\  —  u2/c2 

Find  v(t)  and  show  that  v  — >  c  as  t  — >  oo.  Find  the  distance  traveled  by  the  electron 
in  time  t  if  it  starts  from  rest. 
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2.  SEPARABLE  EQUATIONS 

Every  time  you  evaluate  an  integral 

(2.1)  y=Jf(x)dx, 
you  are  solving  a  differential  equation,  namely 

(2.2)  y'  =  £  =  /(*)• 

This  is  a  simple  example  of  an  equation  which  can  be  written  with  only  y  terms  on 
one  side  of  the  equation  and  only  x  terms  on  the  other: 

(2.3)  dy  =  f(x)  dx. 

Whenever  we  can  separate  the  variables  this  way,  we  call  the  equation  separable, 
and  we  get  the  solution  by  just  integrating  each  side  of  the  equation. 


Example  1.  The  rate  at  which  a  radioactive  substance  decays  is  proportional  to  the 
remaining  number  of  atoms.  If  there  are  No  atoms  at  t  =  0,  find  the  number  at 
time  t. 

The  differential  equation  for  this  problem  is 


(2.4) 


dN 

dt 


=  -A  N. 


(The  proportionality  constant  A  is  called  the  decay  constant.)  This  is  a  separable 
equation;  we  write  it  as  dN/N  =  —A  dt.  Then  integrating  both  sides,  we  get 
In  IV  =  —\t  +  const.  Since  we  are  given  N  =  No  at  t  =  0,  we  see  that  the  constant 
is  In  Nq.  Solving  for  N,  we  have 


(2.5) 


N  =  N0e~xt. 


(For  further  discussion  of  radioactive  decay  problems,  see  Section  3,  Example  2, 
and  Problems  2.19b  and  3.19  to  3.21.) 


Example  2.  Solve  the  differential  equation 

(2.6)  xy'  =  y+  1. 

To  separate  variables,  we  divide  both  sides  of  (2.6)  by  x(y  +  1);  this  gives 


dy 

y  + 1 


dx 

x 


(2.7)  -2—  =  -  or 

y  +  1  x 

Integrating  each  side  of  (2.7),  we  have 

(2.8)  ln(y  +  1)  =  In  x  +  const.  =  In  x  +  In  a  =  ln(ax) . 

(We  have  called  the  constant  of  integration  In  a  for  simplicity.)  Then  (2.8)  gives  the 
solution  of  (2.6),  namely 


(2.9) 


y  +  1  =  ax. 


This  general  solution  represents  a  family  of  curves  in  the  (x,  y)  plane,  one  curve 
for  each  value  of  the  constant  a.  Or  we  may  call  the  general  solution  (2.9)  a  family 
of  solutions  of  the  differential  equation  (2.6).  Finding  a  particular  solution  means 
selecting  one  particular  curve  from  the  family. 
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Orthogonal  Trajectories  In  Figure  2.1,  the  straight  lines  through  (0,  —1)  are 
the  family  of  curves  given  by  the  solutions 
(2.9)  of  the  differential  equation  (2.6). 

They  might  represent,  for  example,  the 
lines  of  electric  force  due  to  an  electric 
charge  at  (0,-1).  The  circles  in  Figure 
2.1  are  then  curves  of  constant  electro¬ 
static  potential  (called  equipotentials — see 
Chapter  6,  Sections  5  and  6).  Note  that 
the  lines  of  force  intersect  the  equipoten- 
tial  curves  at  right  angles;  each  family  of 
curves  is  called  a  set  of  orthogonal  trajec¬ 
tories  of  the  other  family.  It  is  often  of 
interest  to  find  the  orthogonal  trajecto¬ 
ries  of  a  given  family  of  curves.  Let  us  do 
this  for  the  family  (2.9).  (In  this  case  we 
know  in  advance  that  our  answer  will  be 
the  set  of  circles  in  Figure  2.1.) 


Figure  2.1 


First  we  find  the  slope  of  a  line  of  the  family  (2.9),  namely, 


(2.10) 


y  =  a. 


For  each  a  this  gives  the  slope  of  one  line.  We  want  a  formula  (as  a  function  of  x 
and  y)  which  gives  the  slope,  at  any  point  of  the  plane,  of  the  line  through  that 
point.  To  obtain  this,  we  eliminate  a  between  (2.9)  and  (2.10)  to  get 


(2.11) 


/ 

y  = 


y  + 1 

X 


[Or,  given  (2.6)  rather  than  (2.9),  we  could  simply  solve  for  y' .}  Now  recall  from 
analytic  geometry  that  the  slopes  of  two  perpendicular  lines  are  negative  reciprocals. 
Then  at  each  point  we  want  the  slope  of  the  orthogonal  trajectory  curve  to  be  the 
negative  reciprocal  of  the  slope  of  the  line  given  by  (2.11).  Thus, 


(2.12) 


y  = 


X 

y  + 1 


gives  the  slope  of  the  orthogonal  trajectories,  and  we  solve  (2.12)  to  obtain  the 
equation  of  the  orthogonal  trajectory  curves.  Now  (2.12)  is  separable;  we  obtain 


(y  +  1)  dy  =  — x  dx, 

\y2  +  y  =  -\x2  +  C, 

x2  +  y2  +  2y  =  2  C, 
x2  +  (y  +  l)2  =  2  C  +  1. 


This  is,  as  we  expected,  a  family  of  circles  with  centers  at  the  point  (0,  — 1). 


Nonlinear  Differential  Equations  We  have  said  that  for  linear  differential 
equations  of  order  n  there  is  always  a  general  solution  containing  n  independent 
constants,  and  all  solutions  can  be  obtained  by  specializing  the  constants.  You 
should  be  aware  that  this  may  not  be  true  for  some  nonlinear  equations,  and  routine 
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methods  of  solution  (including  computer)  may  sometimes  give  partially  incorrect  or 
incomplete  solutions.  It  is  beyond  our  scope  to  discuss  this  in  detail  (see  differential 
equations  books),  but  here  are  some  examples. 


Example  3.  Solve  the  differential  equation  y'  =  \J  1  —  y2  and  computer  plot  the  slope 
field  and  a  set  of  solution  curves.  Find  particular  solutions  satisfying  (a)  y  =  0 
when  x  =  0,  and  (b)  y  =  1  when  x  =  0. 

We  separate  variables  and  integrate  to  get 

dy  .  .  .  . 

j :  =  ax,  arcsmy  =  x  +  a,  y  =  sm(i  +  a). 

v  i —  y2 
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Figure  2.2 

A  computer  gives  the  same  answer.  However  if  we  look  at  either  a  computer  plot 
of  the  slope  field  (Figure  2.2),  or  the  differential  equation  itself,  we  see  that  the 
slope  y'  is  always  non-negative.  Thus  the  solution  of  the  given  differential  equation 
includes  only  the  parts  of  the  sine  curves  with  non-negative  slopes  (Figure  2.2). 
A  second  difficulty  is  that  part  of  the  solution  is  missing.  From  either  the  slope 
field  (Figure  2.2)  or  directly  from  the  differential  equation  we  can  see  that  y  =  1 
and  y  =  —  1  are  solutions  not  obtainable  from  the  sine  solution  by  any  choice  of 
a.  (These  are  sometimes  called  singular  solutions.)  The  fact  that  we  did  not  find 
these  solutions  by  separation  of  variables  should  not  surprise  us  when  we  note  that 
in  separating  variables  we  divided  by  \/\  —  y2  and  this  step  is  not  valid  if  y2  =  1. 

Now  for  the  particular  solution  (a)  passing  through  (0,  0),  the  sine  solution  gives 
either  y  =  sin  a;  or  y  =  sin(a;  +  n)  =  —  sin  a:.  But  since  we  know  that  y'  is  non¬ 
negative,  only  the  y  =  sin  2;  solution  is  correct  in  the  vicinity  of  x  =  0.  In  fact 
(Figure  2.2),  y  =  sin  a:  is  a  correct  particular  solution  from  x  =  —tt/2  to  x  =  7r/2. 
We  could  construct  a  continuous  solution  from  —00  to  00  by  letting  y  =  —  1  from 
—00  to  — 7t/2,  y  =  sin  x  from  — 7t/2  to  7r/2,  and  y  =  1  from  7t/2  to  00.  Thus  for  case 
(a)  [solution  passing  through  the  origin]  we  find  just  one  particular  solution. 

For  particular  solution  (b)  [passing  through  (0, 1)],  we  find  either  y  =  sin(a:-|-§)  = 
cos  2:,  or  y  =  1;  the  cos  a:  solution  is  valid  from  x  =  —  7r  to  x  =  0.  As  in  (a),  we 
can  extend  it  by  using  parts  of  y  —  —1  and  y  =  1;  this  is  one  particular  solution. 
But  there  are  an  infinite  number  of  other  particular  solutions  passing  through  (0, 1) 
obtained  by  moving  this  one  solution  any  distance  to  the  left  (Figure  2.2). 
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PROBLEMS,  SECTION  2 


For  each  of  the  following  differential  equations,  separate  variables  and  find  a  solution 
containing  one  arbitrary  constant.  Then  find  the  value  of  the  constant  to  give  a  particular 
solution  satisfying  the  given  boundary  condition.  Computer  plot  a  slope  field  and  some  of 


the  solution  curves. 

1. 

Pi 

II 

"pi 

y  = 

3  when  x  = 

2. 

2. 

Xy/l  —  y2  dx  +  yV  1  —  x2  dy  =  0, 

V  = 

|  when  x  = 

1 

2  ’ 

3. 

y'  sin®  =  t/lny, 

y  = 

e  when  x  = 

7r/3, 

4. 

(1  +  y2)  dx  +  xy  dy  =  0, 

y  = 

0  when  x  - 

5. 

5. 

Pi 

II 

Pi 

1 

"Pi 

y  = 

1  when  x  = 

1. 

6. 

,  2  xy2  +  x 

y  o  ’ 

y  = 

0  when  x  -- 

y/2. 

x2y-y 

7. 

y  dy  +  ( xy 2  —  8a;)  dx  =  0, 

y  = 

3  when  x  = 

1. 

8. 

y'  +  2  xy2  =  0, 

y  = 

1  when  x  = 

2. 

9. 

(i  +  y)y'  =  y, 

y  = 

1  when  x  = 

1. 

10. 

y'  -xy  =  x, 

y  = 

1  when  x  -- 

0. 

11. 

2y' =  3(y-2)1'3, 

y  = 

3  when  x  - 

1. 

12. 

(x  +  xy)y'  +  y  =  0, 

y  = 

1  when  x  = 

1. 

In  Problems  13  to  15,  find  a  solution  (or  solutions)  of  the  differential  equation  not  ob¬ 
tainable  by  specializing  the  constant  in  your  solution  of  the  original  problem.  Hint:  See 

Example  3. 

13.  Problem  2.  14.  Problem  8.  15.  Problem  11. 

16.  By  separation  of  variables,  find  a  solution  of  the  equation  y'  =  y Hj  containing  one 
arbitrary  constant.  Find  a  particular  solution  satisfying  y  =  0  when  x  =  0.  Show 
that  y  =  0  is  a  solution  of  the  differential  equation  which  cannot  be  obtained  by 
specializing  the  arbitrary  constant  in  your  solution  above.  Computer  plot  a  slope 
field  and  some  of  the  solution  curves.  Show  that  there  are  an  infinite  number  of 
solution  curves  passing  through  any  point  on  the  x  axis,  but  just  one  through  any 
point  for  which  y  >  0.  Hint:  See  Example  3.  Problems  17  and  18  are  physical 
problems  leading  to  this  differential  equation. 

17.  The  speed  of  a  particle  on  the  x  axis,  x  >  0,  is  always  numerically  equal  to  the 
square  root  of  its  displacement  x.  If  x  =  0  when  t  =  0,  End  i  as  a  function  of  t. 
Show  that  the  given  conditions  are  satisfied  if  the  particle  remains  at  the  origin  for 
any  arbitrary  length  of  time  to  and  then  moves  away;  find  x  for  t  >  to  for  this  case. 

18.  Let  the  rate  of  growth  dN/dt  of  a  colony  of  bacteria  be  proportional  to  the  square 
root  of  the  number  present  at  any  time.  If  there  are  no  bacteria  present  at  t  =  0, 
how  many  are  there  at  a  later  time?  Observe  here  that  the  routine  separation  of 
variables  solution  gives  an  unreasonable  answer,  and  the  correct  answer,  N  =  0,  is 
not  obtainable  from  the  routine  solution.  (You  have  to  think,  not  just  follow  rules!) 
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19.  (a)  Consider  a  light  beam  traveling  downward  into  the  ocean.  As  the  beam  pro¬ 

gresses,  it  is  partially  absorbed  and  its  intensity  decreases.  The  rate  at  which 
the  intensity  is  decreasing  with  depth  at  any  point  is  proportional  to  the  in¬ 
tensity  at  that  depth.  The  proportionality  constant  p  is  called  the  linear 
absorption  coefficient.  Show  that  if  the  intensity  at  the  surface  is  Jo,  the  in¬ 
tensity  at  a  distance  s  below  the  surface  is  I  =  7oe_MS.  The  linear  absorption 
coefficient  for  water  is  of  the  order  of  10-2  ft-1  (the  exact  value  depending  on 
the  wavelength  of  the  light  and  the  impurities  in  the  water).  For  this  value 
of  p,  find  the  intensity  as  a  fraction  of  the  surface  intensity  at  a  depth  of  1  ft, 
50  ft,  500  ft,  1  mile.  When  the  intensity  of  a  light  beam  has  been  reduced 
to  half  its  surface  intensity  (7  =  \lo),  the  distance  the  light  has  penetrated 
into  the  absorbing  substance  is  called  the  half-value  thickness  of  the  substance. 
Find  the  half-value  thickness  in  terms  of  p.  Find  the  half-value  thickness  for 
water  for  the  value  of  p  given  above. 

(b)  Note  that  the  differential  equation  and  its  solution  in  this  problem  are  mathe¬ 
matically  the  same  as  those  in  Example  1,  although  the  physical  problem  and 
the  terminology  are  different.  In  discussing  radioactive  decay,  we  call  A  the 
decay  constant,  and  we  define  the  half-life  T  of  a  radioactive  substance  as  the 
time  when  N  =  ^No  (compare  half- value  thickness).  Find  the  relation  between 
A  and  T. 

20.  Consider  the  following  special  cases  of  the  simple  series  circuit  [Figure  1.1  and  equa¬ 
tion  (1.2)]. 

(a)  RC  circuit  (that  is,  L  =  0)  with  V  =  0;  find  q  as  a  function  of  t  if  qo  is  the 
charge  on  the  capacitor  at  t  —  0. 

(b)  RL  circuit  (that  is,  no  capacitor;  this  means  1/C  =  0)  with  V  =  0;  find  I(t) 
given  I  =  7o  at  t  =  0. 

(c)  Again  note  that  these  are  the  same  differential  equations  as  in  Problem  19  and 
Example  1.  The  terminology  is  again  different;  we  define  the  time  constant  r 
for  a  circuit  as  the  time  required  for  the  charge  (or  current)  to  fall  to  1/e  times 
its  initial  value.  Find  the  time  constant  for  the  circuits  (a)  and  (b).  If  the 
same  equation,  say  y  =  yoe~at ,  represented  either  radioactive  decay  or  light 
absorption  or  an  RC  or  RL  circuit,  what  would  be  the  relations  among  the 
half-life,  the  half-value  thickness,  and  the  time  constant? 

21.  Suppose  the  rate  at  which  bacteria  in  a  culture  grow  is  proportional  to  the  number 
present  at  any  time.  Write  and  solve  the  differential  equation  for  the  number  N  of 
bacteria  as  a  function  of  time  t  if  there  are  No  bacteria  when  t  =  0.  Again  note  that 
(except  for  a  change  of  sign)  this  is  the  same  differential  equation  and  solution  as  in 
the  preceding  problems. 

22.  Solve  the  equation  for  the  rate  of  growth  of  bacteria  if  the  rate  of  increase  is  pro¬ 
portional  to  the  number  present  but  the  population  is  being  reduced  at  a  constant 
rate  by  the  removal  of  bacteria  for  experimental  purposes. 

23.  Heat  is  escaping  at  a  constant  rate  [dQ/dt  in  (1.1)  is  constant]  through  the  walls 
of  a  long  cylindrical  pipe.  Find  the  temperature  T  at  a  distance  r  from  the  axis  of 
the  cylinder  if  the  inside  wall  has  radius  r  =  1  and  temperature  T  =  100  and  the 
outside  wall  has  r  —  2  and  T  =  0. 

24.  Do  Problem  23  for  a  spherical  cavity  containing  a  constant  source  of  heat.  Use  the 
same  radii  and  temperatures  as  in  Problem  23. 
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25.  Show  that  the  thickness  of  the  ice  on  a  lake  increases  with  the  square  root  of  the 
time  in  cold  weather,  making  the  following  simplifying  assumptions.  Let  the  water 
temperature  be  a  constant  10°C,  the  air  temperature  a  constant  —10°,  and  assume 
that  at  any  given  time  the  ice  forms  a  slab  of  uniform  thickness  x.  The  rate  of 
formation  of  ice  is  proportional  to  the  rate  at  which  heat  is  transferred  from  the 
water  to  the  air.  Let  t  =  0  when  x  =  0. 

26.  An  object  of  mass  m  falls  from  rest  under  gravity  subject  to  an  air  resistance  pro¬ 
portional  to  its  speed.  Taking  the  y  axis  as  positive  down,  show  that  the  differential 
equation  of  motion  is  m(dv/dt)  =  mg  —  kv,  where  k  is  a  positive  constant.  Find  v 
as  a  function  of  t,  and  find  the  limiting  value  of  v  as  t  tends  to  infinity;  this  limit 
is  called  the  terminal  speed.  Can  you  find  the  terminal  speed  directly  from  the  dif¬ 
ferential  equation  without  solving  it?  Hint:  What  is  dv/dt  after  v  has  reached  an 
essentially  constant  value? 

Consider  the  following  specific  examples  of  this  problem. 

(a)  A  person  drops  from  an  airplane  with  a  parachute.  Find  a  reasonable  value 
of  k. 

(b)  In  the  Millikan  oil  drop  experiment  to  measure  the  charge  of  an  electron,  tiny 
electrically  charged  drops  of  oil  fall  through  air  under  gravity  or  rise  under  the 
combination  of  gravity  and  an  electric  held.  Measurements  can  be  made  only 
after  they  have  reached  terminal  speed.  Find  a  formula  for  the  time  required 
for  a  drop  starting  at  rest  to  reach  99%  of  its  terminal  speed. 

27.  According  to  Newton’s  law  of  cooling,  the  rate  at  which  the  temperature  of  an 
object  changes  is  proportional  to  the  difference  between  its  temperature  and  that 
of  its  surroundings.  A  cup  of  coffee  at  200°  in  a  room  of  temperature  70°  is  stirred 
continually  and  reaches  100°  after  10  min.  At  what  time  was  it  at  120°? 

28.  A  glass  of  milk  at  38°  is  removed  from  the  refrigerator  and  left  in  a  room  at  tem¬ 
perature  70°.  If  the  temperature  of  the  milk  is  54°  after  10  min,  what  will  its 
temperature  be  in  half  an  hour?  (See  Problem  27.) 

29.  A  solution  containing  90%  by  volume  of  alcohol  (in  water)  runs  at  1  gal/min  into  a 
100-gal  tank  of  pure  water  where  it  is  continually  mixed.  The  mixture  is  withdrawn 
at  the  rate  of  1  gal/min.  When  will  it  start  coining  out  50%  alcohol? 

30.  If  P  dollars  are  left  in  the  bank  at  interest  /  percent  per  year  compounded  contin¬ 
uously,  find  the  amount  A  at  time  t.  Hint:  Find  dA,  the  interest  on  A  dollars  for 
time  dt. 

Find  the  orthogonal  trajectories  of  each  of  the  following  families  of  curves.  In  each  case, 
sketch  or  computer  plot  several  of  the  given  curves  and  several  of  their  orthogonal  trajec¬ 
tories.  Be  careful  to  eliminate  the  constant  from  y '  for  the  original  curves;  this  constant 
takes  different  values  for  different  curves  of  the  original  family,  and  you  want  an  expression 
for  y'  which  is  valid  for  all  curves  of  the  family  crossed  by  the  orthogonal  trajectory  you 
are  trying  to  find.  See  equations  (2.10)  to  (2.12). 

31.  x2  +  y2  =  const. 

32.  y  =  kx2 . 

33.  y  =  kxn.  (Assume  that  n  is  a  given  number;  the  different  curves  of  the  family  have 
different  values  of  k.) 

34.  xy  =  k. 

35.  (y  —  l)2  =  x2  +  k. 
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3.  LINEAR  FIRST-ORDER  EQUATIONS 

A  first-order  equation  contains  y'  but  no  higher  derivatives.  A  linear  first-order 
equation  means  one  which  can  be  written  in  the  form 

(3.1)  y'  +  Py  =  Q , 


where  P  and  Q  are  functions  of  x.  To  see  how  to  solve  (3.1),  let  us  first  consider 
the  simpler  equation  when  Q  =  0.  The  equation 

(3.2)  y'  +  Py  =  0  or  ^  =  —  Py 

ax 

is  separable.  As  in  Section  2,  we  obtain  the  solution  as  follows: 


(3.3) 


*  =  -Pdx, 

V 

In  y  =  —  J  P  dx  +  C, 
y  =  e~SPdx+c  =  Ae~fpdx 


where  A  =  ec .  Let  us  simplify  the  notation  for  future  use;  we  write 


(3.4) 


I  = 


/ 


P  dx. 


Then 

(3.5) 


and  we  can  write  (3.3)  as  y  =  Ae  1  or 

(3.6)  ye1  =  A. 


We  can  now  see  how  to  solve  (3.1).  If  we  differentiate  (3.6)  with  respect  to  x 
and  use  (3.5),  we  get 

(3.7)  -y-(ye7)  =  y'e1  +  ye1^-  =  y'e1  +  ye1  P  =  e\y'  +  Py), 

dx  dx 

which  is  the  left-hand  side  of  (3.1)  multiplied  by  e1 .  (We  call  e1  an  integrating 
factor — see  Section  4.)  Thus,  we  can  write  (3.1)  (times  e1)  as 

(3.8)  ^( yeJ )  =  e\y  +  Py)  =  Qe1 . 

Since  Q  and  e1  are  functions  of  x  only,  we  can  now  integrate  both  sides  of  (3.8) 
with  respect  to  x  to  get 
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This  is  the  general  solution  of  (3.1).  Note  that  it  contains  one  arbitrary  constant 
as  expected  for  a  first-order  linear  equation.  The  term  ce is  a  solution  of  equa¬ 
tion  (3.2);  the  first  term  in  y  is  one  particular  solution  of  (3.1).  Borrowing  notation 
which  we  shall  use  in  Section  6,  let’s  call  the  term  ce~4  =  yc  and  the  particular 
solution  =  yp.  Then  yp  +  yc  is  a  solution  of  (3.1)  for  any  value  of  c.  Also  note 
that  ypeT  =  f  Qe1  dx  is  an  indefinite  integral  which,  as  we  know  (see  Chapter  5, 
Section  1),  has  infinitely  many  answers  differing  from  each  other  by  constants  of 
integration.  Thus  the  particular  solution  obtained  by  you  and  by  your  computer 
may  not  be  the  same  (see  Example  1  and  Problems). 

Example  1.  Solve  (1  +  x2)y'  +  6 xy  =  2x.  In  the  form  of  (3.1),  this  is 


6x 


2x 


1  +  x- 


■y  = 


1  +  x2 


From  (3.9),  we  get 


1  = 


6x 


■  dx  =  31n(l  +  x2) 


j  1  +  x2 
I  =  e31n(l+x2)  =  (1  +  x2)3 


■ye1  = 


J  — — j(l  +  x2)3  dx  =  J  2x(l  +  x2)2  dx  =  ^(1  +  x2)3  +  c 


y  =  \ 


3  (1  +  x2y  ' 

A  computer  gives  the  answer 

V  = 


3x2  +  3x4  +  x6 


A 


3(1  +  x2)3  (1  +  a;2)3' 


Let  us  show  that  the  answers  agree  (see  comments  just  after  (3.9)).  If  we  put 
A  =  c  +  1/3  in  the  computer  solution  above  and  combine  terms,  we  get 


y  = 


3x 2  +  3x4  +  x6  +  1 
3(1  +  a;2)3 


(1  +  cc2) 


2\3 


(1  +  a;2)3  3(1  +  a;2)3  (1  +  a:2)3’ 


which,  after  cancelling,  is  our  solution  above.  We  see  that  the  computer  program 
chose  a  more  complicated  particular  solution  yp  which  differed  from  our  yp  by  a 
multiple  of  yc  =  1/(1  +  a:2)3.  Always  be  aware  of  the  possibility  of  simplifying  a 
particular  solution  by  adding  a  multiple  of  yc. 


Example  2.  Radium  decays  to  radon  which  decays  to  polonium.  If  at  t  =  0,  a  sample  is 
pure  radium,  how  much  radon  does  it  contain  at  time  f? 

Let  No  =  number  of  radium  atoms  at  t  =  0, 

Ai  =  number  of  radium  atoms  at  time  t, 

N2  =  number  of  radon  atoms  at  time  t, 

Ai  and  A2  =  decay  constants  for  Ra  and  Rn. 

As  in  Section  2,  we  have  for  radium 
dN 1 


dt 


—  N\  —  Noe 


—  Ait 
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The  rate  at  which  radon  is  being  created  is  the  rate  at  which  radium  is  decaying, 
namely  A1.ZV1  or  Ai N0e~Xlt.  But  the  radon  is  also  decaying  at  the  rate  A2-/V2.  Hence, 
we  have 

=  AiiVi  —  A2-/V2,  or 
at 

—7- — h  A2-/V2  =  A1./V1  =  AiA^0e  ^lt. 
at 

This  equation  is  of  the  form  (3.1),  and  we  solve  it  as  follows: 


PROBLEMS,  SECTION  3 

Using  (3.9),  find  the  general  solution  of  each  of  the  following  differential  equations.  Com¬ 
pare  a  computer  solution  and,  if  necessary,  reconcile  it  with  yours.  Hint:  See  comments 
just  after  (3.9),  and  Example  1. 


1. 

/  -  X 

V  +y  =  e 

2. 

x2y  +  3  xy  =  1 

3. 

2 

dy  +  (2 xy  —  xe  )  dx 

=  0 

4. 

2  xy'  +  y  =  2®5/2 

5. 

y  cos  x  +  y  =  cos2  x 

6. 

y  +  y/V  x2  + 1  =  l/(*  +  V  x2  + 1 

7. 

(1  +  ex)y'  +  2ex  y  =  (1 

1  X\  X 

+  e  )e 

8. 

(xln  x)y  +  y  =  In  a: 

9. 

(1  -  x2)y  =  xy  +  2 x\J 

1  —  X2 

10. 

y1  +  y  tanh  x  =  2ex 

12. 

dx 

11. 

y  +  y  cos  x  =  sin  2x 

—  =  cos  y  —  x  tan  y 

13. 

dx  +  (x  —  ev)  dy  =  0 

14. 

dy  3  y 

dx  3?/2/3  —  x 

Hint: 

For  Problems  12  to  14 

,  solve  for  x 

in  terms 

of  y. 
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15.  Water  with  a  small  salt  content  (5  lb  in  1000  gal)  is  flowing  into  a  very  salty  lake 

at  the  rate  of  4  •  105  gal  per  hr.  The  salty  water  is  flowing  out  at  the  rate  of  10s 

gal  per  hr.  If  at  some  time  (say  t  =  0)  the  volume  of  the  lake  is  109  gal,  and  its 

salt  content  is  107  lb,  find  the  salt  content  at  time  t.  Assume  that  the  salt  is  mixed 
uniformly  with  the  water  in  the  lake  at  all  times. 

16.  Find  the  general  solution  of  (1.2)  for  an  RL  circuit  (1/C  =  0)  with  V  =  Vo  cos  u>t 
(lo  =  const.). 

17.  Find  the  general  solution  of  (1.3)  for  an  RC  circuit  ( L  =  0),  with  V  =  Vo  coscut. 

18.  Do  Problems  16  and  17  using  V  =  Voe1"*,  and  find  the  solutions  for  16  and  17  by 

taking  real  parts  of  the  complex  solutions. 

19.  If  Ai  =  A2  =  A  in  (3.10),  then  f  dt  =  J  dt.  Find  N2  for  this  case. 

20.  Extend  the  radioactive  decay  problem  (Example  2)  one  more  stage,  that  is,  let  A3 
be  the  decay  constant  of  polonium  and  find  how  much  polonium  there  is  at  time  t. 

21.  Generalize  Problem  20  to  any  number  of  stages. 

22.  Find  the  orthogonal  trajectories  of  the  family  of  curves  x  =  y  +  1  +  cev .  (See  the 
instructions  above  Problem  2.31.) 

2  2 

23.  Find  the  orthogonal  trajectories  of  the  family  of  curves  y  =  —ex  erf  x  +  Cex  .  Hint: 
See  Chapter  11,  equation  (9.1)  for  definition  of  erf  x,  and  Chapter  4,  Section  12,  for 
differentiation  of  an  integral.  Solve  for  x  in  terms  of  y. 

4.  OTHER  METHODS  FOR  FIRST-ORDER  EQUATIONS 

Separable  equations  and  linear  equations  are  the  two  types  of  first-order  equations 
you  are  most  apt  to  meet  in  elementary  applications.  However,  we  shall  also  mention 
briefly  a  few  other  methods  of  solving  special  first-order  equations.  You  will  find 
more  details  in  the  problems  and  in  most  differential  equations  books. 

The  Bernoulli  Equation  The  differential  equation 

(4.1)  y’  +  Py  =  Qyn, 

where  P  and  Q  are  functions  of  x,  is  known  as  the  Bernoulli  equation.  It  is  not 
linear  but  is  easily  reduced  to  a  linear  equation.  We  make  the  change  of  variable 

(4.2)  *  =  y1~n- 
Then 

(4.3)  z'  =  (1  -  n)y~ny' . 

Next  multiply  (4.1)  by  (1  —  n)y~n  and  make  the  substitutions  (4.2)  and  (4.3)  to  get 

(1  -  n)y~ny'  +  (1  -  n)Py1~n=  (1  -  ri)Q, 
z'  +  (1  —  n)Pz  =  (1  —  n)Q. 

This  is  now  a  first-order  linear  equation  which  we  can  solve  as  we  did  the  linear 
equations  above.  (See  Section  7  for  an  example  of  a  physical  problem  in  which  we 
need  to  solve  a  Bernoulli  equation.) 
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Exact  Equations;  Integrating  Factors  Recall  from  Chapter  6,  Section  8,  that 
the  expression  P(x ,  y)  dx+Q(x,  y)  dy  is  an  exact  differential  [that  is,  the  differential 
of  a  function  F(x,  y)\  if 


(4.4) 


dP__dQ 
dy  dx 


If  (4.4)  holds,  then  there  is  a  function  F(x,y )  such  that 


(4.5) 


p  =  Q 


-7— ,  P  dx  +  Q  dy  =  dF. 
dy 


In  Chapter  6  we  considered  ways  of  finding  F  when  (4.4)  holds.  The  differential 
equation 


(4.6) 


P  dx  +  Q  dy  =  0 


or 


y'  = 


P 

Q 


is  called  exact  if  (4.4)  holds.  In  this  case 


P  dx  +  Q  dy  =  dF  =  0, 
and  the  solution  of  (4.6)  is  then 
(4.7)  F(x,y)  =  const. 


We  find  F  as  in  Chapter  6,  Section  8. 

An  equation  which  is  not  exact  may  often  be  made  exact  by  multiplying  it  by 
an  appropriate  factor. 


Example  1.  The  equation 

(4.8)  xdy  —  ydx  =  0 
is  not  exact  [by  (4.4)].  But  the  equation 

xdy~ydx  1  ,  y  ,  ,/V\ 

(4.9)  y  y  =-dy--Z5dx  =  d(H-)=  0, 

Xz  X  xz  \x / 

obtained  by  dividing  (4.8)  by  x2,  is  exact  [use  (4.4)],  and  its  solution  is 

(4.10)  —  =  const. 

x 

We  multiplied  (4.8)  by  1/x2  to  make  the  equation  exact;  the  factor  1/x 2  is  called 
an  integrating  factor.  To  see  another  example  of  an  integrating  factor,  look  back  at 
Section  3.  The  expression  e1  is  an  integrating  factor  for  equations  (3.1)  and  (3.2); 
as  you  can  see  in  (3.8),  multiplying  (3.1)  by  e1  makes  it  an  exact  equation. 

The  method  of  finding  an  integrating  factor  and  solving  the  resulting  exact 
equation  is  useful  mainly  in  simple  cases  when  we  can  see  the  result  by  inspection. 
It  is  not  usually  worth  while  to  spend  much  time  searching  for  integrating  factors. 
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Homogeneous  Equations  A  homogeneous  function  of  x  and  y  of  degree  n  means 
a  function  which  can  be  written  as  xn  f  (y/x).  For  example,  x3  —xy2  =  ;r3[l  —  (y/x)2] 
is  a  homogeneous  function  of  degree  3.  (Also  see  Problem  21.)  An  equation  of  the 
form 


(4.11) 


P{x,  y)  dx  +  Q(x,  y)  dy  =  0, 


where  P  and  Q  are  homogeneous  functions  of  the  same  degree  is  called  homogeneous. 
(The  term  homogeneous  is  also  used  in  another  sense;  see  Section  5.)  If  we  divide 
two  homogeneous  functions  of  the  same  degree,  the  xn  factors  cancel  and  we  have 
a  function  of  y/x.  Thus,  from  (4.11)  we  can  write 


(4.12) 


/  dy 
V=d~x 


P{x,  y )  =  ,  /y 

Q(x,y )  Vz 


and  we  can  say  that  a  differential  equation  is  homogeneous  if  it  can  be  written  as 
y'  =  a  function  of  y/x.  This  suggests  that  we  solve  homogeneous  equations  by 
making  the  change  of  variables  v  =  y/x,  or 


(4.13) 


y  =  xv. 


This  substitution  does,  in  fact,  give  us  a  separable  equation  in  x  and  v  (see  Prob¬ 
lem  22).  We  solve  it  to  find  a  relation  between  v  and  x  and  then  put  back  v  =  y/x 
to  find  the  solution  of  (4.11). 

Also  see  Problem  23  for  another  way  to  solve  homogeneous  equations. 


Change  of  Variables  We  have  solved  both  Bernoulli  equations  and  homoge¬ 
neous  equations  by  making  changes  of  variables.  Other  equations  may  yield  to  this 
method  also.  If  a  differential  equation  contains  some  combination  of  the  variables 
x,  y  (especially  if  this  combination  appears  more  than  once),  we  try  replacing  this 
combination  by  a  new  variable.  See  Problems  11,  15,  and  16  for  examples. 


PROBLEMS,  SECTION  4 

Use  the  methods  of  this  section  to  solve  the  following  differential  equations.  Compare 
computer  solutions  and  reconcile  differences. 


1. 

/  .  2/3 

y  +y  =  xy' 

2. 

y  + 

-y  =  2  x3/2y 

X 

1/2 

3. 

3  xy2y  +  3y3  =  1 

4. 

(2xe 

3y+ex)dx  + 

(3x2e3y  - 

y2)dy 

5. 

(x  -  y)  dy  +  (y  +  x  +  1)  dx  =  0 

6. 

(cos  x  cos  y  +  sin2  x)  dx  — 

(sin  x : 

sin  y  +  cos2 

y)  dy  -- 

=  0 

7. 

x2  dy  +  (y2  —  xy)  dx  =  0 

8. 

y  dy 

=  (-*  +  \fx 

2  +  y2  )  dx 

9. 

xy  dx  +  ( y 2  —  x2)  dy  =  0 

10. 

( v 2  - 

-  xy)  dx  +  (a; 

2  +  xy)  dy 

=  0 

11. 

y'  =  cos(x  +  y)  Hint: 

Let  u 

—  x  +  y\  then  u'  - 

+ 

II 

12. 

y  =  —  —  tan  — 

X  X 

13. 

yy  - 

-  2 y2  cot  x  = 

sin  x  cos  x 
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14.  (x  —  l)y'  +  y  —  x  2  +  2x  3=0  15.  xy  +  y  =  exy  Hint:  Let  u  =  xy 

16.  Solve  the  differential  equation  yy'2  +  2 xy'  —  y  =  0  by  changing  from  variables  y,  x, 
to  r,  x,  where  y2  =  r2  —  x2\  then  yy'  =  rr1  —  x. 

17.  If  an  incompressible  fluid  flows  in  a  corner  bounded  by  walls  meeting  at  the  origin  at 
an  angle  of  60° ,  the  streamlines  of  the  flow  satisfy  the  equation  2xy  dx  +  (x2  —  y2)  dy 
=  0.  Find  the  streamlines. 

18.  Find  the  family  of  orthogonal  trajectories  of  the  circles  (x  —  h)2  +  y2  =  h2 .  (See  the 
instructions  above  Problem  2.31.) 

19.  Find  the  family  of  curves  satisfying  the  differential  equation  ( x+y )  dy+  ( x—y )  dx  =  0 
and  also  find  their  orthogonal  trajectories. 

20.  Find  the  shape  of  a  mirror  which  has 
the  property  that  rays  from  a  point 
O  on  the  axis  are  reflected  into  a  par¬ 
allel  beam.  Hint:  Take  the  point  O 
at  the  origin.  Show  from  the  figure 
that  tan  29  =  y /x.  Use  the  formula 
for  tan  28  to  express  this  in  terms  of 
tan  6  =  dy/dx  and  solve  the  result¬ 
ing  differential  equation.  ( Hint :  See 
Problem  16.) 


21.  As  in  text  just  before  (4.11),  show  that 

(a)  x2  —  5 xy  +  y3/x  is  a  homogeneous  function  of  degree  2; 

(b)  x~4(y4  —  x3y)  —  xy2  sin (x/y)  is  homogeneous  of  degree  3; 

(c)  x2y3  +  x 5  In (y/x)  —  y6 / \/x2  +  y2  is  homogeneous  of  degree  5; 

(d)  x2  +  y,  x  +  cos  y,  and  y  +  1  are  not  homogeneous. 

See  Chapter  4,  Section  13,  Problem  1  for  a  more  general  definition  of  a  homogeneous 
function  of  any  number  of  variables. 

22.  Show  that  the  change  of  variables  (4.13)  in  (4.11)  or  (4.12)  gives  a  separable  equation. 
Hints:  Substitute  y  =  xv  and  dy  =  xdv  +  vdx  from  (4.13)  into  (4.12)  and  rearrange 
terms  to  get  the  equation 

(a)  lf(v)  —  u]  dx  =  x  dv. 

Alternatively,  suppose  P  and  Q  are  homogeneous  of  degree  n;  that  is  P(x ,  y)  = 
xnP(l,y/x)  =  xnP(l,v)  and  a  similar  equation  for  Q.  Substitute  these  results  and 
dy  =  xdv  +  v  dx  into  (4.11),  divide  by  xn,  and  rearrange  terms  to  get 

(b)  [P(l,  v)  +  Q(l,  v)  v]  dx  +  Q(l,  v)x  dv  =  0. 

Write  both  (a)  and  (b)  with  variables  separated. 

23.  Show  that  (xP  +  yQ)~4  is  an  integrating  factor  for  (4.11).  Hint:  You  want  to  show 
that  (P  dx  +  Q  dy)/(xP  +  yQ)  is  an  exact  differential  (see  Chapter  6,  Section  8). 
Remember  that  P  and  Q  are  homogeneous  of  the  same  degree.  Divide  numerator 
and  denominator  by  Q  and  use  P/Q  =  — f(y/x )  from  (4.12).  Now  find  the  needed 
partial  derivatives.  Comment:  If  (x  P  +  y  Q)  turns  out  to  be  very  simple,  this  may 
be  an  easier  way  to  solve  a  homogeneous  equation  than  the  v  =  y/x  substitution 
(see  Problem  24). 
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Solve  Problems  9  and  10  by  using  an  integrating  factor  as  discussed  in  Problem  23. 

An  equation  of  the  form  y'  =  f(x)y2  +  g{x)y  +  h(x)  is  called  a  Riccati  equation. 
If  we  know  one  particular  solution  yp,  then  the  substitution  y  =  yp  +  \  gives  a 
linear  first-order  equation  for  2.  We  can  solve  this  for  2  and  substitute  back  to  find 
a  solution  of  the  y  equation  containing  one  arbitrary  constant  (see  Problem  26). 
Following  this  method,  check  the  given  yp,  and  then  solve 

(a)  y  =xy  -  -y  - yp  =  — ; 

2  1 

(b)  y'  =  -  y2  +  -  y  -  2®,  yP  =  x; 

(c)  y'  =  e“V  +  y  -  e* ,  yp=e*. 

Show  that  the  substitution  given  in  Problem  25  does  in  general  give  a  solution  of 
the  Riccati  equation.  Hints:  First  show  that  the  substitution  y  =  yp  +  u  yields  the 
following  equation  for  u:  v!  —  (g  +  2fyp)u  =  fu2.  Note  by  text  equation  (4.1)  that 
this  is  a  Bernoulli  equation  with  n  =  2,  so  by  equation  (4.2)  we  let  2  =  u-1.  Show 
that  the  2  equation  is  the  linear  first-order  equation  z'  +  (g  +  2 fyp)z  =  — /.  Note 
that  we  could  have  obtained  the  2  equation  in  one  step  by  substituting  y  =  yp  +  2-1 
in  the  original  equation  as  claimed  in  Problem  25. 

5.  SECOND-ORDER  LINEAR  EQUATIONS  WITH  CONSTANT 
COEFFICIENTS  AND  ZERO  RIGHT-HAND  SIDE 

Because  of  their  importance  in  applications,  we  are  going  to  consider  carefully  the 
solution  of  differential  equations  of  the  form 

d2y  dy 

(5.1)  a2-rT  +  di— +  d0y  =  0, 

dxz  ax 

where  a 2,  a\,  a 0  are  constants;  also  we  shall  consider  (Section  6)  the  corresponding 
equation  when  the  right-hand  side  of  (5.1)  is  a  function  of  x.  Equations  of  the  form 
(5.1)  are  called  homogeneous  because  every  term  contains  y  or  a  derivative  of  y. 
Equations  of  the  form  (6.1)  are  called  inhomogeneous  because  they  contain  a  term 
which  does  not  depend  on  y.  (Note,  however,  that  this  use  of  the  term  homogeneous 
is  completely  unrelated  to  its  use  in  Section  4.)  Although  we  shall  concentrate  on 
second-order  equations,  which  are  the  ones  that  occur  most  frequently  in  applica¬ 
tions,  most  of  our  discussion  can  be  extended  immediately  to  linear  equations  of 
higher  order  with  constant  coefficients  (see  Problems  21  to  30). 

These  problems  are  pretty  simple  by  hand;  you  may  be  able  to  write  down 
answers  faster  than  you  can  type  the  problem  into  a  computer!  Remember  that  a 
computer  may  not  give  an  answer  in  the  form  you  need.  To  use  computer  solutions 
effectively,  you  need  to  know  what  to  expect,  and  you  can  learn  this  by  studying  the 
following  methods  and  doing  some  problems  by  hand.  Let  us  consider  an  equation 
of  the  form  (5.1). 


24. 

25. 


Example  1.  Solve  the  equation 

(5.2)  y"  +  5y'  +  4y  =  0. 

It  is  convenient  to  let  D  stand  for  d/dx\  then 

d 


n  dy  ' 

Dy=^-  =  y, 

ax 


D  y  = 


dx 


d2y  _ 
dx 2  V 


// 


(5.3) 


dy_ 

dx 


Section  5  Second-Order  Linear  Equations  (Constant  Coefficients,  Zero  Right-Hand  Side)  409 


Expressions  involving  D ,  such  as  D  +  1  or  D 2  +  5  D  +  4,  are  called  differential 
operators.  (See  Problem  31.)  In  this  notation  (5.2)  becomes 

(5.4)  D2y  +  5Dy  +  4y  =  0  or  {D2  +  5D  +  4)y  =  0. 

The  algebraic  expression  D2  +  5 D  +  4  can  be  factored  as  ( D  +  1)(D  +  4)  or  ( D  + 
4 ){D  +  1).  You  should  satisfy  yourself  that 

(5.5)  (D  +  1  )(£>  +  4  )y  =(D  +  4  )(D  +  1  )y  =  ( D 2  +  5  D  +  4  )y 

when  D  =  d/dx,  and,  in  fact,  that  a  similar  statement  is  true  for  ( D  —  a)(D  —  b) 
where  a  and  b  are  any  constants.  (This  is  not  necessarily  true  if  a  and  b  are  functions 
of  x:  see  Problem  31.)  Then  we  can  write  (5.2)  or  (5.4)  as 

(5.6)  (D  +  1)(£)  +  4)y  =  0  or  (D  +  4)(£>  +  l)y  =  0. 

To  solve  (5.4)  [or  (5.6)  which  is  the  same  equation  rewritten],  we  shall  first  solve 
the  simpler  equations 

(5.7)  (D  +  4)y  =  0  and  (D  +  l)y  =  0. 

These  are  separable  equations  (Section  2)  with  solutions 

(5.8)  y  =  c\e~Ax ,  y  =  c2e~x . 

Now  if  ( D  +  4 )y  =  0,  then 

(D  +  1  )(D  +  4)y  =  {D  +  1)  •  0  =  0, 

so  any  solution  of  (D  +  4 )y  =  0  is  a  solution  of  the  differential  equation  (5.6)  or 
(5.4).  Similarly,  any  solution  of  ( D  +  1  )y  =  0  is  a  solution  of  (5.6)  or  (5.4).  Since 
the  two  solutions  (5.8)  are  linearly  independent  [Problem  13;  also  see  Chapter  3, 
equation  (8.5)],  a  linear  combination  of  them  contains  two  arbitrary  constants  and 
so  is  the  general  solution.  Thus 

(5.9)  y  =  c\e~Ax  +  c2e~x 

is  the  general  solution  of  (5.4).  Note  that  we  can  think  of  the  two  solutions  e~Ax 
and  e~x  as  basis  vectors  of  a  2-dimensional  linear  vector  space  (see  Chapter  3, 
Section  14).  Then  the  general  solution  (5.9)  gives  all  the  vectors  of  that  space.  (See 
Problem  21.) 

Now  we  must  investigate  whether  we  can  solve  all  second-order  linear  equations 
with  constant  coefficients  (and  zero  right-hand  side)  by  this  method.  We  first  wrote 
the  differential  equation  using  D  for  d/dx,  and  then  factored  the  D  expression  to 
get  (5.5).  In  this  last  step,  we  treated  D  as  if  it  were  an  algebraic  letter  instead 
of  d/dx\  this  is  justified  by  checking  the  result  (5.5)  when  D  =  d/dx.  Recall  from 
algebra  that  saying  that  the  algebraic  expression  D2  +  5D  +  4  has  the  factors  (D  +  4) 
and  ( D  +  1)  is  equivalent  to  saying  that  the  quadratic  equation 

(5.10)  D2  +  5D  +  4  =  0 

has  roots  —4  and  —1.  The  equation  (5.10)  is  called  the  auxiliary  (or  characteristic) 
equation  for  the  given  differential  equation  (5.2).  From  equations  (5.6)  to  (5.9),  we 
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see  that  to  solve  a  linear  second-order  equation  with  constant  coefficients,  we  should 
first  solve  the  auxiliary  equation;  if  the  roots  of  the  auxiliary  equation  are  a  and  b 
(a  yf  b),  the  general  solution  of  the  differential  equation  is  a  linear  combination  of 
eax  and  ebx. 


(5.11)  y  =  Cieaa:  +  C2eb:i:  is  the  general  solution  of  {D  —  a){D  —  b)y  =  0,  a^b. 


(If  a  =  b,  we  get  only  one  solution  this  way;  we  shall  consider  this  case  shortly.) 
Recall  from  algebra  that  the  roots  of  a  quadratic  equation  (with  real  coefficients; 
see  Problem  19)  can  be  real  and  unequal,  real  and  equal,  or  a  complex  conjugate 
pair.  The  equation  (5.2)  which  we  have  solved  is  an  example  in  which  the  roots  are 
real  and  unequal.  Let  us  consider  the  other  two  cases. 


Equal  Roots  of  the  Auxiliary  Equation  If  the  two  roots  of  the  auxiliary 
equation  are  equal,  then  the  differential  equation  can  be  written 

(5.12)  {D-a){D-a)y  =  0, 

where  a  is  the  value  of  the  two  equal  roots.  From  our  previous  discussion  (5.5)  to 
(5.11),  we  know  that  one  solution  of  (5.12)  is  y  =  C\eax .  But  our  previous  second 
solution  y  =  C2ebx  in  (5.11)  is  not  a  second  solution  here  since  b  =  a.  To  find  the 
second  solution  for  this  case,  we  let 

(5.13)  u=  (D  —  a)y. 

Then  (5.12)  becomes 

(D  —  a)u  =  0, 

from  which  we  get 

(5.14)  u  =  Aeax. 

We  substitute  (5.14)  into  (5.13)  to  get 

(D  —  a)y  =  Aeax  or  y' ~  ay  =  Aeax . 

This  is  a  first-order  linear  equation  which  we  solve  as  in  Section  3: 


Thus 


ye 


—ax 


e-axA(,ax  dx  = 


Adx  =  Ax  +  B. 


(5.15)  y  =  {Ax  +  B)eax  is  the  general  solution  of  (5.12). 


This  is  the  general  solution  of  (5.1)  for  the  case  of  equal  roots  of  the  auxiliary  equa¬ 
tion.  The  solution  eax  we  already  know;  what  is  new  here  is  the  fact  that  xeax  is  a 
second  (linearly  independent;  see  Problem  14)  solution  of  the  differential  equation 
when  a  is  a  double  root  of  the  auxiliary  equation.  Equations  (5.11)  and  (5.15)  then 
give  the  general  solution  of  (5.1)  for  both  unequal  and  equal  roots  of  the  auxiliary 
equation. 
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Complex  Conjugate  Roots  of  the  Auxiliary  Equation  Suppose  the  roots  of 
the  auxiliary  equation  are  aizifi.  These  are  unequal  roots,  so  by  (5.11)  the  general 
solution  of  the  differential  equation  is 


(5.16)  y  =  Ae^a+i0)x  +  Be{a~^)x  =  eax(Aei0x  +  Be~i0x). 


There  are  two  other  very  useful  forms  of  (5.16).  If  we  substitute  e±i/3x  =  cos  fix  ± 
i  sin  /3x  [see  Chapter  2,  equation  (9.3)]  into  (5.16),  then  the  parenthesis  becomes  a 
linear  combination  of  sin /3a;  and  cos  fix  and  we  can  write  (5.16)  as 


(5.17) 


y  =  eax(c i  sin  /3x  +  c-i  cos  fix), 


where  Ci  and  C2  are  new  arbitrary  constants.  We  can  also  write  (5.17)  in  the  form 


(5.18) 


y  =  ceax  sin(/3a;  +  7), 


where  c  and  7  are  now  the  arbitrary  constants.  An  easy  way  to  see  that  this  is  cor¬ 
rect  is  to  expand  sin(/3a:  +  7)  by  the  trigonometric  addition  formula;  this  gives  a 
linear  combination  of  sin/3a’  and  cos  fix  as  in  (5.17).  Although  it  is  not  hard  to 
express  any  one  of  the  sets  of  arbitrary  constants  [A,  B  in  (5.16);  ci,  C2  in  (5.17); 
and  c,  7  in  (5.18)]  in  terms  of  either  of  the  other  sets,  there  is  seldom  any  need  to  do 
this.  In  solving  actual  problems  we  simply  write  whichever  one  of  the  three  forms 
seems  best  for  the  problem  at  hand  and  then  determine  the  arbitrary  constants  in 
that  form  from  the  given  data. 


Example  2.  Solve  the  differential  equation 

(5.19)  y"  —  6y'  +  9y  =  0. 

We  can  write  the  equation  as 

(5.20)  {D2  -  6D  +  9)y  =  0  or  (D  -  3)(L>  -  3)y  =  0. 

Since  the  roots  of  the  auxiliary  equation  are  equal,  we  know  that  the  solution  is  of 
the  form  (5.15)  and  we  simply  write  the  result 


(5.21) 


y  =  {Ax  +  B)e3x. 
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Example  3.  In  Section  16,  Chapter  2,  we  discussed  the  differential  equation  for  the  motion 
of  a  mass  m  oscillating  at  the  end  of  a  spring,  and  we  solved  it  by  guessing  the 
solution.  Now  let’s  solve  it  by  the  methods  of  this  chapter.  The  differential  equation 
is  [see  Chapter  2,  equation  (16.21)] 

(r  00n  d2y  I  d2y  k  2  ■(  2  k 

(5.22)  m-77y  =  ~kv  or  ~u2  =  — y  =  y  w  =  —  • 

dtz  dtz  m  m 

We  can  write  this  differential  equation  as 

(5.23)  D2y  +  uj2y  =  0  or  (D2  +  u2)y  =  0 

where  D  =  d/dt.  The  roots  of  the  auxiliary  equation  are  D  =  ±iu;  the  solution 
may  be  written  in  any  of  the  three  forms,  (5.16),  (5.17),  or  (5.18): 

y  =  Aeiwt  +  Be~iwt 

(5.24)  =  Ci  sin  u)t  +  C2  cos  u>t 

=  csin(a4  +  7). 

An  object  whose  displacement  from  equilibrium  satisfies  (5.22)  or  (5.24)  is  said  to 
be  executing  simple  harmonic  motion.  (Recall  Chapter  7,  Section  2.) 

Equations  (5.24)  are  general  solutions  of  (5.22),  each  containing  two  arbitrary 
constants.  Let  us  find  a  particular  solution  corresponding  to  given  initial  condi¬ 
tions. 

Example  4.  Suppose  the  mass  is  held  at  rest  at  a  distance  10  cm  below  equilibrium  and 
then  suddenly  let  go.  If  we  agree  to  call  y  positive  when  m  is  above  the  equilibrium 
position,  then  at  t  =  0,  we  have  y  =  —10,  and  dy/dt  =  0.  Using  the  second  solution 
in  (5.24),  we  get 

dy 

—  =  C\oj  cos  tot  —  C2wsinwt, 
dt 

so  the  initial  conditions  give 


—  10  =  Ci  •  0  +  c2  •  1, 

0  =  ci  a;  •  1  —  C20J  ■  0. 


Thus  we  find 


ci  =  0,  c2  =  —10, 


and  the  particular  solution  we  wanted  is 


(5.25)  y  =  —  lOcoswf. 

You  can  verify  that  either  of  the  other  solutions  in  (5.24)  gives  the  same  particular 
solution  (5.25)  for  the  same  initial  conditions  (Problem  32). 

This  solution  is  pretty  unrealistic  from  the  practical  viewpoint.  Equations  (5.24) 
and  (5.25)  imply  that  the  mass  to,  once  started,  will  simply  oscillate  up  and  down 
forever!  This  is  certainly  not  true;  what  will  happen  is  that  the  oscillations  will 
gradually  die  down.  The  reason  for  the  discrepancy  between  the  physical  facts  and 
our  mathematical  answer  is  that  we  have  neglected  “friction”  forces. 
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Example  5.  A  fairly  reasonable  assumption  for  this  problem  and  many  other  similar  ones 
is  that  there  is  a  retarding  force  proportional  to  the  velocity;  let  us  call  this  force 
—l(dy/dt)  ( l  >  0).  Then  (5.22),  revised  to  include  this  force,  becomes 


(5.26) 

d2y  ,  , dy 

m——r  =  —ky  —  L  — 

dt 2  y  dt 

(i>  o) 

or  with  the  abbreviations 

9  k  „ ,  l 

w2  =  — ,  2  b—  — 

m  m 

(b  >  0) 

it  is 

(5.27) 

d2y  ,  o udy  j  2 
_  +  26_+wy 

=  0. 

To  solve  (5.27),  we  find  the  roots  of  the  auxiliary  equation 
(5.28)  D2  +  2bD  +  to2  =  0, 


which  are 


(5.29)  D=  2b±y/^b2  4u;2  =-b±Vb2-u>2. 

There  are  three  possible  types  of  answer  here  depending  on  the  relative  size  of 
b 2  and  u>2,  and  there  are  three  special  names  given  to  the  corresponding  types  of 
motion.  We  say  that  the  motion  is 


overdamped  if 
critically  damped  if 
underdamped  or  oscillatory  if 


b 2  >  w2, 
b2  =  W2, 
b2  <  u2. 


Let  us  discuss  the  corresponding  general  solutions  of  the  differential  equation  for 
the  three  cases. 


Overdamped  Motion  Since  \Jb2  —  to2  is  real  and  less  than  b ,  both  roots  of  the 
auxiliary  equation  are  negative,  and  the  general  solution  is  a  linear  combination  of 
two  negative  exponentials: 


(5.30)  y  =  Ae~xt  +  Be-^,  where  |A 

V  ;  \n  =  b-  Vb2  -lu2. 

Critically  Damped  Motion  Since  b  =  u>,  the  auxiliary  equation  has  equal  roots 
and  the  general  solution  is 

(5.31)  y  =  (A  +  Bt)e~bt . 

In  both  overdamped  and  critically  damped  motion,  the  mass  m  is  subject  to  such 
a  large  retarding  force  that  it  slows  down  and  returns  to  equilibrium  rather  than 
oscillating  repeatedly. 
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Underdamped  or  Oscillatory  Motion  In  this  case  b 2  <  u>2  so  \Jb2  —  to2  is 
imaginary.  Let  (3  =  V 'u2  —  b2-  then  \/b2  —  u> 2  =  i/3  and  the  roots  (5.29)  of  the 
auxiliary  equation  are  —6  ±  i[3.  The  general  solution  in  the  form  (5.17)  is  then 

(5.32)  y  =  e~bt{Asm/3t  +  B  cos /3t) 


This  result  is  more  in  accord  with  what  we  know  actually  happens  to  the  mass 
m;  because  of  the  factor  e~bt,  the  oscillations  in  this  case  decrease  in  amplitude 
as  time  goes  on.  Also  note  that  the  frequency  of  the  damped  vibrations,  namely 
/3  =  \Ju)2  —  b2,  is  less  than  the  frequency  to  of  the  undamped  vibrations. 


Although  we  have  stated  a  rather  special  physical  problem,  the  mathematics  we 
have  just  discussed  applies  to  a  great  variety  of  problems.  First,  there  are  many 
kinds  of  mechanical  vibrations  besides  a  mass  attached  to  a  spring.  Think  of  a 
tuning  fork,  a  pendulum,  the  needle  on  the  scale  of  a  measuring  device,  and  as 
more  involved  examples,  the  vibrations  of  complicated  structures  such  as  bridges 
or  airplanes,  and  the  vibrations  of  atoms  in  a  crystal  lattice.  In  such  problems,  we 
need  to  solve  differential  equations  similar  to  the  ones  we  have  discussed.  Differential 
equations  of  the  same  form  arise  in  electricity.  Consider  equations  (1.2)  and  (1.3) 
when  V  =  0.  Remembering  that  I  =  dq/dt,  we  can  write  (1.2)  as 


(5.33) 

and  (1.3)  as 


rd2q  ,  udq  ,  1  _  n 

Ld^  +  Rtt  +  cq~° 


(5.34) 


d2I 
dt, 2 


R 


dt,+C 


1  =  0. 


Both  these  equations  are  of  the  form  (5.27)  which  we  have  solved.  Thus  there  is  an 
analogy  between  a  series  circuit  and  the  motion  of  a  mass  to  described  by  (5.26); 
L  corresponds  to  to,  R  to  the  “friction”  constant  l,  and  1/C  to  the  spring  constant  k. 


PROBLEMS,  SECTION  5 

Solve  the  following  differential  equations  by  the  methods  discussed  above  and  compare 
computer  solutions. 


1. 

+ 

1 

to 

o 

2. 

// 

y  - 

-  4 y'  +  4y  =  0 

3. 

y"  +  9y  =  o 

4. 

n  . 

y  3 

-2y  +2y  =  0 

5. 

(D2  -2D  +  1  )y  =  0 

6. 

(D2 

+  16  )y  =  0 

7. 

(D2  -5 D  +  6 )y  =  0 

8. 

D{D  +  5)y  =  0 

9. 

(D2  -4 D  +  13 )y  =  0 

10. 

// 

y  - 

-  2y'  =  0 

11. 

4  y"  +  12  y'  +  9  =  0 

12. 

{2D 

2+D-l)y  = 

Recall  from  Chapter  3,  equation  (8.5),  that  a  set  of  functions  is  linearly  independent  if 
their  Wronskian  is  not  identically  zero.  Calculate  the  Wronskian  of  each  of  the  following 
sets  to  show  that  in  each  case  they  are  linearly  independent.  For  each  set,  write  the 
differential  equation  of  which  they  are  solutions.  Also  note  that  each  set  of  functions  is  a 
set  of  basis  functions  for  a  linear  vector  space  (see  Chapter  3,  Section  14,  Example  2)  and 
that  the  general  solution  of  the  differential  equation  gives  all  vectors  of  the  vector  space. 
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13. 

—x  —Ax 

e  ,  e 

14. 

eax ,  ebx,  a  ^  b  (a,  b,  real  or  complex) 

15. 

ax  ax 

e  ,  xe 

16. 

sin  / 3x ,  cos  f3x 

17. 

i  2 

1,  X,  X 

18. 

ax  ax  2  ax 

e  ,xe  ,x  e 

19. 

Solve  the  algebraic  equation 

D2  +  (1  +  2i)D  +  i  —  1  =  0 


(note  the  complex  coefficients)  and  observe  that  the  roots  are  complex  but  not 
complex  conjugates.  Show  that  the  method  of  solution  of  (5.6)  (case  of  unequal 
roots)  is  correct  here,  and  so  find  the  general  solution  of 

y"  +  (1  +  2  i)y  +(i-  1  )y  =  0. 


20.  As  in  Problem  19,  solve  y"  +  (1  —  i)y'  —  iy  =  0.  Hint:  See  Chapter  2,  Section  10, 
for  a  method  of  finding  the  square  root  of  a  complex  number. 

21.  By  the  method  used  in  solving  (5.4)  to  get  (5.9),  show  that  the  solution  of  the 
third-order  equation 

(D  —  a)(D  —  b)(D  —  c)y  =  0 
is 

ax  ,  bx  .  cx 

y  =  cie  +  C2e  +  C3e 

if  a,  b,  c  are  all  different,  and  find  the  solutions  if  two  or  three  of  the  roots  of  the 
auxiliary  equation  are  equal.  Generalize  the  result  to  higher-order  equations.  State 
your  results  in  vector  space  language  [see  comment  following  equation  (5.9)]. 

Use  the  results  of  Problem  21  to  find  the  general  solutions  of  the  following  equations  and 
compare  computer  solutions. 


22. 

23. 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 


{D  —  1)(D  +  S)(D  +  5)y  =  0 

(. D 2  +  1  )(D2  -  l)y  =  0  Hint:  D2  +  1  =  (D  +  i)(D  -  i). 

y"'  +  y  =  o 

(D3  +  D2  -  6D)y  =  0 

y'"  -  3 y"  -  9y'  -  by  =  0 

D2(D  —  1)2(D  +  2)3y  =  0 

(D4  +  4 )y  =  0  Hint:  Find  the  four  4th  roots  of  —4  (see  Chapter  2,  Section  10). 
(D  +  1)2(D4  —  16)y  =  0 
(D4-l)2y  =  0 

Let  D  stand  for  d/dx,  that  is,  Dy  =  dy/dx ;  then 


D2y  =  D(Dy)  = 


d 

dx 


d*y 
dx 2’ 


°3y  =  U’ etc- 


D  (or  an  expression  involving  D)  is  called  a  differential  operator.  Two  operators  are 
equal  if  they  give  the  same  results  when  they  operate  on  y.  For  example, 


D{D  +  X)y=i{t+Xy) 


^+Xt  +  V  =  {D2+XD  +  1)y 


so  we  say  that 

D(D  +  x)  =  D2  +  xD  +  1. 


In  a  similar  way  show  that: 
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(a)  ( D  —  a)(D  —  b)  =  (D  —  b)(D  —  a)  =  D2  —  (b  +  a)D  +  ab  for  constant  a  and  b. 

(b)  D3  +  1  =  {D  +  1)(D2  -D+l). 

(c)  Dx  =  xD  +  1.  (Note  that  D  and  x  do  not  commute,  that  is,  Dx  ^  xD.) 

(d)  (D  -  x)(D  +  x)  =  D2  -  x2  +  1,  but  ( D  +  x)(D  -  x)  =  D2  -  x2  -  1. 

Comment:  The  operator  equations  in  (c)  and  (d)  are  useful  in  quantum  mechanics; 
see  Chapter  12,  Section  22. 

32.  In  Example  3,  we  used  the  second  solution  in  (5.24),  and  obtained  (5.25)  as  the 
particular  solution  satisfying  the  given  initial  conditions.  Show  that  the  first  and 
third  solutions  in  (5.24)  also  give  the  particular  solution  (5.25)  satisfying  the  given 
initial  conditions. 

33.  A  particle  moves  along  the  x  axis  subject  to  a  force  toward  the  origin  proportional  to 
x  (say  —kx).  Show  that  the  particle  executes  simple  harmonic  motion  (Example  3). 
Find  the  kinetic  energy  \mv2  and  the  potential  energy  \ kx 2  as  functions  of  t  and 
show  that  the  total  energy  is  constant.  Find  the  time  averages  of  the  potential  energy 
and  the  kinetic  energy  and  show  that  these  averages  are  each  equal  to  one-half  the 
total  energy  (see  average  values,  Chapter  7,  Section  4). 

34.  Find  the  equation  of  motion  of  a  simple  pendulum  (see  Chapter  7,  Problem  2.13), 
that  is,  the  differential  equation  for  9  as  a  function  of  t.  Show  that,  for  small  9,  this 
is  approximately  a  simple  harmonic  motion  equation,  and  find  9  if  9  =  6>o,  d9/dt  —  0 
when  t  =  0. 

35.  The  gravitational  force  on  a  particle  of  mass  m  inside  the  earth  at  a  distance  r  from 
the  center  (r  <  the  radius  of  the  earth  R)  is  F  =  —mgr/R  (Chapter  6,  Section  8, 
Problem  21).  Show  that  a  particle  placed  in  an  evacuated  tube  through  the  center 
of  the  earth  would  execute  simple  harmonic  motion.  Find  the  period  of  this  motion. 

36.  Find  (in  terms  of  L  and  C)  the  frequency  of  electrical  oscillations  in  a  series  circuit 
(Figure  1.1)  if  R  =  0  and  V  =  0,  but  I  0.  (When  you  tune  a  radio,  you  are 
adjusting  C  and/or  L  to  make  this  frequency  equal  to  that  of  the  radio  station.) 

37.  A  block  of  wood  is  floating  in  water;  it  is  depressed  slightly  and  then  released  to 
oscillate  up  and  down.  Assume  that  the  top  and  bottom  of  the  block  are  parallel 
planes  which  remain  horizontal  during  the  oscillations  and  that  the  sides  of  the  block 
are  vertical.  Show  that  the  period  of  the  motion  (neglecting  friction)  is  2-rv^/h/g, 
where  h  is  the  vertical  height  of  the  part  of  the  block  under  water  when  it  is  floating 
at  rest.  Hint:  Recall  that  the  buoyant  force  is  equal  to  the  weight  of  displaced  water. 

38.  Solve  the  RLC  circuit  equation  [(5.33)  or  (5.34)]  with  V  =  0  as  we  did  (5.27),  and 
write  the  conditions  and  solutions  for  overdamped,  critically  damped,  and  under¬ 
damped  electrical  oscillations  in  terms  of  the  quantities  R,  L ,  and  C. 

39.  (a)  Find  numerical  values  of  the  constants  and  computer  plot  together  on  the 

same  axes  graphs  of  (5.30),  (5.31)  and  (5.32)  in  order  to  compare  overdamped, 
critically  damped,  and  oscillatory  motion.  Suggested  numbers:  Let  w  —  1,  and 
b  —  13/5,  1,  5/13  for  the  three  kinds  of  motion.  Let  y( 0)  =  1  and  y'( 0)  =  0. 

(b)  Repeat  the  problem  with  the  same  set  of  u>  and  b  values  and  with  y(0)  =  1, 
but  with  y'( 0)  =  1. 

(c)  Again  repeat,  with  y'( 0)  =  —1. 

40.  The  natural  period  of  an  undamped  system  is  3  sec,  but  with  a  damping  force 
proportional  to  the  velocity,  the  period  becomes  5  sec.  Find  the  differential  equation 
of  motion  of  the  system  and  its  solution. 
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6.  SECOND-ORDER  LINEAR  EQUATIONS  WITH  CONSTANT 
COEFFICIENTS  AND  RIGHT-HAND  SIDE  NOT  ZERO 


So  far  we  have  considered  second-order  linear  equations  with  constant  coefficients 
and  zero  right-hand  side  (5.1).  Such  equations  describe  free  vibrations  or  oscillations 
of  mechanical  or  electrical  systems.  But  often  such  systems  are  not  free  but  are 
subject  to  an  applied  force  or  emf.  The  vibrations  are  then  called  forced  vibrations 
and  the  differential  equation  describing  the  system  is  of  the  form 


(6.1) 


«2 


d2y 

dx 2 


+  +  ao  v  =  /(*)> 


d2y  aidy  a0  ,  . 
d^  +  -2^  +  ~,ll  =  FM- 


or 


The  function  f{x)  is  often  called  the  forcing  function;  it  represents  the  applied  force 
or  emf.  We  want  to  find  the  general  solution  of  equations  of  the  form  (6.1). 


Example  1.  Consider  the  equation 

(6.2)  {D2  +  5D  +  4  )y  =  cos  2x. 

We  already  know  (from  Section  5,  Example  1)  the  general  solution  of  the  corre¬ 
sponding  equation  (5.2)  with  the  right-hand  side  equal  to  zero.  This  solution  (5.9) 
is  called  the  complementary  function ;  it  is  not  a  solution  of  (6.2)  but  is  related  to 
it  as  we  shall  see.  We  shall  denote  the  complementary  function  by  yc.  Thus  for 
equation  (6.2)  the  complementary  function  is 

(6.3)  yc  =  Ae~x  +  Be~4x. 

Now  suppose  we  know  just  any  solution  of  (6.2);  we  call  this  solution  a  particular 
solution  and  denote  it  by  yp.  You  can  easily  verify  that 

(6.4)  Vp  =  To  sin  2a: 

is  a  particular  solution  of  (6.2),  and  we  shall  soon  consider  ways  of  finding  such 
solutions.  Then  we  have 

(6.5)  {D2  +  5  D  +  4  )yp  =  cos  2x 
and  from  Section  5,  Example  1, 

(6.6)  (D2  +  5D  +  4)yc  =  0. 

Adding  (6.5)  and  (6.6),  we  find 

(. D 2  +  5  D  +  4  ){yp  +  yc)  =  cos  2x  +  0  =  cos  2x. 


Thus 

(6.7)  y  =  yc  +  yp  =  Ae~x  +  Be~ix  +  ^  sin  2a; 

is  a  solution  of  (6.2).  In  fact,  it  is  the  general  solution  of  (6.2)  since  it  contains  two 
independent  arbitrary  constants  (Problem  27). 
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Thus  we  see  how  to  solve  6.1): 


The  general  solution  of  an  equation  of  the  form  (6.1)  is 

(6.8)  y  =  yc  +  yp 

where  the  complementary  function  yc  is  the  general  solution  of  the  homogeneous 
equation  (as  in  Section  5)  and  yp  is  a  particular  solution  of  (6.1). 


We  shall  now  discuss  some  ways  of  finding  particular  solutions.  It  is  worthwhile 
to  know  about  this  even  if  you  are  using  a  computer  to  find  the  solution.  When  you 
know  what  to  expect,  you  are  better  able  to  judge  whether  a  computer  solution  is  in 
the  best  form  for  your  purposes,  and  if  not,  to  find  a  better  form.  (See  problems.) 

Inspection  If  there  is  a  very  simple  particular  solution,  we  may  be  able  to  guess 
and  verify  it. 

Example  2.  Consider  the  equation  y"  —  2 y'  +  3y  =  5. 

It  is  easy  to  see  that  yp  =  |  is  a  particular  solution  of  this  equation  since  if  y  is 
constant,  y"  and  y'  are  zero. 

Example  3.  As  a  less  trivial  problem,  consider 

(6.9)  y"  —  6?/  +  9y  =  %ex . 

We  might  suspect  that  a  multiple  of  ex  is  a  solution  of  this  equation,  and  it  is  easy 
to  verify  that  y  =  2ex  is  a  solution.  But  trying  the  same  method  for  the  equation 

(6.10)  y"  +  y'  -2y  =  ex, 
we  fail  to  find  a  particular  solution  since  ex  satisfies 

y"  +  y'  -  2y  =  o. 

The  method  of  inspection  is  very  good  in  simple  cases  where  it  gives  us  an  answer 
quickly,  but  usually  we  need  other  methods. 

Successive  Integration  of  Two  First-Order  Equations  This  is  a  straight¬ 
forward  method  which  can  always  be  used  to  solve  equations  of  the  form  (6.1).  In 
practice,  however,  it  often  involves  more  work  than  various  special  methods;  we 
shall  find  it  particularly  useful  in  deriving  the  special  methods. 

Example  4.  Let’s  solve  (6.10)  again.  We  can  write  this  differential  equation  as 

(6.11)  (D-l)(D  +  2)y  =  ex. 


Let 

(6.12) 


u  =  ( D  +  2)y . 
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Then  the  differential  equation  (6.11)  becomes 

(6.13)  (D  —  \)u  =  ex  or  u'  —  u  =  ex. 

This  is  a  first-order  linear  differential  equation  which  we  solve  as  in  Section  3. 


I  =  J  —dx  =  —x, 

<614)  «e-  =  /e-e- *  =  *  + 

u  =  xex  +  c\ex . 

Then  the  differential  equation  for  y  becomes 

{D  +  2)y  =  xex  +  C\ex  or  y' +  2y  =  xex  +  C\ex . 
This  is  again  a  linear  first-order  equation  which  we  solve  as  follows: 

/  =  f  2  dx  =  2x, 


(6.15) 


ye2x  = 


J  e2x(xex  +  c\ex)  dx  =  \xe3x  —  | e3x  +  | cie3x  +  C2 


=  r,xe3x  +  dxe3x  +  C2, 

O 

y  =  \xex  +  dxex  +  C2e~2x. 

Notice  that  here  we  have  obtained  the  general  solution  all  in  one  process  rather 
than  finding  the  complementary  function  plus  a  particular  solution  in  two  separate 
processes.  However,  we  could  have  obtained  just  the  particular  solution  xex /3  by 
omitting  the  arbitrary  constant  at  each  integration  (these  led  to  the  complementary 
function)  and  also  dropping  terms  which  are  already  in  the  complementary  function 
(—ex/9  in  this  example).  Since  it  is  easy  to  write  the  complementary  function  (by 
Section  5),  it  saves  time  to  omit  those  terms  when  we  are  finding  a  particular 
solution.  You  may  find  that  your  computer  gives  a  more  complicated  particular 
solution  by  including  terms  of  the  complementary  function  in  the  particular  solution. 
Now  that  you  know  to  watch  for  this,  you  can  simplify  a  computer  solution  by 
removing  those  terms. 


Exponential  Right-Hand  Side  Let  us  consider  how  to  find  a  particular  solution 
when  the  right-hand  side  of  (6.1)  is  F(x)  =  kecx  where  k  and  c  are  given  constants. 
Observe  that  c  may  be  complex;  we  shall  be  especially  interested  in  this  case  later. 
Let  a  and  b  be  the  roots  of  the  auxiliary  equation  of  (6.1);  then  (6.1)  becomes 

(6.16)  (D  —  a)(D  —  b)y  =  F(x)  =  kecx . 

Let  us  first  suppose  that  c  is  not  equal  to  either  a  or  b.  Solving  (6.16)  by  successive 
integration  of  two  first-order  equations  as  in  the  last  paragraph  is  straightforward 
(Problem  28)  and  gives  the  result  that  the  particular  solution  in  this  case  is  simply 
a  multiple  of  ecx .  It  is  not  necessary  to  remember  the  formula  for  the  constant 
factor  or  to  go  through  this  process  each  time.  Now  that  we  know  the  form  of 
the  particular  solution,  we  simply  assume  a  solution  of  this  form  and  solve  for  the 
constant. 
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Example  5.  Solve  the  equation 

(6.17)  (D-  l)(D  +  5)y  =  le2x. 

We  observe  that  c  =  2  is  not  equal  to  either  of  the  roots  of  the  auxiliary  equation. 
To  find  a  particular  solution  we  substitute  yp  =  Ce2x  into  (6.17)  and  get 

y;  +  4 y'p  -  5 yp  =  C( 4e2*  +  8e2*  -  5e2*)  =  7e2*. 

Thus  we  must  have  (7=1,  and  the  general  solution  of  (6.17)  is 

y  =  Aex  +  Be~5x  +  e2x. 

We  have  already  seen  in  solving  (6.11)  that  if  c  is  equal  to  either  a  or  b  (a  ^  b ), 
the  particular  solution  is  of  the  form  Cxecx.  By  the  same  method  used  for  (6.11), 
you  can  easily  discover  that  if  a  =  b  =  c,  the  particular  solution  is  of  the  form 
Cx2ecx  (Problem  28c).  In  practice,  then,  we  find  a  particular  solution  of  (6.16)  by 
assuming  a  solution  of  the  form: 


1 

(Cecx 

if  c  is  not  equal  to  either  a  or  6; 

(6.18) 

Cxecx 

if  c  equals  a  or  fe,  a  /  6; 

' 

[<7a;2ecx 

if  c  =  a  =  b. 

Now  that  we  know  this,  we  would  solve  (6.10)  as  follows.  Substitute 
yp  =  Cxex ,  y'p  =  C(xex  +  ex),  y"  =  C(xex  +  2ex) 
into  (6.10)  and  get 

Dp  +  y'p  -  2 yp  =  C(xex  +  2ex  +  xex  +  ex  -  2xex)  =  ex. 

Thus  we  find  C  =  |  as  in  (6.15)  (but  with  much  less  work). 

Use  of  Complex  Exponentials  In  applied  problems,  the  function  F(x)  on  the 
right-hand  side  of  (6.1)  is  very  often  a  sine  or  a  cosine  representing  alternating  emf 
or  a  periodic  force.  We  could  find  yp  for  such  a  problem  either  by  the  method  of 
integrating  two  successive  first-order  equations  or  by  replacing  the  sine  or  cosine  by 
its  complex  exponential  form  and  using  the  method  of  the  last  paragraph.  There  is 
a  still  more  efficient  variation  of  the  latter  method  which  we  shall  now  show. 

Example  6.  Solve 

(6.19)  y"  +  y'  —  2y  =  4  sin  2x. 

Instead  of  tackling  this  problem  directly,  we  are  first  going  to  solve  the  equation 

(6.20)  Y"  +  Y'  -2Y  =  4e2“. 

Since  e2“  =  cos  2x  +  i  sin  2x  is  complex,  the  solution  Y  may  be  complex  also.  Then 
if  Y  =  Yp/  +  iYj,  (6.20)  is  equivalent  to  two  equations 

yr  +yr-  2YR  =  Re4e2ix  =  4  cos  2x, 

V/'  +  Y[  -  2Y,  =  Im  Ae2ix  =  4  sin  2x. 


(6.21) 
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Since  the  second  equation  in  (6.21)  is  the  same  as  (6.19),  we  see  that  the  solution  of 
(6.19)  is  the  imaginary  part  of  Y.  Thus  to  find  yp  for  (6.19),  we  find  Yp  for  (6.20) 
and  take  its  imaginary  part.  We  observe  that  2 i  is  not  equal  to  either  of  the  roots 
of  the  auxiliary  equation  in  (6.20).  Following  the  method  of  the  last  paragraph,  we 
assume  a  solution  of  the  form 


Yp  =  Ce2ix 


and  substitute  it  into  (6.20)  to  get 


(—4  +  2i  —  2)Ce2ix  =  4e2ix, 
4  4(—2i  —  6) 


2i-  6 


—  -\{i  +  3), 


Yp  =  -\(i  +  Z)t 


Taking  the  imaginary  part  of  Yp,  we  find  yp  for  (6.19): 
(6.22)  yp  =  —  i  cos  2x  —  |  sin  2x. 

We  summarize  the  method  of  complex  exponentials: 


Method  of  Undetermined  Coefficients  The  method  we  have  just  discussed 
of  assuming  an  exponential  solution  and  determining  the  constant  factor  C  is  an 
example  (and  in  practice  the  most  important  case)  of  the  method  of  undetermined 
coefficients.  In  (6.18)  we  outlined  the  form  of  yp  to  assume  for  equation  (6.16), 
that  is,  when  the  right-hand  side  of  (6.1)  is  an  exponential.  It  is  straightforward 
but  tedious  (Problems  29  and  32)  to  find  the  corresponding  result  (6.24)  when  the 
right-hand  side  is  an  exponential  times  a  polynomial. 

A  particular  solution  yp  of  {D—a){D—b)y  =  ecxPn(x)  where  Pn(x)  is  a  polynomial 
of  degree  n  is 

{ecxQn(x)  if  c  is  not  equal  to  either  a  or  b , 

xecxQn(x)  if  c  equals  a  or  b,  a  b, 
x2ecxQn(x )  if  c  =  a  =  b, 

where  Qn(x )  is  a  polynomial  of  the  same  degree  as  Pn{x)  with  undetermined 
coefficients  to  be  found  to  satisfy  the  given  differential  equation.  Note  that  sines 
and  cosines  are  included  in  ecx  by  use  of  complex  exponentials  as  in  (6.19)  to 
(6.23).  (Also  see  Problem  29.) 
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Example  7.  To  illustrate  using  (6.24),  let’s  find  a  particular  solution  of 

(6.25)  ( D  -  1  ){D  +  2 )y  =  y"  +  y'  -  2y  =  I8xex. 

In  the  notation  of  (6.24)  we  have  a  =  1,  b  =  —2,  c  =  1;  also  P„( x)  =  18a;  =  Pi  (a:) 
is  a  polynomial  of  degree  1.  Then  Q\  is  a  polynomial  of  degree  1,  namely  Ax  +  B. 
Since  c  =  a  ^  b,  we  see  by  (6.24)  that  the  form  to  assume  for  a  particular  solution 
of  (6.25)  is 

yp  =  xex(Ax  +  B)  =  ex(Ax2  +  Bx). 

We  substitute  this  into  (6.25)  and  find  A  and  B  so  that  we  have  an  identity. 

yp'  =  ex(Ax2  +  Bx  +  2  Ax  +  B), 
yp "  =  ex{Ax 2  +  Pa;  +  4,4a:  +  2P  +  2  A) 

Dp"  +  Up  ~~  2 yp  =  ex(6Ax  +  3 P  +  2,4)  =  18a;ex 

To  make  this  an  identity,  we  must  have 

6 A  =  18,  3 P  +  2 A  =  0,  or  A  =  3,  P  =  —2,  so 


(6.26)  yp  =  (3a;2  —  2x)ex . 

A  computer  solution  may  add  to  this  a  constant  times  ex,  but  this  is  an  unnecessary 
complication  since  ex  is  a  term  in  the  complementary  function. 

If  the  right-hand  side  of  a  differential  equation  is  a  polynomial,  then  c  =  0  in 
(6.24),  and  we  assume  for  yp  a  polynomial  as  indicated  in  (6.24). 

Example  8.  To  solve 

(6.27)  (P  -  1)(P  +  2 )y  =  y"  +  y'  -  2y  =  x2  -  x 
we  assume  yp  =  ,4a;2  +  Bx  +  C,  and  find  the  particular  solution 

(6.28)  yp  =  ~\{x2  +  l). 

A  computer  solution  gives  the  same  result. 

PROBLEMS,  SECTION  6 

Find  the  general  solution  of  the  following  differential  equations  (complementary  function 
+  particular  solution).  Find  the  particular  solution  by  inspection  or  by  (6.18),  (6.23), 
or  (6.24).  Also  find  a  computer  solution  and  reconcile  differences  if  necessary,  noticing 
especially  whether  the  particular  solution  is  in  simplest  form  [see  (6.26)  and  the  discussion 
after  (6.15)]. 


1. 

y"  ~  Ay  =  10 

2. 

(D  -  2  fy  =  16 

3. 

V  +V  ~2y  =  e 

4. 

(D  +  l)(£>-3)y  =  : 

5. 

(D2  +  1  )y  =  2e* 

6. 

y"  +  6  y'  +  9  y  =  12  e 

7. 

V  -y  -  2y  =  3e 

8. 

y"  —  16  y  =  40e4a: 
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9. 

(D2  +2D  +  1  )y  =  2e~x 

10. 

(D  -  3 )2y  =  6e3x 

11. 

y"  +  2 y'  +  10 y  =  100  cos  4x  Hint:  First 

solve  y"  +  2 y'  +  lOj /  =  100e4xx . 

12. 

(D2  +  47?  +  12)y  =  80  sill  2x 

13. 

( D 2  —  2D  +  1  )y  =  2  cos  x 

14. 

y"  +  8y'  +  25y  =  120  sin  5x 

15. 

5  y"  +  12  y  +  20y  =  120  sin  2a: 

16. 

( D 2  +  9)y  =  30  sin  3a: 

17. 

y”  +  16  y  =  16  cos  4a: 

18. 

(D2 +2D+17)y  =  60e-4x  sin  5a; 

Hint:  First  solve  (D2+2D+17)y  =  60e(~4+si)x . 

19. 

(4 D2  +  4 D  +  5 )y  =  ^Oe~3x^2  sin  2a: 

20. 

y"  +  Ay'  +  8y  =  30e  x^2cos5a:/2 

21. 

5  y"  +  Gy'  +  2y  =  x2  +  6x 

22. 

2y"+y'  =2x 

23. 

y"  +  y  =  2xex 

24. 

y"  —  Gy'  +  9  y  =  12xe3x 

25. 

(D  —  3)(D  +  l)y  =  16x2e~x 

26. 

( D 2  +  1  )y  =  8xsina: 

27. 

Verify  that  (6.4)  is  a  particular  solution  of  (6.2).  Verify  that  another  particular 
solution  of  (6.2)  is 

1  •  rt  —X 

yp  =  To  sin  lx  —  e 

Observe  that  we  obtain  the  same  general  solution  (6.7)  whichever  particular  solution 
we  use  [since  ( A  —  1)  is  just  as  good  an  arbitrary  constant  as  A],  Show  in  general 
that  the  difference  between  two  particular  solutions  of  (a2-D2  +  a\D  +  ao)y  =  f(x) 
is  always  a  solution  of  the  homogeneous  equation  (d2 D2  +  a\D  +  ao)y  =  0,  and  thus 
show  that  the  general  solution  is  the  same  for  all  choices  of  a  particular  solution. 

28.  Solve  (6.16)  by  the  method  used  in  solving  (6.11),  for  the  following  three  cases,  to 
obtain  the  result  (6.18). 

(a)  c  is  not  equal  to  either  a  or  b; 

(b)  a  b,  c  =  a; 

(c)  a  —  b  =  c. 

29.  Consider  the  differential  equation  ( D  —  a)(D  —  b)y  =  Pn(x),  where  Pn(x)  is  a  poly¬ 
nomial  of  degree  n.  Show  that  a  particular  solution  of  this  equation  is  given  by 
(6.24)  with  c  =  0;  that  is,  yv  is 

{a  polynomial  Qn(x)  of  degree  n  if  a  and  b  are  both  different  from  zero; 
xQ„(x)  if  a  ^  0,  but  b  =  0; 
x2Q„(x)  if  a  =  6  =  0. 

Hint:  To  show  that  Qn(x)  =  ’Y^anxn  is  a  solution  of  the  differential  equation  for 
a  given  Pn  =  ^2  bnx 71 ,  you  have  only  to  show  that  the  coefficients  an  can  be  found 
so  that  (D  —  a)(D  —  b)Qn(x)  =  Pn(x).  Equate  coefficients  of  xn,  a;n_1,  •  •  • ,  to  see 
that  this  is  always  possible  if  a  ^  b.  For  6  =  0,  the  differential  equation  becomes 
(. D  —  a)Dy  =  Pn\  what  is  Dy  if  y  =  xQ„t  Similarly,  consider  D2y  if  y  =  x2Q„. 

30.  (a)  Show  that 

(D  —  a)ecx  =  (c  —  a)ecx; 

(D2  +  5 D-  3)ecx  =  (c2  +  5c  -  3)ecx; 

L(D)ecx  =  L(c)ecx ,  where  L(D)  is  any  polynomial  in  D ; 

( D  —  c)xe  =  e  ; 

/  \2  2  cx  o  cx 

(D  —  c)  x  e  =  2e  . 
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(b)  Define  the  expression  y  =  [1  / L(D)]u(x)  to  mean  a  solution  of  the  differential 
equation  L(D)y  =  u.  Using  part  (a),  show  that 


1 

D  —  a 

1 

D2  +  5D  —  3 

1 

L(D) 

1 

D-c 

1 


c  —  a 
ec 


c^a; 


c2  +  5c  —  3  ’ 
ecx 


L(c)  ’ 

CX 

xe  ; 


L(c)  ^  0; 


1  2  cx 

oX  e  . 


(c)  The  expressions  1/L(D)  in  (b)  are  called  inverse  operators.  They  can  be  used 
to  find  particular  solutions  of  differential  equations.  As  an  example  consider 
Problem  3.  We  write 


(D2  +  D-2)y  =  e2x, 

=  1  2x  =  e2x 

V  D2  +  D-  2e  22  +  2  —  2 

Using  inverse  operators,  find  particular  solutions  of  Problems  4  to  20.  Be 
careful  to  use  parts  4  or  5  of  (b)  if  c  is  a  root  of  the  auxiliary  equation.  For 
example, 


(. D-a){D-c ) 


D  —  c  D  —  a 


D  —  c  c  —  a 


31.  (a)  Show  that 


D(eaxy)  =  eax(D  +  a)y , 

D2(eaxy)  =  eax(D  +  a)2y, 

and  so  on;  that  is,  for  any  positive  integral  n, 

D  (e  y)  =  e  (D  +  a)  y. 

Thus  show  that  if  L(D)  is  any  polynomial  in  the  operator  D,  then 
L(D)(eaxy)=eaxL(D  +  a)y. 

This  is  called  the  exponential  shift. 

(b)  Use  (a)  to  show  that 

(D-l)3(exy)=exD3y, 

(D2  +  D-  6 )(e~3xy)  =  e~3x(D2  -  5 D)y. 

(c)  Replace  D  by  D  —  a,  to  obtain 

eaxP(D)y  =  P(D  -  a)eaxy. 


This  is  called  the  inverse  exponential  shift. 
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(d)  Using  (c),  we  can  change  a  differential  equation  whose  right-hand  side  is  an 
exponential  times  a  polynomial,  to  one  whose  right-hand  side  is  just  a  polyno¬ 
mial.  For  example,  consider  ( D 2  —  D  —  6 )y  =  10xe3x\  multiplying  both  sides 
by  e~3x  and  using  (c),  we  get 

e~3x(D2  —  D  —  6)y  =  [(D  +  3)2  —  (D  +  3)  —  6  ]ye~3x 
=  ( D 2  +  5  D)ye~3x  =  10®. 

Show  that  a  solution  of  (D2  +  5D)u  =  10®  is  it  =  ®2  —  |®;  then  ye~3x  =  ®2  —  |® 
or  y  =  e3x(x 2  —  |®).  Use  this  method  to  solve  Problems  23  to  26. 

32.  Using  Problems  29  and  31b,  show  that  equation  (6.24)  is  correct. 

Several  Terms  on  the  Right-Hand  Side:  Principle  of  Superposition  So 

far  we  have  brushed  over  a  question  which  may  have  occurred  to  you:  What  do 
we  do  if  there  are  several  terms  on  the  right-hand  side  of  the  equation  involving 
different  exponentials? 

Example  9.  As  an  artificial  problem  to  illustrate  the  ideas,  consider  the  equation 

(6.29)  y"  +  y'  —  2y  =  (D  —  1)(D  +  2  )y  =  [ ex ]  +  [4  sin  2x]  +  [x2  —  x\. 

We  have  already  solved  differential  equations  with  the  same  left-hand  sides  as  (6.29) 
and  with  right-hand  sides  equal  in  turn  to  each  of  the  three  expressions  in  brackets  in 

(6.29)  [see  (6.11)  to  (6.15),  (6.19)  to  (6.22),  (6.27),  and  (6.28)].  Thus  we  know  that 

(D  —  1  ){D  +  2 )y  =  ex  has  the  particular  solution  yp i  =  | xex ; 

{D  —  1  )(D  +  2 )y  =  4  sin  2®  has  the  particular  solution  yp2  =  —  ^  cos  2®  —  |  sin2®; 

{D  —  1  ){D  +  2 )y  =  x2  —  x  has  the  particular  solution  yp3  =  —  ^(®2  +  1). 

Adding  these  three  solutions,  we  see  that 

(6.30)  yp  =  ypi  +  yp2  +  yp 3  =  \xex  -  ^  cos  2®  -  f  sin  2®  -  \(x2  +  1) 
is  a  particular  solution  of  (6.29). 

This  is  the  easiest  way  of  handling  a  complicated  right-hand  side:  Solve  a  sep¬ 
arate  equation  for  each  different  exponential  and  add  the  solutions.  The  fact  that 
this  is  correct  for  a  linear  equation  is  often  called  the  principle  of  superposition. 
As  we  can  see  from  (6.29)  and  (6.30),  this  amounts  to  a  fancy  name  for  the  fact 
that  the  derivative  (of  any  order)  of  a  sum  of  terms  is  equal  to  the  sum  of  the 
derivatives  of  the  individual  terms.  Notice  that  the  principle  holds  only  for  linear 
equations;  for  example,  if  the  equation  contained  j/'2,  the  principle  would  not  hold 
since  (y[  +  y2)2  is  not  equal  to  y[ 2  +  y'22.  In  fact,  an  operator  (such  as  the  D  oper¬ 
ators  we  have  been  using)  which  satisfies  the  principle  of  superposition  is  called  a 
linear  operator.  [See  Chapter  3,  equation  (7.4)  and  Problem  7.12.]  Linear  operators 
are  of  particular  importance  because  they  obey  the  principle  of  superposition;  for 
example,  D2(yi  +  y2)  =  y'{  +  y"  =  D2y\  +  D2y2l  so  D 2  is  a  linear  operator.  We 
shall  make  use  of  this  principle  shortly  in  our  discussion  of  the  use  of  Fourier  series 
in  finding  particular  solutions. 
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Forced  Vibrations  Let’s  return  now  to  the  physical  problem  we  considered  at 
the  end  of  Section  5.  There  we  set  up  and  solved  the  differential  equation  which 
describes  the  free  (zero  right-hand  side,  no  forcing  function)  vibrations  of  a  damped 
oscillator.  We  commented  that  the  same  mathematics  applies  to  a  variety  of  me¬ 
chanics  problems  and  also  to  a  simple  RLC  series  electric  circuit.  As  we  know  from 
experiment  and  as  we  can  see  from  (5.30),  (5.31),  and  (5.32),  the  free  vibrations  we 
considered  in  Section  5  die  out  as  time  passes.  Such  oscillations  are  referred  to  as 
transients.  We  next  want  to  consider  the  vibrations  obtained  when  a  periodic  force 
(or  emf  in  the  electric  case)  is  applied.  This  means  mathematically  that  we  want  to 
solve  (5.27)  with  a  function  of  t  on  the  right-hand  side.  The  solution  will  contain 
the  appropriate  one  of  (5.30),  (5.31),  (5.32);  this  is  the  complementary  function  and 
it  is  also  the  transient  since  it  tends  to  zero  as  t  tends  to  infinity.  The  solution  will 
also  contain  a  particular  solution  which  does  not  tend  to  zero  as  t  tends  to  infinity; 
this  is  the  steady-state  solution  which  we  want  to  find. 


Example  10.  Let  us  solve 
(6.31) 


+  26^  +  uj2y  =  Fsino/f  (F  =  const.). 
dtz  dt 


By  the  method  of  complex  exponentials,  we  solve  first 


(6.32) 


d2Y  dY  , 

—4  +  26--—  +  u2Y  =  Feiu}  *. 
dtz  dt 


Substitute 

(6.33)  Yp  =  Ceiu,t 

into  (6.32)  to  get 


(6.34) 


C  = 


{-oj'2  +  2biJ  +  w2)C'e<ta,'t  =  Feiu}'\ 

F  _  [(lo2  -  u’2)  -  2biu/]F 


(cj2  —  cj'  )  +  2 bicj'  (a;2  —  a;'2)  +  4 b2io'2 

It  is  convenient  to  write  the  complex  number  C  in  the  relS  form.  We  have 

F 

M  =  , 

(6.35) 


\C\  =  -j= 

(uj2  -  uj'2Y  +  462w'2 

angle  of  C  =  — <f> ,  where  <j>  is  given  by  Figure  6.1. 


2  bco' 


Thus 

(6.36) 


Figure  6.1 


C  = 


\J  (w2  —  w'2)2  +  462u/ 


0—i4> 


and  from  (6.33) 


yP  = 


F 


\J  (lo2  —  u'2)2  +  Ab2uj' 


(6.37) 
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To  find  yp  we  take  the  imaginary  part  of  Yp: 

F 

(6.38)  yp  =  .  sin(a/f  —  </>). 

{u2  -  u'2)2  +  Ab2u>'2 

This  is  the  steady-state  solution,  so-called  because  as  t,  increases,  the  rest  of  the 
solution  [given  by  (5.30),  (5.31),  or  (5.32)]  becomes  negligible.  For  example,  when 
you  turn  on  an  electric  light,  the  current  is  given  by  (5.32)  plus  (6.38).  The  transient 
(5.32)  tends  to  zero  rapidly  and  the  steady-state  solution  (6.38)  becomes  essentially 
the  whole  solution. 


Resonance  We  note,  by  comparing  (6.38)  and  the  forcing  function  in  (6.31), 
that  the  applied  force  (or  emf)  and  the  solution  y  (which  represents  displacement, 
current,  etc.)  are  out  of  phase',  that  is,  their  maximum  values  do  not  occur  at 
the  same  time  because  of  the  phase  angle  f>.  We  also  see  from  (6.38)  that  for  a 
given  forcing  frequency  io' ,  the  largest  amplitude  of  y  (also  of  dy/dt ,  Problem  40) 
occurs  if  the  natural  (undamped)  frequency  u>  is  equal  to  a/.  This  situation  is 
often  called  resonance.  In  the  RLC  series  circuit  problem,  y  represents  the  charge 
q  on  the  capacitor  if  the  forcing  function  is  the  emf,  and  y  represents  the  current 
/  =  dq/dt  if  the  forcing  function  is  the  time  derivative  of  the  emf.  For  such  a 
circuit,  given  the  frequency  w'  of  the  applied  emf,  the  current  (or  charge)  will 
have  the  largest  amplitude  when  the  natural  (undamped)  frequency  w  is  equal  to 
u>' .  This  is  almost  always  called  the  resonance  condition  for  the  electrical  case. 
However,  there  is  another  question  we  could  ask  here  which  is  of  particular  interest 
in  mechanics.  Given  the  natural  (undamped)  frequency  u>  of  the  system,  what 
frequency  of  the  forcing  function  will  produce  the  largest  amplitude  of  yl  In  (6.38), 
we  want  to  maximize  the  coefficient  of  the  sine;  we  can  instead  minimize  the  square 
of  the  denominator  of  the  coefficient;  that  is,  we  want  to  find  the  value  of  u>'  which 
minimizes  (w2  —  a/2)2  +  4 b2u>'2  for  given  oj.  Setting  the  derivative  of  this  function 
(with  respect  to  u/)  equal  to  zero  and  solving  for  u>',  we  get 


(6.39) 


2{uj2  -  w,2)(- 2J)  +  8&V  =  0, 

u/2  =  a,2  -  2b2. 


Note  that  this  value  of  u>'  is  not  equal  to  either  the  natural  undamped  frequency 
oj  or  the  natural  damped  frequency  /3  where  (32  =  u>2  —  b2  [see  (5.32)].  However,  if 
we  define  resonance  as  the  situation  in  which  we  get  the  maximum  amplitude  for 
y  for  a  given  value  of  u>,  then  the  resonance  condition  is  (6.39).  (The  maximum 
amplitude  for  the  velocity — or  current  in  the  electrical  case — is  still  obtained  for 
to'  =  uj:  Problem  40.)  The  resonance  condition  (6.39)  is  of  particular  importance 
in  mechanics  where  we  are  apt  to  be  interested  in  the  displacement  y  of  a  given 
system  under  the  action  of  various  forces.  For  example,  consider  a  bridge;  we  would 
want  to  avoid  periodic  forces  with  an  to'  given  by  (6.39)  since  such  forces  would 
produce  large  vibrations.  In  this  case  resonance  is  undesirable.  It  may  in  other 
cases  be  desirable;  for  example,  when  you  tune  your  radio  to  the  frequency  of  a 
given  station,  you  are  given  u/  and  you  adjust  the  circuit  in  your  radio  to  make  its 
natural  frequency  w  equal  to  the  given  t J . 
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Use  of  Fourier  Series  in  Finding  Particular  Solutions  In  simple  problems, 
the  forcing  function  in  either  the  electrical  or  mechanical  case  is  just  a  sine  or 
cosine  and  the  problem  can  be  solved  as  we  have  just  done.  In  more  complicated 
(and  realistic)  cases,  however,  the  forcing  function  may  very  well  be  some  more 
complicated  function;  it  is  often  a  periodic  function,  however,  and  we  shall  assume 
this.  Suppose,  for  example,  that  the  periodic  emf  applied  to  a  circuit  is  given  by  one 
of  the  graphs  in  Figure  3.2  of  Chapter  7.  We  learned  in  Chapter  7  how  to  expand 
such  a  function  in  a  Fourier  series.  Let  us  suppose  that  this  has  been  done,  using 
for  definiteness  the  complex  exponential  form  of  the  Fourier  series.  Then  we  can 
write  (6.1)  as 

(6.40)  a2^f  +  +  aoy  =  f(x)  =  X!  c^mx ■ 

n=— oo 

We  know  how  to  solve  the  equation 

(6.41)  a2^  +  ai^  + aoy  =  Cneinx 

with  the  right-hand  side  equal  to  any  one  term  of  the  series.  If  we  now  add  the 
solutions  of  all  the  equations  (6.41)  for  all  n,  we  have  a  solution  of  (6.40)  (see 
principle  of  superposition  above) . 


Example  11.  Solve 

(0,2,  §+^  +  10, 

where  f(t)  is  a  function  of  period  27t  and 


1,  0  <  t  <  7T, 

0,  7 T  <  t  <  27 T. 


The  auxiliary  equation  is 

D2  +2D  +  10  =  0; 


its  roots  are 


D  =  — 1  ±  3i, 


so  the  complementary  function  is 


yc  =  e  4  {A  cos  3 1  +  B  sin  3 1) . 


To  find  a  particular  solution,  we  first  expand  f{t)  in  a  Fourier  series;  from  Chapter  7, 
equation  (7.8),  we  have 

(6.43)  /(f)  =  i  +  ^[ei4  -  e~lt  +  i(e3i4  -  e"344)  +  •••]. 

We  next  write  and  solve  a  whole  set  of  differential  equations  like  (6.42)  but  each 
having  just  one  term  of  the  series  (6.43)  on  the  right-hand  side.  For  the  first  term 
(namely  i)  we  see  by  inspection  that  a  particular  solution  of 


d2y  „ dy 
dt 2  dt 


+ 10  y 


1 

2 
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PROBLEMS,  SECTION  6 

In  Problem  33  to  38,  solve  the  given  differential  equations  by  using  the  principle  of  su¬ 
perposition  [see  the  solution  of  equation  (6.29)].  For  example,  in  Problem  33,  solve  three 
differential  equations  with  right-hand  sides  equal  to  the  three  different  brackets.  Note  that 
terms  with  the  same  exponential  factor  are  kept  together;  thus  a  polynomial  of  any  degree 
is  kept  together  in  one  bracket. 

33.  y"  +  y  =  [*3  —  1]  +  [2  cos  x\  +  [(2  —  4x)ex] 

34.  y"  -  5 y  +  6y  =  2ex  +  6x  -  5 

35.  ( D 2  —  1  )y  =  sinha; 

36.  (D2  +  l)y  =  2  sin*  +  4*  cos  x 

37.  (D  -  l)2y  =  4ex  +  (1  -  x)(e2x  -  1) 

38.  y"  —  2  y  =  9xe~x  —  6x2  +  4e2x 
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39.  Find  the  solutions  of  (1.2)  (put  I  =  dq/dt)  and  (1.3),  if  V  =  Vosinu/f  (u/  =const.). 

40.  In  (6.38),  show  that  for  a  given  forcing  frequency  ui',  the  displacement  y  and  the 
velocity  dy/dt  have  their  largest  amplitude  when  u>  =  u/. 

For  a  given  w,  we  have  shown  in  Section  6  that  the  maximum  amplitude  of  y  does 
not  correspond  to  u/  =  u>.  Show,  however,  that  the  maximum  amplitude  of  dy/dt 
for  a  given  u>  does  correspond  to  <*/  =  lo. 

State  the  corresponding  results  for  an  electric  circuit  in  terms  of  L,  R ,  C. 


Solve  Problems  41  and  42  by  use  of  Fourier  series.  Assume  in  each  case  that  the  right-hand 
side  is  a  periodic  function  whose  values  are  stated  for  one  period. 


41. 

y" 

+  2  y‘  +  2  y  = 

1*1. 

—  7T  • 

1  x. 

0  <  * 

<  1, 

42. 

y" 

+  9y=  <  ’ 

1°, 

-1  < 

x  <  0. 

43. 

Consider  an  equation 

for  dr 

which  the  right-hand  side  is  a  sum  of  several  forces  or  emfs  of  different  frequencies. 
For  example,  in  (6.32)  let  the  right-hand  side  be 


Fie 


iui[t  uo'3t 


Write  the  solution  by  the  principle  of  superposition.  Suppose,  for  given  a4,  u>3, 
that  we  adjust  the  system  so  that  u>  =  u>i\  show  that  the  principal  term  in  the 
solution  is  then  the  first  one.  Thus  the  system  acts  as  a  “filter”  to  select  vibrations 
of  one  frequency  from  a  given  set  (for  example,  a  radio  tuned  to  one  station  selects 
principally  the  vibrations  of  the  frequency  of  that  station). 


7.  OTHER  SECOND-ORDER  EQUATIONS 

Although  second-order  linear  equations  with  constant  coefficients  are  the  ones  used 
most  frequently  in  applications,  there  are  a  few  other  kinds  of  second-order  equations 
and  methods  of  solving  them  which  are  also  important.  We  shall  discuss  several 
of  these  here,  namely  (a)  equations  with  y  missing;  (b)  equations  with  x  missing; 
(c)  equations  of  the  form  y"  +  f(y)  =  0;  (d)  Euler-Cauchy  equations;  (e)  reduc¬ 
tion  of  order.  For  still  more  methods,  see  Section  9  (Laplace  transforms),  Section 
12  (Green  functions),  Problem  12.14b  (variation  of  parameters),  and  Chapter  12 
(special  functions,  series  solutions,  ladder  operators).  You  can  also  find  computer 
solutions  but,  as  we  have  said,  they  may  not  always  be  in  the  simplest  form  or  the 
form  you  need.  Comparing  hand  solutions  can  show  you  what  to  expect  and  help 
you  make  more  efficient  use  of  computer  solutions. 

To  solve  either  (a)  or  (b),  we  make  the  substitution 

(7.1)  y'  =  P- 


Case  (a):  Dependent  variable  y  missing. 

(7.2)  y'  =  p,  y"  =  p'. 


After  these  substitutions,  an  equation  of  the  type  (a)  is  of  the  first  order  with  p 
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as  the  dependent  variable  and  x  as  the  independent  variable.  First,  we  solve  it  for 
p  as  a  function  of  x\  then  we  put  back  p  =  y'  and  solve  the  resulting  first-order 
equation  for  y. 


Case  (b):  Independent  variable  x  missing. 


(7.3) 


y  =p,  y 


dp 

dx 


dp  dy  dp 
dy  dx  ^  dy 


What  we  are  doing  here  is  to  change  the  independent  variable  from  x  to  y.  Observe 
that  there  is  one  independent  variable  in  an  ordinary  differential  equation.  We  were 
originally  thinking  of  x  as  the  independent  variable  with  y  and  pdy/dx  as  functions 
of  x.  Now  we  think  of  y  as  the  independent  variable  with  p  a  function  of  y;  (7.3) 
is  just  the  chain  rule  (Chapter  4,  Section  5)  for  differentiating  a  function  p{y)  with 
respect  to  x  if  y  is  a  function  of  x.  With  the  substitutions  (7.3),  a  differential  equa¬ 
tion  with  x  missing  becomes  a  first-order  equation  with  p  as  the  dependent  variable 
and  y  as  the  independent  variable. 


Example  1.  In  Section  5,  we  discussed  the  motion  of  a  mass  m  subject  to  a  restoring 
force  —ky  and  a  damping  force  l{dy / di).  Let  us  now  consider  a  similar  problem  but 
with  the  damping  force  proportional  to  the  square  of  the  velocity.  The  differential 
equation  of  motion  is  then  [compare  (5.26)] 

(7  A)  +  t„  =  0  «>0), 


where  the  plus  or  minus  sign  must  be  chosen  correctly  at  each  stage  of  the  motion 
so  that  the  retarding  force  opposes  the  motion.  Let  us  solve  the  following  special 
case  of  this  problem.  Discuss  the  motion  of  a  particle  which  is  released  from  rest  at 
the  point  y  =  1  when  t  =  0,  and  obeys  the  equation  of  motion 

<->  +»-<>• 

This  is  an  example  of  case  (b)  (for  “x  missing”  read  “t  missing,”  that  is,  the  inde¬ 
pendent  variable  missing).  Using  (7.3)  (with  x  replaced  by  t),  we  have 


(7.6) 


dy_ 

dt 


=  P, 


d2y  dp 
dt2  P  dy  ’ 


so  (7.5)  becomes 


(7.7) 


4 p^-  ±  2p2  +  y  =  0 
dy 


or 


dp  ,  i  i_i 


This  is  a  Bernoulli  equation  [compare  (4.1)  with  y  replaced  by  p ,  and  P  and  Q 
functions  of  y}.  We  have  n  =  — 1,  and  the  substitution  (4.2)  is 


(7.8) 


432  Ordinary  Differential  Equations 


Chapter  8 


Then 


dz  dp 
dy  Pdy 


and  (7.7)  becomes 


(7.9) 


dz 

dy 


±z  =  -\y. 


This  is  a  first-order  linear  equation;  solving  it  (see  Section  3) ,  we  get 

3  J  ye±v  dy  =  -\e±v{±y-  1)  +  c, 
f  (±y  -  1)  +  ceTy. 


(7.10) 


ze±v  =  —  ■ 


Since  initially  dy/dt  =  0  and  y  >  0,  we  see  from  (7.5)  that  the  initial  acceleration  is 
in  the  negative  direction;  since  the  particle  starts  from  rest,  its  velocity  for  small  t 
is  also  in  the  negative  direction.  Then  the  damping  force  must  be  in  the  positive 
direction  so  we  must  use  the  lower  sign  in  (7.10)  for  the  first  part  of  the  motion. 
Thus  we  have 


(7.11)  z  =  \{y  +  1)  +  cev  (for  small  t). 

We  determine  c  from  the  initial  conditions  dy/dt  =  0,  y  =  1,  at  t  =  0;  we  have 
z  =  p2  =  {dy/dt)2  =  0  when  y  =  1;  therefore  from  (7.11)  we  get 

0  =  1  +  ce,  c  =  — e_1. 


Then  we  have 


(7.12) 


\{y  +  l)  -  ev  1 


(for  small  t). 


This  is  a  valid  solution  as  long  as  dy/dt  <  0  (this  is  what  small  t  means).  Thus 
the  particle  initially  moves  in  the  negative  direction  for  a  while.  To  continue  the 
problem  we  would  need  to  find  whether  it  stops  and  if  so  where.  This  means  solving 
a  transcendental  equation  which  has  to  be  done  by  some  approximation  method. 
It  turns  out  that  when  it  stops,  y  is  negative;  at  this  point  the  force  —y  is  in  the 
positive  direction  and  the  particle,  after  stopping,  moves  in  the  positive  direction. 
The  solution  for  {dy/dt)2  is  then  given  by  (7.10)  with  the  upper  sign.  After  another 
interval  of  time,  the  particle  again  reverses  its  motion  and  we  again  use  the  solution 
(7.11)  (with  a  different  c),  and  so  on,  the  total  motion  appearing  something  like 
a  damped  vibration.  We  shall  not  continue  the  details  further  here  since  we  have 
already  accomplished  our  purpose  of  illustrating  case  (b)  and  the  solution  of  a 
Bernoulli  equation. 
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Case  (c)  appears  to  be  very  special  and  is  obviously  included  by  (b);  however, 
it  is  very  important  to  know  the  easy  way  to  solve  it  because  it  so  frequently  arises 
in  applications.  The  trick  is  simply  to  multiply  the  equation  by  y'\  we  can  then 
integrate  each  term. 


This  equation  is  separable  and  so  can  be  solved  (except  for  possible  difficulty  in  eval¬ 
uating  the  integrals).  We  say  that  the  problem  is  reduced  to  quadratures  (indicated 
integrations);  this  means  that  we  can  write  the  answer  in  terms  of  integrals  which 
may  or  may  not  be  easy  to  evaluate! 

Example  2.  Consider  a  particle  of  mass  m  moving  along  the  x  axis  under  the  action  of  a 
force  F{x).  Then  the  equation  of  motion  is 


Recall  (Chapter  6,  Section  8)  that  the  potential  energy  of  a  particle  is  the  negative 
of  the  work  done  by  the  force.  Thus 

(7.16)  \mv2  -  j  F(x)  dx 

is  the  kinetic  energy  plus  the  potential  energy;  equation  (7.15)  expresses  the  law 
of  conservation  of  energy  for  this  problem.  This  energy  equation  is  often  of  more 
interest  than  the  equation  of  motion  (a:  as  a  function  of  t)  and  so  it  is  useful  to  be 
able  to  find  it  directly,  as  we  have  done,  without  solving  the  differential  equation 
for  x.  Equation  (7.15)  is  known  as  a  first  integral  of  the  differential  equation  since 
we  have  integrated  a  second-order  equation  once  to  get  it. 
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Case  (d):  An  equation  of  the  form 


2  d  V  ,  dV 


(7.17)  a2x2— +  aix-^  +  a0y  =  f(x) 

(called  an  Euler  or  Cauchy  equation)  can  be  reduced  to  a  linear  equation  with 
constant  coefficients  by  changing  the  independent  variable  from  x  to  z  where 


For  then  we  have  (see  Problem  14  and  also  Chapter  4,  Section  11) 

dy  dy  ,2  d2y  d2y  dv 

(7.19)  x—  =  —  and  x  — w  =  - — . 

ax  dz  dxz  dzz  dz 


Substituting  (7.18)  and  (7.19)  into  (7.17)  gives 

(7.20)  a2^y|  +  («i  -  02)^7  +  a0y  = /(ez). 

This  is  a  linear  equation  with  constant  coefficients  which  can  be  solved  by  the 
methods  of  Sections  5  and  6. 

It  is  worth  noting  that  the  solutions  of  (7.17)  when  f{x)  =  0  are  often  powers 
of  x,  so  a  way  to  solve  this  case  is  to  assume  y  =  xk  and  solve  the  resulting  quadratic 
equation  for  k.  However,  if  the  values  of  k  turn  out  to  be  complex,  or  equal,  or 
if  f(x)  ^  0,  you  may  find  it  easier  to  use  (7.18)  which  reduces  the  problem  to  a 
familiar  one.  (See  Problems  15  to  23.) 


You  can  verify  that  when  you  substitute  (7.22)  into  (7.21),  the  coefficient  of  v(x) 
is  u"  +  f(x)u'  +  g(x)u.  This  expression  is  equal  to  zero  because  we  assumed  that 
u( x)  is  a  solution  of  (7.21).  Then  the  equation  for  v'(x)  is  a  separable  first-order 
equation  (Problem  24). 

Example  3.  Solve  x3y"  +  xy'  —  y  =  0,  given  that  u  =  x  is  a  solution. 

We  let  y  =  uv  =  xv.  Then  y'  =  xv'  +  v,  y"  =  xv"  +  2v',  and  the  differential 
equation  becomes 


x3(xv"  +  2v')  +  x(xv'  +  v)  —  xv  =  0  or 
x4v"  +  (2x3  +  x2)v'  =  0. 
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Separating  variables  and  integrating,  we  find 

dv'  (2  1  \  , 

— -  =  —  (  — ) — tt  )  dx,  In  v  =  —2  In 

iv  \  o'*  I 


x  H - 1-  In  I\ . 

x 


Solving  for  v' ,  integrating  again,  and  writing  y  =  uv  gives 

v'  =  v  =  —Kel/X,  y  =  —Kxel^x 


Thus  the  general  solution  of  the  given  equation  is  y  =  Ax  +  Bxe 1/x 


PROBLEMS,  SECTION  7 

Solve  the  following  differential  equations  by  method  (a)  or  (b)  above. 

1.  y"+yy '  =  0.  Find  a  solution  satisfying  each  of  the  following  sets  of  initial  conditions. 
If  your  computer  says  there  is  no  such  solution,  don’t  believe  it  -do  it  by  hand. 

(a)  1/(0)  =  5,  y'( 0)  =  0  (b)  2/(0)  =  2,  y'( 0)  = -2 

(c)  y(0)  =  1,  2/'(0)  =  — 1  (d)  2/(0)  =  0,  2/(0)  =  2 

2.  y"  +  2xy'  =  0  Hint:  The  solution  is  y  =  ci  erf  x  +  C2;  see  Chapter  11,  Section  9 
for  the  definition  of  erf  x. 

3.  2yy  =  y  4.  xy  =  y  +  y 

5.  The  differential  equation  of  a  hanging  chain  supported  at  its  ends  is 

y  =k  [l+y  J  . 

Solve  the  equation  to  find  the  shape  of  the  chain. 

6.  The  curvature  of  a  curve  in  the  ( x ,  y)  plane  is 


With  K  =  const.,  solve  this  differential  equation  to  show  that  curves  of  constant 
curvature  are  circles  (or  straight  lines). 

7.  Solve  y"  +  uj2y  =  0  by  method  (c)  above  and  compare  with  the  solution  as  a  linear 
equation  with  constant  coefficients. 

8.  The  force  of  gravitational  attraction  on  a  mass  m  at  distance  r  from  the  center  of 
the  earth  (r  >  radius  R  of  the  earth)  is  mgR2 /r2.  Then  the  differential  equation  of 
motion  of  a  mass  m  projected  radially  outward  from  the  surface  of  the  earth,  with 
initial  velocity  Vo,  is 

md2r/dt2  =  —mgR2/r2. 

Use  method  (c)  above  to  find  v  as  a  function  of  r  if  v  =  vo  initially  (that  is,  when 
r  =  R).  Find  the  maximum  value  of  r  for  a  given  «o,  that  is,  the  value  of  r  when 
v  =  0.  Find  the  escape  velocity,  that  is,  the  smallest  value  of  i>o  for  which  r  can  tend 
to  infinity. 

9.  Show  that  (7.15)  is  a  separable  equation.  [You  may  find  it  helpful  to  write  f  F(x)dx  = 
f{x).]  Thus  solve  (7.14)  in  terms  of  quadratures  (that  is,  indicated  integrations)  as 
in  Problem  2. 
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In  Problems  10  and  11,  solve  (7.14)  to  find  v(x)  and  then  x(t)  for  the  given  F(x)  and 
initial  conditions. 

10.  F[x)  =  m/x3,  v  ==  0,  x  =  1,  at  t  =  0. 

11.  F(x)  =  — 2m/x 5,  v  =  —1,  x  =  1,  at  t  =?0. 

12.  In  Problem  11,  find  v(x)  if  v  =  0,  x  =  1,  at  t  =  0.  Then  write  an  integral  for  t(x). 

13.  The  exact  equation  of  motion  of  a  simple  pendulum  is  d20/dt 2  =  — w2sin6l  where 

ui2  =  g/l.  By  method  (c)  above,  integrate  this  equation  once  to  find  d6/dt  if  d9/dt  = 
0  when  6  =  90°.  Write  a  formula  for  t(6 )  as  an  integral.  See  Problem  5.34. 

14.  Verify  (7.19)  and  (7.20).  Hint:  dy/dz  =  (dy / dx)(dx / dz)\  write  the  first  equation  of 

(7.19)  as  xDx  =  Dz ,  and  find  D2. 

15.  If  you  solve  (7.17)  when  f(x)  =  0  by  assuming  a  solution  y  =  xk,  show  that  the 
quadratic  equation  for  k  is  the  same  as  the  auxiliary  equation  for  the  z  equation 

(7.20) .  Thus  show  (see  Section  5)  that  if  the  two  values  of  k  are  equal,  the  second 
solution  is  not  a  power  of  x  but  is  xk\nx.  Also  show  that  if  k  is  complex,  say 
k  =  aFbi,  the  solutions  are  xa  cos(61na;)  and  xa  sin(61na;)  or  other  equivalent  forms 
[see  (5.16)  to  (5.18)]. 

16.  Solve  the  following  equations  either  by  method  (d)  above  or  by  assuming  y  =  xk  (or 
try  both  methods  to  compare  them).  See  Problem  15. 

(a)  a 2  y"  +  3 xy  —  3y  =  0  (b)  x2y"  +  xy  —  Ay  =  0 

(c)  x2 y"  +  7 xy'  +  9y  =  0  (d)  x2y"  —  xy'  +  6y  =  0 

Solve  the  following  equations  using  method  (d)  above. 


17. 

x2y"  +  xy1  —  16  y  = 

B 

oo 

18. 

2  ll.l  -1 

x  y  +  xy  —  y  =  x  —  x 

19. 

x2y"  —  5 xy’  +  9 y  - 

=  2x3 

20. 

x2y"  —  3 xy1  +  4 y  =  6x2  In  x 

21. 

x  y  +  y  =  ix 

22. 

x2y"  +  xy  +  y  =  2x 

23.  Solve  the  two  differential  equations  in  Problem  5.11  of  Chapter  13. 

24.  Substitute  (7.22)  into  (7.21)  to  obtain  the  equation  for  v'(x).  Show  that  this  equation 
is  separable. 

For  the  following  problems,  verify  the  given  solution  and  then,  by  method  (e)  above,  find 
a  second  solution  of  the  given  equation. 

25.  x2(2  —  x)y"  +  2xy'  —  2y  =  0,  u  =  x 

26.  (x2  +  1  )y"  —  2xy'  +  2y  =  0,  u  —  x 

27.  xy"  -  2(x  +  l)y'  +  (x  +  2)y  =  0,  u  =  ex 

28.  3 xy"  -  2(3x  -  1  )y'  +  (3*  -  2)y  =  0,  u  =  ex 

29.  x2y"  +  (x  +  1  )y'  —  y  =  0,  u  =  x  +  1 
x(x  +  1  )y"  —  (x  —  1  )y'  +  y  =  0,  u  =  x  —  1 


30. 
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8.  THE  LAPLACE  TRANSFORM 

As  you  will  see  in  Section  9,  Laplace  transforms  are  useful  in  solving  differential 
equations  (for  other  uses  see  end  of  Section  9,  page  442).  Here  we  want  to  define  the 
Laplace  transform  and  obtain  some  needed  formulas.  We  define  L(/),  the  Laplace 
transform  of  /(f)  [also  written  F(p)  since  it  is  a  function  of  p],  by  the  equation 


(8.1) 


/»00 

L(f)=  f{t)e~pt  dt  =  F(p). 

Jo 


This  is  an  example  of  an  integral  transform  (also  see  Fourier  transforms,  Chapter  7, 
Section  12,  and  Hilbert  transforms,  Chapter  14,  page  698).  If  we  start  with  a 
function  /(f),  multiply  by  a  function  of  t  and  p,  and  find  a  definite  integral  with 
respect  to  t,  we  have  a  function  F(p)  which  is  called  an  integral  transform  of  /(f). 
There  are  many  named  integral  transforms  which  you  may  discover  in  tables  and 
computer.  Observe  the  notation  for  Laplace  transforms  in  (8.1):  we  shall  consis¬ 
tently  use  a  small  letter  for  the  function  of  f,  and  the  corresponding  capital  letter 
for  the  transform  which  is  a  function  of  p,  for  example  /(f)  and  F{p),  or  g{t)  and 
G(p),  etc.  Also  note  from  (8.1)  that  since  we  integrate  from  0  to  oo,  F(p)  is  the 
same  no  matter  how  /(f)  is  defined  for  negative  t.  However,  it  is  desirable  to  define 
/(f)  =  0  for  t  <  0  (see  footnote,  page  447;  also  see  Bromwich  integral,  page  696). 

It  is  very  convenient  to  have  a  table  of  corresponding  /(f)  and  F{p)  when  we 
are  using  Laplace  transforms  to  solve  problems.  Let  us  calculate  some  of  the  entries 
in  the  table  of  Laplace  transforms  at  the  end  of  the  chapter  (pages  469  to  471). 
Note  that  numbers  preceded  by  L  (LI,  L2,  •  •  • ,  L35)  refer  to  entries  in  the  Laplace 
transform  table. 

Example  1.  To  obtain  LI  in  the  table,  we  substitute  /(f)  =  1  into  (8.1)  and  find 

r°°  1 

(8.2)  F{p)  =  1  •  e~pt  dt  =  —  e~pt 

Jo  P 

We  have  assumed  p  >  0  to  make  e~pt  zero  at  the  upper  limit;  if  p  is  complex, 
as  it  may  be,  then  the  real  part  of  p  must  be  positive  (Rep  >  0),  and  this  is  the 
restriction  we  have  stated  in  the  table  for  LI. 


1 

1 

P 


p  >  0. 


Example  2.  For  L2,  we  have 

fit)  =  e~at, 

(8  3)  r°°  i 

F(p)  =  /  e~{a+p)tdt  = - ,  Re(p  +  a)  >  0. 

Jo  P  +  a 

We  could  continue  in  this  way  to  obtain  the  function  F(p)  corresponding  to 
each  /(f)  by  using  (8.1)  and  evaluating  the  integral.  However,  there  are  some  easier 
methods  which  we  now  illustrate.  First  observe  that  the  Laplace  transform  of  a 
sum  of  two  functions  is  the  sum  of  their  Laplace  transforms;  also  the  transform  of 
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cf(t)  is  cL(f)  when  c  is  a  constant: 

pOO 

+  git)]  =  /  [f{t)  +  g(t)\e-pt  dt 

Jo 

poo  poo 

(8.4)  =  /  f[t)e~ptdt  +  /  g{t)e~pt  dt  =  L(f)  +  L(g), 

Jo  Jo 

poo  poo 

L[cf(t)]  =  cf{t)e~pt  dt  =  c  f{t)e~pt  dt  =  cL(f). 

J  o  J  o 

In  mathematical  language,  we  say  that  the  Laplace  transform  is  linear  (or  is  a  linear 
operator — see  Chapter  3,  Section  7). 

Example  3.  Now  let  us  verify  L3.  In  (8.3),  replace  the  a  by  —fa;  then  we  have 


f{t)  =  elat  =  cos  at  +  i  sin  at , 


(8.5) 


F[p)  = 


1 


p  +  ia  .  ,  . 

15 - - ,  Re(p  -  ta)  >  0. 

-U  n z 


p  —  ia  pz  +  a 
Remembering  (8.4),  we  can  write  (8.5)  as 

(8.6)  L[ cos  at  +  i  sin  at)  =  L[ cos  at)  +  iL[ sin  at)  = 

Similarly,  replacing  a  by  ia  in  (8.3),  we  get 


(8.7)  Li  cos  at  —  i  sin  at)  = 


P 


p 2  +  o2  p2  +  a2  ’ 

Adding  (8.6)  and  (8.7),  we  get  L4;  by  subtracting,  we  get  L3. 


p2  +  a2  p2  +  a2 


R .e(p  +  ia)  >  0. 


Example  4.  To  verify  L 11,  start  with  L4,  namely 

pOO 

(8.8)  L(cosaf)  =  /  e~pt  cos  at  dt  = 

Jo 


P 


0  P  +  a 

Differentiate  (8.8)  with  respect  to  the  parameter  a  to  get 

p(-2a) 

ip2  +  a2)2 


2 ' 


pOO 

/  e~pti~ f  sin  at)  dt  = 

Jo 


or 


e  pt  t  sin  at  dt  = 


2pa 


jo  ip2  +  a2)2 

which  is  Lll.  Ways  of  finding  other  entries  in  the  table  are  outlined  in  the  problems. 


PROBLEMS,  SECTION  8 

1.  For  integral  k,  verify  L5  and  L6  in  the  Laplace  transform  table.  Hint:  From  L2,  you 
can  write:  ff°  e~pte~at  dt  =  1  /ip  +  a).  Differentiate  this  equation  repeatedly  with 
respect  to  p.  (See  Chapter  4,  Section  12,  Example  4,  page  235.)  Also  note  L32.  For 
the  r  function  results  in  L5  and  L6,  see  Chapter  11,  Problem  5.7. 

2.  By  using  L'2,  verify  L7  and  L8  in  the  Laplace  transform  table. 
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3.  Using  either  L2,  or  L3  and  L4,  verify  L9  and  L10. 

4.  By  differentiating  the  appropriate  formula  with  respect  to  a,  verify  L12. 


5.  By  integrating  the  appropriate  formula  with  respect  to  a,  verify  L19. 


6.  By  replacing  a  in  L2  by  a  +  ib  and  then  by  a  —  ib,  and  adding  and  subtracting  the 
results  [as  in  (8.6)  and  (8.7)],  verify  L13  and  L14. 

7.  Verify  L15  to  L18,  by  combining  appropriate  preceding  formulas  using  (8.4). 


Find  the  inverse  transforms  of  the  functions  F(p)  in  Problems  8  to  13. 


8. 

9. 

10. 


11. 


14. 


^  ^  Hint:  Use  L6  and  L18. 

(P+  2)2 

„5  -  2p  Hint:  Use  L7  and  L8. 

p2  +p-  2 

2p  —  1  Hint:  You  can  use  L7  and  L8  with  complex  a 

p2  —  2p  +  10  and  6,  but  LI 3  and  L14  are  more  direct. 

3p  +  2  3p+ 10  6 -p 

3 p2  +  5p  —  2  p2  —  25  p2  +  4p  +  20 

Show  that  a  combination  of  entries  L3  to  L10,  L13,  L14  and  L18  in  the  table,  will 
give  the  inverse  transform  of  any  function  of  the  form 


Ap  +  B 

Cp2  +  Ep  +  F ’ 


where  A,  B,  C,  E,  and  F  are  constants. 


15.  Prove  L32  for  n  =  1.  Hint:  Differentiate  equation  (8.1)  with  respect  to  p. 

16.  Use  L32  and  L3  to  obtain  Lll. 

17.  Use  L32  and  Lll  to  obtain  L(f2  sin  at). 

18.  Use  L31  to  derive  L21. 


Table  entries  L28  and  L29  are  known  as  translation  or  shifting  theorems.  Do  Problems 
19  to  27  about  them. 


19.  Prove  the  general  formula  L29  using  (8.1). 

20.  Use  L29  to  verify  L6,  L13,  L14,  and  L18. 

21.  Use  L29  and  Lll  to  obtain  L(te~at  sinfet)  which  is  not  in  the  table. 

22.  Obtain  L(te~at  cos  bt)  as  in  Problem  21. 

23.  Use  the  results  which  you  have  obtained  in  Problems  21  and  22  to  find  the  inverse 
transform  of  (p2  +  2p  —  1)/ (p2  +  Ap  +  5)2. 

24.  Sketch  on  the  same  axes  graphs  of  sint,  sin(t  —  7r/2),  and  sin(t  +  n/2),  and  observe 
which  way  the  graph  shifts.  Hint:  You  can,  of  course,  have  your  calculator  or 
computer  plot  these  for  you,  but  it’s  simpler  and  much  more  useful  to  do  it  in  your 
head.  Hint:  What  values  of  t  make  the  sines  equal  to  zero?  For  an  even  simpler 
example,  sketch  on  the  same  axes  y  =  t,  y  =  t  —  7r/2,  y  =  t  +  n /2. 

25.  Use  L28  to  find  the  Laplace  transform  of 


fit) 


sin(t  —  7r/2),  t  >  7r/2, 
0,  t  <  7r/2. 


26. 

27. 


Use  L28  and  L4  to  find  the  inverse  transform  of  pe  pn /(p2  +  1). 
Find  the  transform  of 


f{t)  = 


sin  (x  —  vt),  t>x/v, 
0,  t  <  x/v , 


where  x  and  v  are  constants. 
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9.  SOLUTION  OF  DIFFERENTIAL  EQUATIONS  BY  LAPLACE  TRANSFORMS 

We  are  going  to  discuss  the  solution  of  linear  differential  equations  with  constant 
coefficients  (see  Sections  5  and  6).  Laplace  transforms  can  reduce  such  an  equation 
to  an  algebraic  equation  and  so  simplify  solving  it.  Also,  since  Laplace  transforms 
automatically  use  given  values  of  initial  conditions,  we  find  immediately  a  desired 
particular  solution  without  the  extra  step  of  determining  constants  to  satisfy  the 
initial  conditions.  Discontinuous  forcing  functions  are  messy  to  deal  with  by  Section 
6  methods;  the  Laplace  transform  method  handles  them  easily. 

We  are  going  to  take  Laplace  transforms  of  the  terms  in  differential  equations; 
to  do  this  we  need  to  know  the  transforms  of  derivatives  y'  =  dy/dt,y"  =  d2y/dt2, 
etc.  To  find  L(y'),  we  use  the  definition  (8.1)  and  integrate  by  parts,  as  follows 

nOO  OO  «00 

(9.1)  L(y')  =  /  y'(t)e~ptdt  =  e~pty(t)  -  (-p)  /  y(t)e~ptdt 

Jo  o  Jo 

=  -2/(0)  +  PL(y)  =  PY  -  2/o 

where  for  simplicity  we  have  written  L(y)  =  Y  and  j/(0)  =  yo-  To  find  L{y"),  we 
think  of  y"  as  (y')' ,  and  substitute  y'  for  y  in  (9.1)  to  get 

L(y")=pL(yl)-y'(0). 

Using  (9.1)  again  to  eliminate  L(y'),  we  finally  have 

(9.2)  L(y")  =  p2  L(y)  -  py{ 0)  -  y'(0)  =  p2Y  -  py0  -  y'Q. 

Continuing  this  process,  we  obtain  the  transforms  of  the  higher-order  derivatives 
(Problem  1  and  L35). 

We  are  now  ready  to  solve  differential  equations.  We  illustrate  the  method  by 
some  examples. 


Example  1.  Solve  y"  +  4 y'  +  Ay  =  t2e~2t  with  initial  conditions  yo  =  0,  y'0  =  0. 

We  take  the  Laplace  transform  of  each  term  in  the  equation,  using  L35  and  L6 
in  the  table  of  Laplace  transforms.  We  get 

p2Y  -  py0  -y'0  +  4 pY  -  4 y0  +  4 Y  =  L{t2e~2t)  =  (p  +  2^3- 

But  the  initial  conditions  are  2/o  =  2/6  =  0.  Thus  we  have 


( p 2  +  4 p  +  4  )Y  = 


(P  +  2)3 


or  Y  = 


(P  +  2)5 ' 


Now  we  want  y ,  which  is  the  inverse  Laplace  transform  of  Y .  We  look  in  the  table 
for  the  inverse  transform  of  2/(p+  2)5.  By  L6,  we  get 


2/  = 


2t4e~2t 

4! 


t4e~2t 

12 


This  is  much  simpler  than  the  general  solution;  we  have  obtained  just  the  solution 
satisfying  the  given  initial  conditions. 
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Example  2.  Solve  y"  +  4 y  =  sin  2 1,  subject  to  the  initial  conditions  r/o  =  10,  y'0  =  0. 

Using  the  table,  take  the  Laplace  transform  of  each  term  of  the  equation  to  get 

p2Y  -  py0  —  y'0  +  4Y  =  L( sin  2t)  =  2 

pz  +  4 

Then  we  substitute  the  initial  conditions  and  solve  for  Y  as  follows: 

(p2  +  4)y  _  iop  = 

y  =  2  . 

p2  +  4  ( p 2  +  4)2 

Finally,  taking  the  inverse  transform  using  L4  and  L17,  we  have  the  desired  solution: 
y  =  10cos2t  +  |(sin2t  —  2tcos2t)  =  10cos2t  +  |  sin  2t  —  Lfcos2f. 


Example  3.  Solve  y"  +  4 y'  +  13 y  =  20e  4,  yo  =  1 ,  y'0  =  3. 

We  take  the  transform  of  each  term  and  solve  for  Y  as  follows: 


p2Y  —  p  —  3  +  4 pY  —  4  +  13U  =  - , 

p+  1 

Y  =  1  (  20  7\  =  p2  +  8p+  27 

p2  +  4p+13\p+l  P  )  (p+  l)(p2  +  4p+ 13)’ 

Since  this  Y  is  not  in  our  table,  we  can  either  use  a  larger  table,  or  use  partial 
fractions  to  split  Y  into  fractions  which  are  in  our  table  (which  you  can  do  by 
computer)  or  find  the  inverse  transform  by  computer.  We  find: 

Y  _  2  —p  +  1  _  2 _ 3 _ p  +  2 

p  + 1  p2  +  4p  +  13  p+1  (p+2)2  +  9  (p+2)2  +  9 

and  by  L2,  L13,  and  L14, 


y  =  2e  4  +  e  24  sin  3 1  —  e  24  cos  3t. 

Sets  of  simultaneous  differential  equations  can  also  be  solved  by  using  Laplace 
transforms.  Here  is  an  example. 

Example  4.  Solve  the  set  of  equations 


y'  -2y  +  z  =  0, 
z'  —  y  —  2z  =  0, 

subject  to  the  initial  conditions  yo  =  1,  Zo  =  0. 

We  shall  call  L(z)  =  Z  and  L(y)  =  Y  as  before.  We  take  the  Laplace  transform 
of  each  of  the  equations  to  get 


pY  -  y0  -  2U  +  Z  =  0, 
pZ  —  zq  —  Y  —  2  Z  =  0. 
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After  substituting  the  initial  conditions  and  collecting  terms,  we  have 

(p-2)Y  +  Z  =  l, 

Y-(p-2)Z  =  0. 


We  solve  this  set  of  algebraic  equations  simultaneously  for  Y  and  Z  (by  any  of  the 
methods  usually  used  for  a  pair  of  simultaneous  equations — elimination,  determi¬ 
nants,  etc.).  For  example,  we  may  multiply  the  first  equation  by  {p  —  2)  and  add 
the  second  to  get 

[Cp-2)2  +  1]F  =  p-2  or  Y  =  (p  ^  +  Y 

We  find  y  by  looking  up  the  inverse  transform  of  Y  using  L14.  We  get 

y  =  e2*  cos  t. 


Similarly,  solving  for  Z  and  looking  up  the  inverse  transform,  we  find 

1 


Z  = 


(p- 2)2  +  1’ 


2  =  e2t  sin  t 


.  Alternatively,  we  could  find  2  from  the  first  differential  equation  by  substituting 
the  y  solution: 


z  =  2y  —  y'  =  2e2t  cos  t  +  e2t  sin  t  —  2e2t  cos  t  =  e2t  sin  t. 

Solving  linear  differential  equations  with  constant  coefficients  is  not  the  only  use 
of  Laplace  transforms.  As  you  will  see  in  Chapter  13,  Section  10,  we  may  solve  some 
kinds  of  partial  differential  equations  by  Laplace  transforms.  Also  a  table  of  Laplace 
transforms  can  be  used  to  evaluate  definite  integrals  of  the  type  /0°°  e~ptf(t)dt. 


Example  5.  By  L 15  with  a  =  3  and  p  =  2,  we  have 


e  2t(l  — cos3 t)dt 


32 

2(22  +  32) 


_9_ 

26' 


Actually,  there  is  more  to  the  subject  than  this.  Although  we  are  discussing 
in  this  chapter  the  use  of  Laplace  transforms  as  a  tool,  they  also  can  play  a  more 
theoretical  role  in  applied  problems.  It  is  often  possible  to  find  desired  information 
about  a  problem  directly  from  the  Laplace  transform  of  the  solution  without  ever 
finding  the  solution.  Thus  the  use  of  Laplace  transforms  may  lead  to  a  better 
understanding  of  a  problem  or  a  simpler  method  of  solution.  (Compare  the  use  of 
matrices,  for  example,  or  the  use  of  Fourier  transforms.) 


PROBLEMS,  SECTION  9 


1.  Continuing  the  method  used  in  deriving  (9.1)  and  (9.2),  verify  the  Laplace  trans¬ 
forms  of  higher-order  derivatives  of  y  given  in  the  table  (L35). 

By  using  Laplace  transforms,  solve  the  following  differential  equations  subject  to  the  given 
initial  conditions. 

2.  y'-y  =  2e\ 


3/0  =  3 
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3. 

y"  +  4t/  +  Ay  =  e_2t, 

yo  =  0,  ye,  =  4 

4. 

y"  +y  =  sin  1, 

yo  =  1,  2/o  =  o 

5. 

y"  +  y  =  sin  1, 

2/o  =  0,  I/O  =  -i 

6. 

y"  -  6 y'  +  9 y  =  le3t, 

2/o  =  0,  ?/o  =  5 

7. 

y"  -  V  +  4y  =  4, 

2/o  =  0,  2/0  =  -2 

8. 

y"  +  16  y  =  8  cos  4  1, 

2/o  =  2/o  =  0 

9. 

y"  +  16  y  =  8  cos  41, 

00 

II 

O 

II 

o 

10. 

y"  -4y'  +  Ay  =  6e2t, 

2/o  =  2/o  =  0 

11. 

y"  —  4y  =  4e2i , 

2/o  =  0,  2/0  =  1 

12. 

y"~y  =  e-t-  2te-*, 

2/o  =  1,  2/o  =  2 

13. 

y"  +  y  =  5sinh21, 

2/o  =  0,  2/0  =  2 

14. 

<N 

1 

II 

TP 

1 

2/o  =  0,  2/o  =  1 

15. 

y"  +  9  y  —  cos  31, 

2/o  =  0,  2/0  =  6 

16. 

y"  +  9  y  —  cos  31, 

2/o  =  2,  2/0  =  0 

17. 

y"  +  5  y'  +  6  y  =  12, 

?/o  =  2,  2/0  =  0 

18. 

y"  -  4j/  =  3e-*, 

2/o  =  1,  2/o  =  -3 

19. 

y"  +  y'  —  5y  =  e2t, 

2/o  =  1,  2/o  =  2 

20. 

y"  -  8 y'  +  16 y  =  321, 

2/o  =  1,  2/o  =  2 

21. 

y"  +  4y'+5y  =  26e3‘, 

2/o  =  1,  2/o  =  5 

22. 

y" +2y' +5y  =  10  cos  1, 

2/o  =  2,  2/o  =  1 

23. 

y” +2y' +5y  =  lOcosl, 

CO 

II 

cT 

II 

o 

24. 

y"  -  2y’  +  y  =  2cosl, 

2/o  =  5,  2/0  =  -2 

25. 

y"  +  Ay'  +  5 y  =  2e"2t  < 

:osl,  2/o  =  0,  2/0  =  3 

26. 

y"  +  2  y'  +  10  y  =  —6e~ 

4  sin  31,  2/o  -  0,  2/6  —4 

Solve  the  following  sets  of  equations  by  the  Laplace  transform  method 

27. 

+ 

1 

CO 

II 

o 

2/o  =  2/o  =  0 

y"  +  z'  =  0 

zo  =  I 

28. 

y'  +  z  —  2  cos  1 

2/o  =  -1 

z'  -y  =  1 

Zo  =  1 

29. 

y'  +  z‘  -  2y  =  1 

2/o  =  zo  =  1 

z-y1  =  t 

30. 

y1  +  2z  =  1 

2/o  =  0 

2y  —  z'  =  21 

zo  =  1 

31. 

r/"  +  z"  -  z'  =  0 

2/o  =  0,  2/o  =  l 

y'  +  z'  —  2«  =  1  —  e4 

Z0  =  1,  Zq  =  1 

32. 

z'  +  2y  =  0 

?/o  =  zo  =  0 

y'  -2z  =  2 
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33.  y'  —  z'  —  y  =  cos  t 


Vo  =  -i 
zo  =  0 


y'  +  y  —  2z  =  o 

Evaluate  each  of  the  following  definite  integrals  by  using  the  Laplace  transform  table. 


34. 


e  t  sin 3 tdt  =  A.  Hint:  In  (8.1),  let  p  =  2,  fit)  =  sin3t;  use  L3  with  a  =  3. 


37. 


39. 


te  1  sin  5 1  dt 

36. 

e~3t  sin  2t  , 

/  - dt 

Jo  t 

r  oo 

,5  -2 1  i, 

t  e  dt 

38. 

/  e_t(l  — cos2 1)  dt 

Jo 

—  t  -2 1 

roc  —2 1  —2  et 

e  e  dt 

40. 

e  e  dt 

41.  /  -  e-2t  sin(t\/2 )  dt 

.In  t 


42. 


—  e  tv^  sin  2t  cos  t  dt 
t 


10.  CONVOLUTION 

In  solving  differential  equations  by  Laplace  transforms  in  Section  9,  we  found  Y 
and  then  found  the  inverse  transform  y  either  in  a  table  or  by  computer.  We  had 
no  way  of  writing  a  formula  for  y.  We  now  want  to  consider  another  way  of  finding 
inverse  transforms.  (Also  see  Bromwich  integral,  Chapter  14,  page  696.) 

Let  us  first  see  why  the  method  we  are  going  to  discuss  in  this  section  is  useful. 
Consider  differential  equations  of  the  kind  discussed  in  Sections  5  and  6,  namely 
linear  second-order  equations  with  constant  coefficients.  Recall  that  such  equations 
describe  the  vibrations  or  oscillations  of  either  a  mechanical  or  an  electrical  system. 
If  the  right-hand  side  of  the  equation  is  a  function  of  t,  called  the  forcing  function, 
then  the  differential  equation  describes  forced  vibrations. 


Example  1.  Let  us  solve  the  following  representative  equation  by  Laplace  transforms, 
assuming  that  the  system  is  initially  at  rest  and  that  the  force  /(f)  starts  being 
applied  at  t  =  0. 

(10.1)  Ay"  +  By'  +  Cy  =  f(t),  y0  =  y'0  =  0. 

We  take  the  Laplace  transform  of  each  term,  substitute  the  initial  conditions,  and 
solve  for  Y  as  follows: 

(10.2)  Ap2Y  +  BpY  +  CY  =  L(f)  =  F(p),  Y  =  + 1  +  g  F(p). 

Note  that  Y  is  a  product  of  two  functions  of  p.  We  know  the  inverse  transform 
of  F(p),  namely  f(t).  The  factor  T(p)  =  ( Ap 2  +  Bp  +  C)_1  (called  the  transfer 
function)  can  always  be  written  as 


A(p+  a)(p  +  b) 

by  factoring  the  quadratic  expression  in  the  denominator.  Hence  by  L7  (or  L6 
if  a  =  b)  we  can  find  the  inverse  transform  of  T{p)  for  any  problem.  Then  y 
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[the  inverse  transform  of  Y  in  (10.2)]  is  the  inverse  transform  of  a  product  of  two 
functions  whose  inverse  transforms  we  know.  We  are  going  to  show  how  to  write  y 
as  an  integral  (that  is,  we  are  going  to  verify  L34  in  the  table). 

Let  G(p)  and  H(p)  be  the  transforms  of  g(t)  and  h(t).  We  want  the  inverse 
transform  of  the  product  G(p)H(p).  By  the  definition  (8.1) 


/»oo  /*oo 

(10.3)  G(p)H(p)  =  /  e~ptg(t)dt ■  /  e~pth(t)dt. 

Jo  Jo 

Let  us  rewrite  (10.3)  replacing  t  by  different  dummy  variables  of  integration  so  that 
we  can  write  the  product  of  the  two  integrals  as  a  double  integral.  We  then  have 


(10.4) 


0~P<7 


G(p)H(p)  =  I  e 
Jo 

nOO  nOO 

Jo  Jo 


g(a)  da  ■  e  pTh(r)  dr 

Jo 

Yp(a+T) g(a)h(T)  da  dr. 


Now  we  make  a  change  of  variables;  in  the  cr  integral  (that  is,  with  r  fixed),  let 
a  +  r  =  t.  Then  a  =  t  —  t,  da  =  dt,  and  the  range  of  integration  with  respect  to  t  is 
from  t  =  t  (corresponding  to  a  =  0)  to  t  =  oo  (corresponding  to  a  =  oo).  Making 
these  substitutions  into  (10.4),  we  get 


(10.5) 


G(P)H(p)  = 


e  ptg{t  —  r)h(r )  dt  dr. 


/  T—0  Jt  —  T 


Next  we  want  to  change  the  order  of 
integration.  From  Figure  10.1,  we  see 
that  the  double  integral  in  (10.5)  is  over 
the  triangle  in  the  first  quadrant  below 
the  line  t  =  r.  The  t  integral  ranges 
from  the  line  t  =  r  to  t  =  oo  (indi¬ 
cated  by  a  horizontal  strip  of  width  dr 
from  t  =  t  to  oo)  and  then  the  r  inte¬ 
gral  sums  over  the  horizontal  strips  from 
r  =  0  to  r  =  oo  covering  the  whole 
infinite  triangle.  Let  us  integrate  with 
respect  to  t  first;  r  then  ranges  from  0 
to  the  line  r  =  t  [indicated  by  a  vertical 
strip  in  Figure  10.1]  and  then  the  t  integral  sums  over  the  vertical  strips  from  t  =  0 
to  oo.  Making  this  change  in  (10.5),  we  get 


The  last  step  follows  from  the  definition  (8.1)  of  a  Laplace  transform. 
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Definition  of  Convolution  The  integral 

(10.7)  f  g(t  —  T)h(r)  dr  =  g  *  h 

Jo 

is  called  the  convolution  of  g  and  h  (or  the  resultant  or  the  Faltung).  Note  the 
abbreviation  g  *  h  for  the  convolution  integral,  and  do  not  confuse  the  symbol  *, 
written  on  the  line,  with  a  star  used  as  a  superscript  meaning  complex  conjugate. 
It  is  easy  to  show  (Problem  1)  that  g  *  h  =  h  *  g\  this  result  and  (10.6)  and  (10.7) 
give  L34  in  the  table. 

Now  let’s  see  how  to  use  (10.6)  or  L34  to  solve  the  kind  of  problem  indicated  in 
(10.1)  and  (10.2). 

Example  2.  Solve  y"  +  3y'  +  2y  =  e  *,  y0  =  y'Q  =  0. 

Taking  the  Laplace  transform  of  each  term,  substituting  the  initial  conditions, 
and  solving  for  Y,  we  get 


p2Y  +  3pY  +  2Y  =  Lie-*), 
1 


Y  = 


p2  +  3p  + 2 


L{e-*). 


Since  we  are  intending  to  use  the  convolution  integral,  we  do  not  bother  to  look  up 
the  transform  of  e~* .  We  do  want,  however,  the  inverse  transform  of  l/(p2  +  3p+2); 
by  L7,  this  is  e~ *  —  e~ 2t,  so  we  have 


Y  =  Lie-*-e-2*)Lie-*)  =  Gip)Hip), 


with  g)t)  =  e~ *  —  e~2t  and  h(t)  =  e~* .  We  now  use  L34  to  find  y.  Observe  from 
L34  that  we  may  use  either  git  —  r)/i(r)  or  </(r)/i(t  —  r)  in  the  integral.  It  is  well 
to  choose  whichever  form  is  easier  to  integrate;  usually  it  is  best  to  put  (t  —  r)  in 
the  simpler  function  [here  h{t)].  Then  we  have 


y=  0(r)/i(f  -  t)  dr 
Jo 


-  e-2T)ie~(*-T)  dr 


/v. 

Jo 


r)dr  =  e  *(r  +  e  T) 


t 

o 


=  e  *it  +  e  * —  l )=te  *  +  e  2*  —  e  *. 


It  is  not  always  as  easy  to  evaluate  the  convolution  integral  as  it  was  in  this 
example.  However,  let  us  observe  that,  at  the  very  worst,  we  can  always  write  the 
solution  to  a  forced  vibrations  problem  [equation  (10.1)]  as  an  integral  (which  can, 
if  necessary,  be  evaluated  numerically).  This  is  true  because,  as  we  showed  just 
after  (10.2),  we  can  always  find  the  inverse  transform  of  the  transfer  function  Tip), 
and  so  have  Y  as  a  product  of  two  functions  whose  inverse  transforms  we  know. 
Then  y  is  given  by  the  convolution  (10.7)  of  the  forcing  function  fit)  and  the  inverse 
transform  of  the  the  transfer  function.  Also  note  (Problem  16)  that  a  combination 
of  L6,  L7,  L8  and  L18  will  handle  any  terms  arising  in  a  problem  with  nonzero 
initial  conditions. 
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Fourier  Transform  of  a  Convolution  We  have  shown  that  the  Laplace  trans¬ 
form  of  the  convolution  of  two  functions  is  the  product  of  their  Laplace  transforms. 
There  is  a  similar  theorem  for  Fourier  transforms;  let  us  see  what  it  says.  Let  51(a) 
and  52(a)  be  the  Fourier  transforms  of  fi(x)  and  f2(x).  By  analogy  with  equa¬ 
tions  (10.3),  (10.4),  (10.5),  and  (10.6),  we  might  expect  the  product  51(a)  •  52(a) 
to  be  the  Fourier  transform  of  something;  let’s  investigate  this  idea.  Assuming  that 
IZ 1  h  ( x)f2(x)\dx  is  finite,  then  by  the  definition  of  a  Fourier  transform  [Chapter  7, 
equation  (12.2)],  we  have 

(10.8)  51(a)  •  52(a)  =  Z  f  fi{v)e~iav  dv  ■  Z  [  f2(u)e~l(XU  du 

(1  \  2  poo  poo 

^ )  J  J  e-ia^h{v)h{u)dvdu. 


[We  have  used  different  dummy  integration  variables  as  in  (10.4).]  Next  we  make 
the  change  of  variables  x  =  v  +  u,  dx  =  dv,  in  the  v  integral,  to  get 


(10.9) 


gi{a)g2(a)  = 


1 

27 r 
1 

27T 


poo  poo 


e  ZOiX  fi  ( x  —  u)  /2  ( u )  dx  du 


J  — OO  j  —  OO 
2  n  00 

g— icx.x 

—00 


fi(x  -  u)f2(u)  du 


dx. 


If  we  define  the  convolution  of  fi(x)  and  f2(x)  by 

/OO 

fi{x-  u)f2(u)du, f 


then  (10.9)  becomes 
1 


(10.11)  5iW2  = 


27T 


j_  r°° 

27T 


/1  *  /2  e“ 


:  dx 


27 r 


•  Fourier  transform  of  fi*  f2. 


In  other  words, 


(10.12) 


5i  •  52  and  — /1  *  f2  are  a  pair  of  Fourier  transforms. 
27T 


Because  of  the  symmetry  of  the  f(x)  and  5(a)  integrals,  there  is  a  similar  result 
relating  fi  ■  f2  and  the  convolution  of  51  and  g2.  We  find  that  (Problem  19) 


(10.13)  51  *  52  and  f\  ■  f2  are  a  pair  of  Fourier  transforms. 


As  discussed  in  Chapter  7,  after  (12.2)  and  after  (12.10),  various  references  differ 

'  Note  that  (10.10)  is  really  the  same  as  (10.7)  if  we  agree  that,  for  Laplace  transforms,  f(t  j  =  0 
when  t  <  0  (see  the  first  paragraph  of  Section  8,  page  437).  For  then  in  (10.7),  hir)  =  0  for  r  <  0 
and  g(t  —  r)  =  0  for  r  >  t,  so  the  integral  would  not  really  be  different  if  written  with  infinite 
limits  (in  fact,  it  is  sometimes  written  that  way). 


448  Ordinary  Differential  Equations 


Chapter  8 


in  the  position  of  the  factor  1/(27t).  Some  authors  include  factors  of  1/(27t)  or 
l/\/27r  in  the  convolution  definition  (10.10);  this  definition  as  well  as  Chapter  7, 
equation  (12.2),  affects  (10.12)  and  (10.13).  Check  the  notation  in  any  reference 
you  are  using. 


PROBLEMS,  SECTION  10 

1.  Show  that  g  *  h  =  h  *  g  as  claimed  in  L34.  Hint:  Let  u  =  t  —  t  in  (10.7). 

2.  Use  L34  and  L2  to  find  the  inverse  transform  of  G(p)H(p)  when  G(p)  =  l/(p  +  a) 
and  H(p)  =  l/(p  +  b);  your  result  should  be  L7. 

Use  the  convolution  integral  to  find  the  inverse  transforms  of: 


3. 

P  P 

( p 2  —  l)2  p2  —  : 

5. 

p 

( p  +  a)(p  +  b )2 

7. 

P 

(p  +  a)(p2  —  b2) 

2 

9. 

P3(p  +  2) 

11. 

P 

{p2  +  a2)(p2  +  b2) 

( p  +  a)(p  +  b )2 


(p  +  a)(p2  —  b2) 


{p  +  a)(p  +  b)(p  +  c) 


p(p2  +  a2)2 


p{p2  +  a2)(p2  +  b2) 


Hint:  In  Problems  11  and  12  use  2sin#cos</>  =  sin)#  +  </>)  +  sin)#  —  </>). 

13.  Use  the  Laplace  transform  table  to  find  f(t)  =  fg  e~T  sin(f  —  r)  dr.  Hint:  In  L34, 
let  g(t)  =  e-t  and  h(t)  =  sint,  and  find  G(p)H(p)  which  is  the  Laplace  transform  of 
the  integral  you  want.  Break  the  result  into  partial  fractions  and  look  up  the  inverse 
transforms. 

Use  the  convolution  integral  (see  Example  2)  to  solve  the  following  differential  equations. 

14.  y"  +  Gy'  +  6y  =  e~2t ,  yo  =  y'o  =  0. 

15.  y"  +  3y'  -  4y  =  e3t ,  yo=y'o=0. 

16.  Consider  solving  an  equation  like  (10.1)  but  with  nonzero  initial  conditions. 

(a)  Write  the  corrected  form  of  (10.2),  writing  the  transfer  function  in  factored 
form  as  indicated  just  after  (10.2).  Consider  the  extra  terms  in  Y  which  arise 
from  the  initial  conditions;  show  that  the  inverse  transforms  of  such  terms  can 
always  be  found  from  LG,  L7,  L8,  and  L18. 

(b)  Find  the  explicit  form  of  the  inverse  transform  of  the  transfer  function  for 
a  ^  b  (use  L7),  and  so  write  the  general  solution  of  (10.2)  with  nonzero  initial 
conditions  as  a  convolution  integral  plus  the  terms  which  you  found  in  (a). 

17.  Solve  the  differential  equation  y"  —  a2y  =  f{t),  where 

m  =  {  J  and  yo  =  y'o  =  0. 

Hint:  Use  the  convolution  integral  as  in  the  example. 
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18.  A  mechanical  or  electrical  system  is  described  by  the  differential  equation 
y"  +uo2y  =  f{t).  Find  y  if 


m  = 


1,  0  <  t  <  a, 

0,  otherwise, 


and  y0  =  y'0  =  0. 


Hint:  Use  the  convolution  integral  carefully.  Consider  t  <  a  and  t  >  a  separately, 
remembering  that  /(f)  =  0  for  t  >  a.  Show  that 


{-^■(1  —  cos  tut),  t  <  a, 

-Ar [coscuff  —  a)  —  coscutl,  t  >  a. 

t 


Sketch  the  motion  if  a  =  |T  where  T  is  the  period  for  free  vibrations  of  the  system; 
if  a  =  §T;  if  a  =  ±T. 

19.  Following  the  method  of  equations  (10.8)  to  (10.12),  show  that  /1/2  and  g  1  *  g2  are 
a  pair  of  Fourier  transforms. 


11.  THE  DIRAC  DELTA  FUNCTION 


In  mechanics  we  consider  the  idea  of  an  impulsive  force  such  as  a  hammer  blow 
which  lasts  for  a  very  short  time.  We  usually  do  not  know  the  exact  shape  of  the 
force  function  /(f),  and  so  we  proceed  as  follows.  Let  the  impulsive  force  f(t)  lasting 
from  f  =  fo  till  t  =  t\  be  applied  to  a  mass  in;  then  by  Newton’s  second  law  we 
have 


(11.1) 


dv 

m—  dt 
dt 


rv  1 

I  mdv  =  m(vi 
d  vn 


Vo). 


This  says  that  the  integral  of  /(f)  [called  the  impulse  of  /(f)] 
is  equal  to  the  change  in  the  momentum  of  to,  and  we  note 
that  the  result  is  independent  of  the  shape  of  /(f)  but  de¬ 
pends  only  on  the  area  under  the  /(f)  curve.  If  this  area  is  1, 
we  call  the  impulse  a  unit  impulse.  If  fi  —  fo  is  very  small,  we 
may  simply  ignore  the  motion  of  m  during  this  small  time, 
and  say  only  that  the  momentum  jumped  from  mv 0  to  mv  1 
during  the  time  t\—to-  If  vq  =  0,  the  graph  of  the  momentum  as  a  function  of  time 
would  be  as  in  Figure  11.1,  where  we  have  simply  omitted  the  (unknown)  part  of 
the  graph  between  to  and  f  1.  We  note  that  if  t\  —  to  is  very  small,  the  graph  in 
Figure  11.1  is  almost  the  unit  step  function  (L24).  Let  us  imagine  making  ti  —  to 
smaller  and  smaller  while  keeping  the  jump  in  mv  always  1. 

In  Figures  11.2,  11.3  and  11.4  we  have  sketched  some  possible  sequences  of  func¬ 
tions  f nit)  which  would  do  this.  We  could  draw  many  other  similar  sets  of  graphs; 
the  essential  requirement  is  that  /(f)  should  become  taller  and  narrower  (that  is, 
that  the  force  should  become  more  intense  but  act  over  a  shorter  time)  in  such  a 
way  that  the  impulse  [area  under  the  /(f)  curve]  remains  1.  We  might  then  consider 
the  limiting  case  in  which  Figure  11.1  has  a  jump  of  1  at  to',  the  force  /(f)  required 
to  produce  this  result  would  have  to  be  infinite  and  act  instantaneously.  Also  from 
equation  (11.1),  we  see  that  the  function  /(f)  is  the  slope  of  the  mv  graph;  thus 
we  are  asking  for  /(f)  to  be  the  derivative  of  a  step  function  at  the  jump.  We  see 
immediately  that  no  ordinary  function  has  these  properties.  However,  we  also  note 


mv 

1 


to  h 


Figure  11.1 
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that  we  are  not  so  much  interested  in  /(f)  as  in  the  results  it  produces.  Figure  11.1 
with  a  jump  at  to  makes  perfectly  good  sense;  for  any  t  >  to  we  could  choose  a 
sufficiently  tall  and  narrow  fn(t)  so  that  mv  would  already  have  its  final  value. 
We  shall  see  that  it  is  convenient  to  introduce  a  symbol  6(t  —  to)  to  represent  the 
force  which  produces  a  jump  of  1  in  mv  at  to!  S(t  —  to)  is  called  the  Dirac  delta 
function  although  it  is  not  an  ordinary  function  as  we  have  seen.  (It  may  properly 
be  called  a  generalized  function  or  a  distribution,  and  is  one  of  a  whole  class  of 
such  functions.)  Introducing  and  using  this  symbol  is  much  like  introducing  and 
using  the  symbol  oo.  It  is  convenient  to  write  equations  like  l/oo  =  0,  but  we  must 
not  write  oo/oo  =  1;  that  is,  such  symbolic  equations  must  be  abbreviations  for 
correct  limiting  processes.  Let  us  investigate,  then,  how  we  can  use  the  S  function 
correctly. 

Example  1.  Consider  the  differential  equation 

(11-2)  y" +v2y  =  f(t),  2/o  =  2/0=0- 

This  equation  might  describe  the  oscillations  of  a  mass  suspended  by  a  spring,  or 
a  simple  series  electric  circuit  with  negligible  resistance.  Let  us  assume  that  the 
system  is  initially  at  rest  (yo  =  y'0  =  0);  then  suppose  that,  at  t  =  to  the  mass  is 
struck  a  sharp  blow,  or  a  sudden  short  surge  of  current  is  sent  through  the  elec¬ 
tric  circuit.  The  function  /(f)  may  be  one  of  those  shown  in  Figures  11.2  to  11.4 
or  another  similar  function.  Let  us  solve  (11.2)  with  /(f)  equal  to  one  of  the  func¬ 
tions  in  Figure  11.3,  that  is,  /(f)  =  ne~n^t~t°\t  >  to-  Using  Laplace  transforms  L28 


Section  11 


The  Dirac  Delta  Function  451 


Figure  11.3 


and  L2  we  find 


o V 2  +  w2)y  =  L{ne-n(t~to))  =  n  ■ 


Y=n _ - _ - _ _  =  _ _  _ _ | _ _ _ £ _ 

(p  +  n)  (p2  +  oj2)  (n2  +  oj2)  \_p  +  n  p2  +  oj2  p2  +  oj2  \ 

(You  can  easily  verify  the  partial  fractions  expansion  in  the  last  step.)  Then  by  L28 
with  a  =  to,  L2  with  a  =  n,  and  L3,  L4,  with  a  =  u>  ,  we  find: 

„  ,,  /g-n(t-to)  nsmoj(t  — 10)  COSLo(t—to)\ 

(11.4)  y  =  n(  +  ^ - 2  ,  t  >  t0. 

\  nz  +  ujz  (nz  +  u}z)uj  n.  +  ojz  ) 

(Of  course,  y  =  0  for  t  <  to.)  By  making  f(t)  sufficiently  narrow  and  peaked  (that 
is,  by  making  n  large  enough),  we  can  make  the  first  and  third  terms  in  y  negligible, 
and  the  coefficient  of  sin  uj{t  —  to)  approximately  equal  to  l /to.  Thus  the  solution 
is  approximately 


y  =  —  sin  w(t  -  t0), 

OJ 


for  a  unit  impulse  of  very  short  duration  at  t  =  to-  (We  have  shown  this  only  for 
the  functions  of  Figure  11.3;  however,  the  same  result  would  be  found  for  other  sets 
of  functions,  such  as  those  in  Figure  11.4,  for  example — see  Problem  5.) 
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Figure  11.4 

Now  we  would  like  to  be  able  to  find  (11.5)  without  finding  (11.4),  in  fact, 
without  choosing  a  specific  set  of  functions  /n(t).  Our  discussion  above  suggests 
that  we  try  using  the  symbol  6(t  —  to)  for  f(t)  on  the  right-hand  side  of  (11.2).  In 
solving  the  equation,  we  would  then  like  to  take  the  Laplace  transform  of  6(t  —  to). 

Laplace  Transform  of  a  <5  Function  Let  us  investigate  whether  we  can  make 
sense  out  of  the  Laplace  transform  of  S(t  —  to).  More  generally,  let  us  try  to  attach 
meaning  to  the  integral  f  <j>(t)6(t  —  to)  dt,  where  (f>(t)  is  any  continuous  function 
and  S(t  —  to)  is  the  symbol  indicating  an  impulse  at  to-  We  consider  the  integrals 
/  4>(t)fn(t  —  to)  dt, ,  where  the  functions  fn(t—to)  are  more  and  more  strongly  peaked 
at  to  as  n  increases  (Figure  11.5),  but  the  area  under  each  graph  is  1.  When  fn(t—to) 
is  so  narrow  that  <j>(t)  is  essentially  constant  [equal  to  <t>(to)]  over  the  width  of 
fn(t  —  to),  the  integral  becomes  nearly  <j>(to)  J  fn(t  —  to)  dt  =  <j>(to)  •  1  =  </>{to)\  that 
is,  the  sequence  of  integrals  /  <j>(t)fn(t  —  to)  dt,  tends  to  <f>(to)  as  n  tends  to  infinity. 
It  then  seems  reasonable  to  say  that 


<n-6>  Jjmt -*>)<*  =  {*‘0“x  ^ 


Equation  (11.6)  is  the  defining  equation  for  the  6  function;  when  we  operate  with  S 
functions,  we  use  them  in  integrals  and  (11.6)  tells  us  the  value  of  the  integral.  The 
integral  in  (11.6)  is  not  a  Riemann  integral;  it  is  just  a  very  useful  symbol  indicating 
that  we  have  found  the  limit  of  f  <fi(t)fn(t  —  to)  dt  as  n  — >  oo.  You  may  then  ask 
how  we  can  carry  out  familiar  operations  like  integration  by  parts.  When  you  treat 
an  integral  containing  a  6  function  as  an  ordinary  integral,  you  can,  if  you  like,  think 
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that  you  are  really  working  with  the  functions  fn(t  —  to)  and  then  taking  the  limit 
at  the  end.  Of  course,  all  this  needs  mathematical  justification  which  exists  but 
is  beyond  our  scope.  (For  two  different  mathematical  developments  of  generalized 
functions,  see  Lighthill,  and  Chapter  9  of  Folland.)  Our  purpose  is  just  to  make 
understandable  the  S  function  formulas  which  are  so  useful  in  applications. 

Example  2.  We  can  now  easily  find  the  Laplace  transform  of  S(t  —  to).  In  the  notation 
used  in  L27  (which  we  are  about  to  derive),  we  have,  using  (8.1), 

POO 

(11.7)  L[5(t  —  a)}=  /  S(t  -  a)e~pt  dt  =  e~pa,  a  >  0, 

Jo 

since,  by  11.6,  the  integral  of  the  product  of  S(t,  —  a)  and  a  function  “picks  out” 
the  value  of  the  function  at  t  =  a.  Now  let  us  use  our  results  to  obtain  (11.5)  more 
easily. 


Example  3.  Solve 


(11.8)  y"+u2y=  5{t.-t0),  yo  =  y'o  =  0. 
Taking  Laplace  transforms  and  using  (11.7),  we  get 

(11.9)  (p2  +  o?)Y  =  L[S(t  -  <0)]  =  e~pt° . 


Then 

(11.10) 

and,  by  L3  and  L28, 

(11.11) 


g-pto 
p2  +  Ul2 


y  =  —  sin  co(t  -  t0), 
to 


t  >  to, 


as  in  (11.5). 
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Fourier  Transform  of  a  S  Function  Using  (11.6)  and  the  definition  of  a  Fourier 
transform  [Chapter  7,  equation  (12.2)],  we  may  write 

(11.12)  g(a)  =  2-  J  S(x  —  a)e~iax  dx  =  2- e~iaa . 

Formally,  then  (12.2)  of  Chapter  7  would  give  for  the  inverse  transform 

1  f°° 

(11.13)  S(x  -  a)  =  —  /  eia{x~a)da. 

J  - OQ 


We  say  “formally”  because  the  integral  in  (11.13)  does  not  converge.  However,  if  we 
replace  the  limits  —  oo,  oo  by  — n,  n,  we  obtain  a  set  of  functions  (Problem  12)  which, 
like  the  functions  fn(t)  in  Figures  11.2  to  11.4,  are  increasingly  peaked  around  x  =  a 
as  n  increases,  but  all  have  area  1.  In  this  sense,  then,  (11.13)  is  a  representation 
of  the  S  function.  Equations  (11.12)  and  (11.13)  are  useful  in  quantum  mechanics. 


Another  Physical  Application  of  <5  functions  What  is  the  density  (mass  per 
unit  length)  of  a  point  mass  on  the  x  axis?  Compare  the  concept  of  a  point  mass 
with  our  discussion  of  S  functions.  We  could  think  of  a  point  mass  as  corresponding 
to  the  limiting  case  of  a  density  function  like  those  in  Figures  11.2  to  11.4.  A  point 
mass  at  x  =  a  requires  that  the  density  be  zero  everywhere  except  at  x  =  a  but  the 
integral  of  the  density  function  across  x  =  a  should  be  the  mass  to.  Thus  we  can 
write  the  density  function  for  a  point  mass  to  at  x  =  a  as  mS(x  —  a).  Similarly,  we 
can  represent  the  charge  density  for  point  electrical  charges  using  6  functions. 


Example  4.  The  charge  density  for  a  charge  of  2  at  x  =  3,  a  charge  of  —5  at  x  =  7  and  a 
charge  of  3  at  x  =  —4  would  be  2S(x  —  3)  —  5S(x  —  7)  +  3S(x  +  4). 


Derivatives  of  the  S  Function  To  see  that  we  can  attach  a  meaning  to  the 
derivative  of  S(x  —  a),  we  write  f_  cj>(x)5'(x  —  a)  dx  and  integrate  by  parts  to  get 


(11.14)  /  (j>(x)S'(x  —  a)dx  =  <j)(x)5(x  —  a) 


(/>' (x)S(x  —  a)  dx  =  —<j)'(a). 


The  integrated  term  is  zero  at  ±oo,  and  we  evaluated  the  integral  using  equation 

(11.6) .  Thus,  just  as  S(x  —  a)  “picks  out”  the  value  of  <j>{x)  at  x  =  a  [see  equation 

(11.6) ],  so  S'(x  —  a)  picks  out  the  negative  of  <j>'{x)  at  x  =  a.  Integrating  by  parts 
twice  (Problem  14),  we  find 


(11.15) 


< p(x)S"(x 


a)  dx 


</>"(«)• 


Repeated  integrations  by  parts  gives  the  formula  for  the  derivative  of  any  order  of 
the  S  function  (Problem  14): 


(11.16) 


(j)(x)S<'n\x  —  a)  dx 


(-1  T^n\a). 


We  have  written  the  integrals  in  (11.14)  to  (11.16)  with  limits  — oo  to  oo,  but  all 
that  is  necessary  is  that  the  range  of  integration  include  x  =  a;  otherwise,  as  for 

(11.6),  the  integrals  are  zero. 
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Some  Formulas  Involving  8  Functions  Our  discussion  at  the  beginning  of  this 
section  (see  Figure  11.1  and  Z24  in  the  Laplace  transform  table)  implied  that  the 
derivative  at  x  =  a  of  the  unit  step  function  ought  to  be  8(x  —  a). 


(a) 

,  \  /  1,  x  >  a 

v  ’  \  0,  x  <  a 

(11.17) 

(b) 

"s' 

t-o 

II 

's' 

What  does  the  u'  =  8  equation  mean?  By  definition,  two  generalized  functions  (dis¬ 
tributions)  are  equal,  say  Gi(x)  =  G2(x)  ,  if  f  <j>(x)G i  dx  =  f  </>(x)G2  dx  for  any  test 
function  (/)( x).  Test  functions  are  assumed  to  be  very  well  behaved  functions;  let’s 
assume  that  they  are  continuous  with  continuous  derivatives  of  all  orders  and  that 
they  are  identically  zero  outside  some  finite  interval  so  that  the  integrated  term  in  an 
integration  by  parts  is  always  zero.  You  can  think  of  generalized  functions  as  being 
operators;  given  a  test  function  <j>(x),  they  “operate”  on  it  to  produce  a  value  such  as 
</>( 0).  Compare  the  differential  operators  in  Problem  5.31.  We  wrote  Dx  =  xD  +  1 
where  D  =  d/ dx.  This  would  be  nonsense  as  an  elementary  calculus  formula,  but  as 
an  operator  equation  to  be  applied  to  y(x),  it  means  D(xy)  =  ( xD  +  1  )y  =  xy'  +  y 
which  is  correct.  In  a  similar  way,  two  generalized  functions  are  equal  if  they  give 
the  same  results  when  they  operate  on  any  test  function.  Let’s  try  this  for  (11.17b). 
We  multiply  v!  by  <fi(x),  integrate  by  parts  (noting  that  the  integrated  term  is  zero 
because  we  require  test  functions  to  be  zero  for  large  |x|),  substitute  the  values  of 
u(x  —  a),  and  integrate  again  to  get 

/OO  pOO  poo 

<j>(x)u' (x  —  a)  dx  =  —  /  (f>'(x)u(x  —  a)  dx  =  —  /  4>{x)  dx  = 

-oo  J  —  oo  J  a 

This  is  indeed  the  value  of  the  integral  of  <f>{x)8{x  —  a),  so  u'(x  —  a)  =  8(x  —  a)  is  a 
valid  operator  equation  (generalized  function  equation). 

Since  we  think  of  8(x)  and  its  derivatives  as  being  zero  except  at  the  origin, 
and  x  is  zero  at  the  origin,  it  might  seem  plausible  that  x8,  x28,  xS' ,  etc.  would  be 
identically  zero.  It  turns  out  that  some  of  these  are  zero  and  some  are  not;  to  find 
out,  we  multiply  by  an  arbitrary  test  function  < f>(x )  and  integrate.  We  state  a  few 
results;  also  see  Problems  17  and  18. 


(11.18) 


(а)  x8(x)  =  0 

(б)  xS'(x)  =  —8(x) 

(c)  x28''(x)  =  28  (x) 


To  check  (b),  we  multiply  by  (j>{x)  and  integrate  using  (11.14)  with  <j>(x)  replaced 
by  x<j>( x). 


x8'{x)(j){x)  dx  =  —{xcj))' 


=  —(x<j)'  +  <t>) 


x=0 


x—0 


=  —0(0)  =  —  8{x)(j){x)  dx. 


Here  is  another  way  to  produce  valid  generalized  function  identities  like  those  in 
(11.18).  Suppose  Gi(x)  =  G^x);  then  we  can  show  (Problem  19a)  that  j^Gi(x)  = 
j^G2{x)  and  xnG\{x)  =  xnG2{x).  For  example,  if  we  differentiate  (11.18a)  we  get 
x8'(x)  +  8(x)  =  0,  or  x8'(x)  =  —8(x)  which  is  (11.18b). 
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We  list  a  few  more  operator  equations  (see  Problems  20  and  21). 


We  first  prove  (c)  when  a  is  negative,  say  a  =  —b,  b  >  0.  Let  u  =  — bx ,  then 
du  =  —bdx,  and  the  limits  x  =  — oo,  oo,  become  u  =  oo,  —  oo. 

*x)i(-te)  dx  =  L  SM  &)  =  iJ-j{'=i)s{u)du 

=  =  77^(0)  =  rr  f  <t>(x)6(x)  dx. 

b  |0|  l«l  J- oo 

From  the  second  integral  to  the  third,  we  have  reversed  the  order  of  integration 
(one  minus  sign)  and  also  changed  ^  to  ^  (another  minus  sign,  which  cancels  the 
first).  Now,  if  we  repeat  the  calculation  using  a  >  0  instead  of  —6,  neither  of  these 
sign  reversals  occurs,  and  so  we  get  the  result  ^<£(0)  instead  of  £<£(0).  But  when 
a  >  0,  a  and  \a\  are  the  same.  Thus  we  get  the  result  stated  in  (11.19c). 

S  functions  in  2  or  3  dimensions  It  is  now  straightforward  to  write  the  defining 
equations  in  rectangular  coordinates  for  S  functions  in  2  or  3  dimensions.  We  have 

/OO  POO 

/  4>{x,y)8{x  -  x0 )S(y  -  y0)  dxdy  =  ^(x0,yo)- 

-oo  J  —  OO 

/OO  nOO  pO O 

/  /  <j>{x,y,  z)5(x  -  x0)8(y  -  y0)S(z  -  z0)  dx  dy  dz 

-oo  J  —oo  J —oo 

=  <t>(xo,yo,  ~o)- 

As  in  one  dimension,  the  delta  function  “picks  out”  the  value  of  the  test  function  <f> 
at  the  “peak”  of  the  S  function.  The  integrals  need  not  be  over  all  space,  just  over 
a  region  containing  the  point  ro;  otherwise  the  integral  is  zero.  The  abbreviations 
<5(r)  or  <53(r)  are  often  used  for  5(x)5(y)5(z),  but  note  carefully  that  they  do  not 
mean  functions  of  the  vector  r,  but  rather  functions  of  the  components  x,  y,  z  of  r. 
Similarly  you  may  see  8(r  —  ro)  or  <53(r  —  ro)  meaning  the  S  function  in  (11.21). 

In  spherical  coordinates,  let’s  use  /  instead  of  <j>  to  mean  a  test  function  (since 
(f>  is  a  spherical  coordinate  angle).  By  the  definition  of  5  functions,  we  want 
Iff  f  (i'i  0 ,  (j))S(r  —  ro)S(9  —  9o)S((j)  —  (j)o)  di'  dd  dcj)  =  /(r0, 60,  fo).  But  since  we  would 
like  to  use  the  volume  element  dr  =  r2  sin  9  dr  d9  d(j>  =  r2d?’dfl,  we  need  to  write 
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(also  see  Problem  22) 
(11.22) 


7  ,  5(r  -  r0)5{6  -  0o)8((j>  -  (j>0) 

<(r  -  '»>  = - - ;  then 

f{r,  6, 4>)6( r  -  r0)  dr  =  f(r0,  90,  <t>o). 


then 


Similarly  in  cylindrical  coordinates,  with  dr  =  r  dr  dO  dz, 

5(r  -  r0)8(9  -  90)S(z  -  z0) 

o(r  -  r0)  = - ; 

(11.23) 


f(r,  d,  z)6( r  -  r0)  dr  =  f(r0, 90 ,  z0). 


Note  that  we  can  use  these  formulas  to  write  mass  density  or  charge  density 
functions  in  the  various  coordinate  systems. 


Example5.  Suppose  there  is  a  unit  charge  or  unit  mass  at  the  point  (x,  y,  z)  =  (—1,  \/3,  —2) 
then  in  rectangular  coordinates,  the  density  is 

p  =  6(x  +  1  )S(y  —  \/3)<!>(z  +  2). 

In  cylindrical  coordinates  the  point  is  ( r,9,z )  =  (2, 27r/3,  — 2)  so  in  cylindrical  co¬ 
ordinates  the  density  is 


p  =  6(r  —  2)5(9  —  2tt/3)5(z  +  2  )/r. 

In  spherical  coordinates,  the  point  is  (r,  9,  (f>)  =  (2\/2,  37t/4,  27t/3),  so  in  spherical 
coordinates  the  density  is 

p  =  5(r-  2y/2 )5(9  -  3tt/4)5(0  -  2?r/3)/(r  sin  9) . 

Finally,  let’s  verify  two  useful  operator  equations  for  <)'  functions  in  3  dimensions. 


(11.24)  V  •  — j  =  47r<5(r); 

(11.25)  V2-  =  —Att5(y). 

r 


You  can  easily  show  (Problem  24a)  that  V  •  (er/r2)  is  zero  for  any  r  /  0  (and  un¬ 
defined  for  r  =  0).  Also,  by  the  divergence  theorem  [Chapter  6,  equation  (10.17)] 
in  spherical  coordinates,  we  find 


volume  r 


surface 
inclosing  r 


r 2  sin  9  d9  dcj)  =  Att. 


Thus  V  •  (er/r2)  has  the  properties  that  it  is  zero  for  all  r  >  0  but  its  integral 
over  any  volume  including  the  origin  =  47r;  this  suggests  that  it  is  equal  to  Air8(r). 
Let’s  verify  that  this  is  correct.  (Compare  Problem  25.)  Since  V  •  (er/r2)  depends 
only  on  r  (Problem  24a),  we  use  a  test  function  /(r).  We  want  to  show  that 
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Iff  /(r)^7  '  (er/r2)  dr,  over  any  volume  containing  the  origin,  is  equal  to  4-7t/(0). 
For  convenience  we  integrate  over  the  volume  inside  the  sphere  r  =  a.  (Since  the 
integrand  is  zero  for  r  >  0,  the  answer  is  the  same  for  any  volume  containing  the 
origin.)  By  Problem  11.17(e)  of  Chapter  6  with  (/>  =  /,  V  =  er/r2,  and  n  =  er,  we 
find 


volume  r<a  surface  r=a  volume  r<a 


On  the  surface  r  =  a,  the  integrand  of  the  surface  integral  is  f(a)-^a2  dtt,  so  the 
surface  integral  is  4-7r/(a).  In  the  volume  integral,  V/(r)  •  er  is  the  r  component  of 
V/(r);  in  spherical  coordinates  this  is  just  df  /dr  (Chapter  6,  equation  6.8).  Thus 
the  volume  integral  on  the  right-hand  side  is 


volume  r<a 


f)  f  1 

— 1 — - r 2  dr  dQ  =  47 r/(r) 
or  rz 


4 tt[/(o)  ^  /( 0)] 


and  we  have 


J jj  /WV  •(^dT  =  47i 7(a)  -  47r[/(a)  -  /( 0)]  =  4tt/(0) 

volume  r<a 

as  we  expected.  Thus  (11.24)  is  a  valid  operator  equation.  You  can  show  (Prob¬ 
lem  24b)  that  V(l/r)  =  —  er/r2).  Since  V*  V  =  V2  (that  is,  div  grad  =  Laplacian), 
we  have  V2(l/r)  =  V  •  V(l/r)  =  —V  •  (er/r2).  Thus  (11.25)  is  also  valid. 

We  can  write  (11.24)  and  (11.25)  with  the  peak  of  the  <5  function  shifted  from 
the  origin  to  r(j.  The  unit  vector  from  ro  to  r  can  be  written  as  (r  —  ro)/|r  —  ro|. 
Then  we  have 

(11.26)  V2^^-  =  -V  •  r  ~  r°|3  =  — 47T(5(r  -  r0). 

|r-r0|  |r-r0|3 

It  is  now  interesting  to  note  that  we  have  seen  this  before  without  recogniz¬ 
ing  that  we  were  dealing  with  a  6  function.  Look  back  to  Chapter  6,  Equations 
(6.10.19)  to  (6.10.25).  With  D  given  by  (6.10.19),  we  have  from  (8.11.24)  above, 
V-D  =  j-4Tr5(r)  =  g£(r)  which  we  recognize  as  the  charge  density  for  a  charge  q  at 
the  origin.  Thus,  although  we  wrote  (6.10.25)  with  p  as  the  density  of  a  charge  dis¬ 
tribution,  we  could  now  write  it  with  p  =  charge  density  of  a  point  charge  =  (^(r), 
and  then  (6.10.25)  becomes  (6.10.22). 

PROBLEMS,  SECTION  11 

1.  Find  the  inverse  Laplace  transform  of  e-2p/p2  in  the  following  ways: 

(a)  using  L5  and  L27  and  the  convolution  integral  of  Section  10; 

(b)  using  L28. 

2.  Verify  L24  in  the  table  by  using  LI,  L27,  and  the  convolution  integral. 

3.  Verify  L28  in  the  table  by  using  L27  and  the  convolution  integral. 

4.  Show  that  fn(t )  dt  =  1  for  the  functions  fn(t)  in  Figures  11.3  and  11.4. 
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5.  Solve  the  differential  equation  y"  +  ui2y  =  f(t),  yo  =  y'o  =  0,  with  f(t)  given  by  the 
functions  in  Figure  11.4,  by  the  following  methods. 


(a) 

(b) 


Use  the  convolution  integral,  being  careful  to  consider  separately  the  three 
intervals  0  to  to,  to  to  to  +  1/n,  and  to  +  1/n  to  oo. 


Write  /n(t)  as  a  difference  of  unit  step  functions  as  in  L25,  and  use  L25  to 
find  L(/„).  Expand  - by  partial  fractions  and  use  L28  to  find  yn(t). 

p(p2  +  LU2) 

Your  result  should  agree  with  (a). 


(c)  Let  n  — >  oo  and  show  that  your  solution  in  (a)  and  (b)  tends  to  the  same 
solution  (11.5)  obtained  using  the  functions  of  Figure  11.3;  that  is,  either  set 
of  functions  gives,  in  the  limit,  the  same  solution  (11.11)  obtained  by  using  the 
8  function.  Note  that,  when  you  let  n  — ♦  oo,  you  do  not  need  to  consider  the 
interval  to  to  to  +  1/n  since,  if  t  >  to,  then  for  sufficiently  large  n,  t  >  to  +  1/n. 


6.  (a)  Let  a  mechanical  or  electrical  system  be  described  by  the  differential  equation 

Ay"  +  By 1  +  Cy  =  f(t),  yo  =  y'o  =  0.  As  in  Problem  10.16b,  write  the  solution 
as  a  convolution  (assume  a  ^  b).  Let  f(t)  be  one  of  the  functions  in  Figure 
11.4  and  Problem  5.  Find  y  and  then  let  n  —*  oo. 

(b)  Solve  (a)  with  f(t)  =  8(t  —  to);  your  result  should  be  the  same  as  in  (a). 

(c)  The  solution  y  as  found  in  (a)  and  (b)  is  called  the  response  of  the  system  to  a 
unit  impulse.  Show  that  the  response  of  a  system  to  a  unit  impulse  at  to  =  0 
is  the  inverse  Laplace  transform  of  the  transfer  function. 


Using  the  8  function  method,  find  the  response  (see  Problem  6c)  of  each  of  the  following 
systems  to  a  unit  impulse. 


7.  y"  +  2j/'  +  y  =  5(t  -  to) 

8.  y"  +  4y'  +  5y  =  5{t  -  to) 

9.  y"  +  2  y  +  lOy  =  8(t  —  to) 

10.  y"  -9 y  =  8(t  -  to) 

dAy 

H-  -r^-y  =  S{t-to) 

12.  Evaluate  the  functions  f„(x  —  a)  defined  by  the  integral  in  (11.13)  with  limits  —  n,  n. 

Show  that  fn(x  —  a)  dx  =  1  for  all  n.  Sketch  or  computer  plot  graphs  of  several 

fn’s  to  show  that  as  n  increases  the  functions  fn(x)  are  increasingly  peaked  around 

x  =  a,  and  that  as  \x  —  a\  increases,  they  oscillate  with  decreasing  amplitude. 

13.  Using  8  functions,  write  the  following  mass  or  charge  density  functions. 

(a)  Mass  5  at  x  =  2,  and  mass  3  at  x  =  —7. 

(b)  Charge  3  at  x  =  —5  and  charge  —4  at  x  =  10. 

14.  Integrate  by  parts  as  we  did  for  (11.14)  to  obtain  (11.15)  and  (11.16). 

15.  Use  (11.6)  and  (11.14)  to  (11.16)  to  evaluate  the  following  integrals.  Warning  hint: 
See  comments  just  after  (11.6)  and  (11.16)  about  the  range  of  integration. 


P  7T 

(a)  /  sin  a:  8(x  —  — )  dx 

Jo  ^ 

(b) 

P  7T 

/  sinx  S(x  +  — )  dx 

Jo  2 

r 1  „ 

P7T 

(c)  J  e3xS'(x)dx 

(d) 

/  coshx  6"(x  —  1)  dx 
Jo 
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16.  Verify  the  operator  equation  ^sgn  x  =  25(x)  where  the  function  signum  x,  meaning 
“sign  of  x,”  and  abbreviated  sgn  x,  is  defined  by 


sgn  x  — 


1,  x  >  0, 

— 1,  x  <  0. 


17.  Verify  (11.18a)  and  (11.18c)  by  multiplying  by  a  test  function  and  integrating. 

18.  Use  equation  (11.16)  to  generalize  the  operator  equations  (11.18)  as  follows: 

(a)  Show  that  xm5 ^  ( x )  =  0  if  m  >  n;  compare  equation  (11.18a). 

(b)  Show  that  xn5(-n\x)  =  (— 1  )nn\6(x)',  compare  (11.18b)  and  (11.18c). 

(c)  Show  that  xmS(n\x)  =  (-l)mT^Tr  <5(T,“m)(x),  m  <  n. 

(d)  Use  the  results  in  (a)  and  (b)  to  show  that 

(x2  +  y2  +  z2)V2[5(x)8(y)5(z)\  =  6<5(*)<5(2/)<S(;s). 


19.  (a)  Show  that  you  can  differentiate  a  generalized  function  equation  or  multiply  it 

by  a  power  of  x.  This  means  to  show  that  if  f  <fi(x)Gi(x )  dx  =  f  cj>(x)G 2(x)  dx 
for  all  test  functions  (j>,  then 


J  (j>(x)G'1(x)  dx  =  j  4>(x)G'2(x)  dx 


J 


<j>(x)xnGi(x)  dx  =  J  (f>(x)xnG2(x) 


and 

dx. 


Hints:  For  the  differentiation  proof,  integrate  by  parts.  For  the  multiplication 
by  xn  proof,  consider  whether  xn(j>(x)  is  a  test  function  if  <j>  is.  See  comment 
just  after  equation  (11.17). 

(b)  Multiply  (11.18b)  by  x  and  use  (11.18a).  Differentiate  the  result  and  simplify 
to  get  (11.18c). 

(c)  Multiply  (11.18c)  by  x,  use  (11.18a),  differentiate  and  simplify  to  find  x3S"'(x) 
in  terms  of  <5(x).  Check  your  result  by  Problem  (18b). 

(d)  Try  a  few  more  examples  as  in  (b)  and  (c)  and  check  your  results  by  Problem  18. 

20.  Verify  the  operator  equations  in  (11.19)  not  done  in  text. 

Hints  for  (a)  and  (b):  Follow  the  text  method  of  proof  of  (c),  making  the  change  of 
variable  u  =  —  x  or  u  =  a  —  x.  Hints  for  (c)  and  (d):  Split  the  integral  into  a  sum 
of  integrals  each  including  just  one  Xi.  In  (d),  what  is  the  value  of  (x  —  a)  when  x 
is  in  the  vicinity  of  6?  Use  part  (c). 


21.  Make  use  of  the  operator  equations  (11.19)  and  previous  equations  to  evaluate  the 
following  integrals. 


(a)  f  (5x  —  2)6(2  —  x)  dx 

Jo 

<•>  /_: 


cos*  S(— 2x)  dx 


(b) 


(d) 


</>(x)S(x2  —  o2)  dx 


L 


r/2 


cos  *  6 (sin  *)  dx 


—  7T  /  2 


22.  You  may  find  the  spherical  coordinate  5  function  written  as 


6(  r  —  ro)  =  6(r  —  ro)6(cos6  —  cos6o)5(rf>  —  (j>o)/r2. 


Show  that  this  equation  is  equivalent  to  (11.22).  Hints:  You  want  to  show  that 
S(cos9  —  cos6*o)  =  8(6  —  do)/ sin 6.  See  (11.19e).  Also  note  that  it  doesn’t  really 
matter  whether  we  write  r2  sin  9  or  Tq  sin#o  in  the  denominator  of  (11.22)  since  the 
5  functions  are  zero  unless  r  =  ro  and  9  =  9q. 


Section  12 


A  Brief  Introduction  to  Green  Functions  461 


23.  Write  a  formula  in  rectangular  coordinates,  in  cylindrical  coordinates,  and  in  spher¬ 
ical  coordinates  for  the  density  of  a  unit  point  charge  or  mass  at  the  point  with  the 
given  rectangular  coordinates: 

(a)  (-5,5,0)  (b)  (0, -1,-1) 

(c)  (-2,0,  2^3)  (d)  (3,  —3,  —V&) 

24.  (a)  Show  that  V  •  (er/r2)  =  0  for  r  >  0.  Hint:  You  can  do  this  in  rectangular 

coordinates,  but  it  is  easier  in  spherical  coordinates.  See  Chapter  6,  equation 
(7.9).  Show  that  V  •  [er.F(r)],  for  any  F(r),  is  a  function  of  r  only. 

(b)  Show  that  V(l/r)  =  —er/r2.  See  Chapter  6,  equation  (6.8). 

25.  Let 


Show  that  F"(x) 
that  F"(x)  might 
not  true. 

(a)  Show  that  (j>(x)F" (x)  dx  =  0(0)  +  20'(O),  where  0  is  any  test  function. 

Then  by  (11.6)  and  (11.14),  what  is  F"(x)? 

(b)  Show  that  F(x)  =  (x  —  2 )u(x)  where  u(x)  is  the  unit  step  function  in  (11.17). 
Differentiate  this  equation  twice  and  simplify  using  (11.17)  and  (11.18).  Com¬ 
pare  your  result  in  (a). 

(c)  As  in  (a)  and  (b),  find  G"(x)  in  terms  of  5  and  5'  if 


F(x)  = 


x  —  2, 
0, 


x  >  0, 
x  <  0. 


=  0  for  all  x  fF  0,  and  F"(x)  dx  =  1,  which  leads  you  to  think 
=  5(x).  Show  in  two  ways,  as  outlined  in  (a)  and  (b),  that  this  is 


G  Or) 


3*  +  1,  x  >  0, 
2x  —  4,  x  <  0. 


12.  A  BRIEF  INTRODUCTION  TO  GREEN  FUNCTIONS 

Let’s  do  some  examples  to  see  what  a  Green  function  is  and  how  we  can  use  it 
to  solve  ordinary  differential  equations.  Also  see  Chapter  13,  Section  8,  for  an 
application  to  partial  differential  equations.  (You  might  find  it  interesting  to  read 
“The  Green  of  Green  Functions”,  Physics  Today,  December  2003,  41-46.) 

Example  1.  We  reconsider  the  differential  equation  (11.2),  namely 
(12-1)  y"  +  u2y  =  /(f),  yo  =  y'Q  =  0 

where  /(f )  is  some  given  forcing  function.  Using  (11.6),  we  can  write 

/»oo 

(12.2)  /(f)  =  /  f{t’W  -  t)  dt ', 

Jo 

that  is,  we  can  think  of  the  force  /(f)  as  (a  limiting  case  of)  a  whole  sequence  of 
impulses.  (You  might  reflect  that,  on  the  molecular  level,  air  pressure  is  the  force 
per  unit  area  due  to  a  tremendous  number  of  impacts  of  individual  molecules.)  Now 
suppose  that  we  have  solved  (12.1)  with  /(f)  replaced  by  S(t,'  —  f),  that  is,  we  find 
the  response  of  the  system  to  a  unit  impulse  at  f'.  Let  us  call  this  response  G(f,  f'), 
that  is,  G(f,  f')  is  the  solution  of 


(12.3) 


^2  G(f,  f')  +  w2G(f,  f ')  —  <5(f'  —  f). 
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Then,  given  some  forcing  function  /(£),  we  try  to  find  a  solution  of  (12.1)  by  “adding 
up”  the  responses  of  many  such  impulses.  We  shall  show  that  this  solution  is 

pOO 

(12.4)  y(t)=  /  G(t,t')f(t')  dt' . 

Jo 

Substituting  (12.4)  into  (12.1)  and  using  (12.3)  and  (12.2),  we  find 

v" + =  (|j + "2) y  =  (h + “2)  l  G(i’  !')/(!'>  dt' 

nOO  /  _j2  \  /»<X> 

=  Jo  (^+^2)G(f,fW)^'  =  /o  6{t' =  f{t). 

Thus  (12.4)  is  a  solution  of  (12.1). 

The  function  G(f ,  t')  is  called  a  Green  function  (or  Green’s  function).  The  Green 
function  is  the  response  of  the  system  to  a  unit  impulse  at  t  =  t' .  Solving  (12.3) 
with  initial  conditions  G  =  0  and  dG/dt.  =  0  at  t  =  0,  we  find  (Problem  1) 


G(t, t')  =  j 0  <£<£', 
|  sinw(£  —  t ),  0  <  t!  <  t. 


Then  (12.4)  gives  the  solution  of  (12.1)  with  yo  =  Vo  =  0,  namely 


/■*  1 

y(t)  =  /  —sin  oj(t  —  dt' . 

Jo  u 


(The  upper  limit  is  t'  =  t  since  G  =  0  for  t'  >  t.)  Thus,  given  a  forcing  function 
/(£),  we  can  find  the  response  y(t)  of  the  system  (12.1)  by  integrating  (12.6)  (see 
Problems  2  to  5) .  Similarly  for  other  differential  equations  we  can  find  the  solution 
in  terms  of  an  appropriate  Green  function  (see  Problems  6  to  8). 


Example 2.  As  we  will  see  later  (Chapter  13,  Section  8),  in  using  Green  functions  in  three- 
dimensional  problems,  we  usually  want  a  solution  which  is  zero  on  the  boundary  of 
some  region.  In  order  to  have  a  similar  problem  here,  let  us  ask  for  a  solution  of 

(12.7)  y"  +  y  =  f{x) 

such  that  y  =  0  at  x  =  0  and  at  x  =  7r/2.  A  physical  interpretation  of  this  problem 
may  be  useful.  If  a  string  is  stretched  along  the  x  axis  from  a:  =  0toa;  =  7r/2,  and 
then  caused  to  vibrate  by  a  force  proportional  to  —f(x)  sinwf,  then  \y{x)\  in  (12.7) 
gives  the  amplitude  of  small  vibrations. 

We  first  find  a  solution  of  [compare  (12.3)] 

d2 

(12.8)  — -^G(x,  x')  +  G(x,  x')  =  Six'  —  x) 
dxz 

satisfying  G(0,x')  =  G(tt/2,x')  =  0;  this  solution  is  the  Green  function  for  our 
problem.  Then  [compare  (12.4)] 

r*/ 2 

(12.9)  y(x)  =  /  G{x,x’)f{x')dx' 

Jo 
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gives  a  solution  of  (12.7)  satisfying  the  conditions  y( 0)  =  y( 7t/2)  =  0  (Problem  9). 

To  construct  the  desired  Green  function,  we  first  note  that  for  any  x  ^  x' ,  the 
equation  (12.8)  becomes 


(12.10) 


——^G(x,x')  +  G(x,x')  =  0,  x  7^  x1 . 
dxz 


The  solutions  of  (12.10)  are  sin  a;  and  cos  a:;  we  observe  that  sin  a;  =  0  at  x  =  0  and 
cos  a:  =  0ata;  =  7r/2.  Thus  we  try  to  find  a  Green  function  of  the  form 


(12.11) 


G(x,  x')  = 


A{x’)  sin  a;,  0  <  x  <  x'  <  7r/2, 

B{xf)  cos  x,  0  <  x'  <  x  <  7t/2. 


The  next  step  may  be  clarified  by  thinking  about  |G(*,  *')| 

the  string  problem.  If  the  string  is  oscillated  by  a  _ 

concentrated  force  at  x'  [see  (12.8)],  then  the  am-  _ _ - — 1  \ 

plitude  of  the  vibration  given  by  (12.11)  is  shown  ?  \ 

in  Figure  12.1.  At  x  =  x',  G(x,x')  is  continuous, 

that  is,  from  (12.11)  Figure  12.1 

(12.12)  A(x' )  sin  x'  =  B{x')  cos  x' . 

However  (see  Figure  12.1),  the  slope  changes  abruptly  at  x' .  From  (12.11),  we  find 


SG(„')  = 


A(a/)cosa:,  x  <  a/, 
— B (a/)  sin  x,  x  >  x' . 


(12.13) 


Change  in  —  at  x'  is  —  B(x')  sin  a;'  —  A(x')  cos  a;'. 
ax 


We  can  evaluate  this  change  in  dG/dx  by  integrating  (12.8)  from  x  =  x'  —  e  to 
x  =  x'  +  e  and  letting  e  — >  0.  Since  /  d2G/dx2  =  dG/dx,  we  find 

j/~< \x  +£  fi+e  rx'+e 


dx  Jx,_e 


/•x  +e  i-x  +e 

/  G{x,x')dx=  /  8{x'  —  x)  dx  =  1, 

J  x'  —  e  J  x'  —  e 


or,  letting  e  — >  0: 

^Change  in  slope  ^  at  x'^j  is  1. 

Then  from  (12.13) 

(12.14)  —B(x')  sin  x'  —  A{ x')  cos  x'  =  1. 

We  solve  (12.12)  and  (12.14)  for  A(x')  and  B{ x')  (Problem  10)  and  get 

(12.15)  A(x')  =  —  cos  a;',  B{x')  =  —  sin  a;'. 


Thus  we  have 


(12.16) 


G( x,  x')  = 


—  cos  x'  sin  ai,  0  <  x  <  x'  <  7t/2, 

—  sin  a;' cos  a;,  0  <  x'  <  x  <  tt/2. 


Then  from  (12.9),  the  solution  of  (12.7)  with  y(0)  =  y( n/2)  =  0  is 

nX  pTl /2 

(12.17)  y(x)  =  —  cosx  /  (smx')f(x')  dx'  —  sinx  /  (cos  x')f(x')dx'. 

Jo  Jx 
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Example  3.  If  f{x)  =  cscx,  we  find  from  (12.17): 


V  0) 


nX  />7t/2 

—  cos  x  /  sin  x'  esc  x'  dx'  —  sin  x  /  cos  x'  esc  x'  dx' 
Jo  Jx 


(—  cos  x)  (x)  —  (sin  x)  (In  sin  x') 


■k/2 


—x  cos  x  +  (sin  x)  (In  sin  x) . 


It  is  interesting  to  note  that  we  can  use  the  Green  function  method  to  obtain  a 
particular  solution  of  a  nonhomogeneous  differential  equation  (nonzero  right-hand 
side)  when  we  know  the  solutions  of  the  corresponding  homogeneous  equation  (zero 
right-hand  side).  (See  Problems  14  to  18.)  In  (12.17)  each  integral  gives  a  function 
of  x  minus  a  constant  (from  the  constant  limits);  these  constants  times  sin  a:  and 
cos  x  give  a  solution  of  the  homogeneous  equation.  Thus  the  remaining  terms  give 
a  particular  solution  of  the  nonhomogeneous  equation.  We  can  write  this  particular 
solution  in  a  simple  form  by  changing  J'J'2  to  —  J'*/.2 ,  dropping  the  constant  limits 
and  writing  indefinite  integrals.  Then  a  particular  solution  yp(x)  of  (12.7)  is  given 

by 


(12.18) 


yP(x) 


—  COSX 


J  (sin  x)f(x )  dx  +  sin  x 


J  (cos  x)f(x)  dx. 


Example  4.  By  the  same  methods  used  above,  you  can  verify  (Problem  14)  that  a  solution 
of  the  differential  equation 

(12.19)  y"  +p{x)y'  +  q{x)y  =  f(x) 


with  y{a)  =  y(b)  =  0  is  given  by 


/noorA  ^  ^  fx  yi(x')f{x')  ,  ,  ,  ,  ,  fb  y2{x')f(x')  ^  , 

(12.20)  y(x)  =  y2(x)  J  dx  +  yi{x)  J '  ^  dx  , 

where  yi{x)  and  y2{x)  are  solutions  of  the  homogeneous  equation  with  j/i(a)  =  0, 
y2(b)  =  0,  and  W  is  the  Wronskian  of  yi(x)  and  y2(x)  [See  Chapter  3,  equation 
(8.5)].  Just  as  in  (12.18),  we  find  that  a  particular  solution  yp  of  (12.19)  is 

(12.21)  yp(x)  =  y2{x)  J  dx  ~  Vl / 


y2{x)f{x) 

W(x) 


The  particular  solution  (12.18)  and  (12.21)  are  exactly  the  same  as  those  obtained 
by  the  method  of  variation  of  parameters  (see  Problem  14b)  but  the  Green  function 
method  may  seem  less  arbitrary. 


PROBLEMS,  SECTION  12 

1.  Solve  (12.3)  if  G  =  0  and  dG/dt  =  0  at  t  =  0  to  obtain  (12.5).  Hint:  Use  L28  and 
L3  to  find  the  inverse  transform. 


In  Problems  2  and  3,  use  (12.6)  to  solve  (12.1)  when  f(t)  is  as  given. 
2.  f(t)  =  sin  Lot  3.  f(t)  =  e_t 
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4. 

5. 

6. 


Use  equation  (12.6)  to  solve  Problem  10.18. 

Obtain  (12.6)  by  using  the  convolution  integral  to  solve  (12.1). 

For  Problem  10.17,  show  (as  in  Problem  1)  that  the  Green  function  is 

0,  0  <  t  <  t', 

(1/a)  sinha(t  —  t'),  0  <t'<t. 


G(M')  =  { 


7. 


Thus  write  the  solution  of  Problem  10.17  as  an  integral  [similar  to  (12.6)]  and  eval¬ 
uate  it. 

Use  the  Green  function  of  Problem  6  to  solve 

y  -a  y  =  e  ,  y0  =  y0  =  0. 


8.  Solve  the  differential  equation  y"  +  2 y'  +  y  =  f(t),  yo  =  yb  =  0,  where 


f(t)  = 


1,  0  <  t  <  a, 

0,  t  >  a. 


As  in  Problems  6  and  7,  find  the  Green  function  for  the  problem  and  use  it  in 
equation  (12.4).  Consider  the  cases  t  <  a  and  t  >  a  separately. 

9.  Following  the  proof  of  (12.4),  show  that  (12.9)  gives  a  solution  of  (12.7). 

10.  Solve  (12.12)  and  (12.14)  to  get  (12.15).  Hint:  Use  Cramer’s  rule  (Chapter  3, 
Section  3);  note  that  the  denominator  determinant  is  the  Wronskian  [Chapter  3, 
equation  (8.5)]  of  the  functions  sin  a:  and  cos®. 


In  Problems  11  to  13,  use  (12.17)  to  find  the  solution  of  (12.7)  with  y(0)  =  y( ir/2)  =  0 
when  the  forcing  function  is  given  f(x). 


11.  /(®)  =  sin2® 


12.  /(®)  =  sec® 


13. 

14. 


/(*)  = 


x,  0  <  x  <  7t/4 
i/r/2  —  x,  7t/4  <  x  <  7r/2. 

Hint:  Write  separate  formulas  for  y(x)  for  x  <  n/4  and  x  >  n/4. 


(a)  Given  that  yi(x)  and  1/2  (®)  are  solutions  of  (12.19)  with  f(x)  =  0,  and  that 
yi(a)  =  0,  yi{b)  =  0,  find  the  Green  function  [as  in  (12.11)  to  (12.16)]  and 
so  obtain  the  solution  (12.20).  Then  find  the  particular  solution  (12.21)  as 
discussed  for  (12.18)  and  (12.21). 

(b)  The  method  of  variation  of  parameters  is  an  elementary  way  of  finding  a  par¬ 
ticular  solution  of  (12.19)  when  you  know  the  solutions  of  the  homogeneous 
equation.  Show  as  follows  that  this  method  leads  to  the  same  result  (12.21)  as 
the  Green  function  method.  Start  with  the  known  solution  of  the  homogeneous 
equation,  say  y  =  cij/i  +C2J/2  and  allow  the  “constants”  to  be  functions  of  x  to 
be  determined  so  that  y  satisfies  (12.19).  (The  c’s  are  the  “parameters”  which 
are  to  be  “varied”  in  the  expression  “variation  of  parameters”.)  You  want  to 
find  y'  and  y"  to  substitute  into  (12.19).  First  find  y'  and  set  the  sum  of  the 
terms  involving  derivatives  of  the  c’s  equal  to  zero.  Differentiate  the  rest  of 
y'  again  to  get  y" .  Now  substitute  y,  y'  and  y"  into  (12.19)  and  use  the  fact 
that  y  1  and  2/2  both  satisfy  the  homogeneous  equation  [that  is,  (12.19)  with 
f(x)  =  0].  You  should  have  the  two  equations: 


cl  yi  +  c'2  2/2  =  0, 

C1 2/1  +cli/l  =  f(x). 


Solve  this  pair  of  equations  for  cl  and  cl  [say  by  determinants,  and  note  that  the 
denominator  determinant  is  the  Wronskian  as  in  (12.20)  and  (12.21)].  Write 
the  indefinite  integrals  for  ci  and  C2,  and  write  y  =  cryi  +  C22/2  to  get  (12.21). 
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In  Problems  15  to  18,  use  the  given  solutions  of  the  homogeneous  equation  to  find  a 
particular  solution  of  the  given  equation.  You  can  do  this  either  by  the  Green  function 
formulas  in  the  text  or  by  the  method  of  variation  of  parameters  in  Problem  14b. 

15.  y"  —  y  =  sech*;  sinh*,  cosh* 

16.  x2y"  —  2xy'  +  2y  =  *ln*;  x,  x2 

17.  y"  —  2(csc2  x)y  =  sin2  *;  cot*,  1—*  cot* 

18.  (*2  +  l)y"  —  2xy'  +  2y  =  (x2  +  l)2;  x,  1  -  x2 

13.  MISCELLANEOUS  PROBLEMS 

Identify  each  of  the  differential  equations  in  Problems  1  to  24  as  to  type  (for  example, 
separable,  linear  first  order,  linear  second  order,  etc.),  and  then  solve  it. 


1. 

x2i /  —  xy  =  l/x 

2. 

*(ln  y)y'  —  ylnx  =  0 

3. 

y"  +  2  y"  +  2y'  =  0 

4. 

d2r  dr 

__6_+9r  =  o 

5. 

(2*  —  i/sin  2x)  dx  =  (sin2*  —  2 y)  dy 

6. 

y"  +  2 y  +2 y  =  10ex  +  6e~x  cos* 

7. 

6x  y  y  —  x  y  =1 

8. 

2  n  1  , 

x  y  -xy  +y  =  * 

9. 

dy  -  (2 y  +  y2e3x)  dx  =  0 

10. 

u(l  —  v)  dv  +  v2(l  —  u)  du  =  0 

11. 

(■ y  +  2x)  dx  —  x  dy  =  0 

12. 

xy"  +  y  =  4* 

13. 

y"  +  4  y  +5  y  =  26e3x 

14. 

y"  +  4 y  +  5y  =  2e~2x  cos* 

15. 

y"  —  4  y  +4  y  =  6e2x 

16. 

y  -  5y  +  6y  =  e 

17. 

(2x  +  y)  dy  —  (x  —  2 y)  dx  =  0 

18. 

(*cos y  —  e~siny)  dy  +  dx  =  0 

19. 

sin2*  dy  +  [sin2*  +  (*  +  ?/)  sin  2*]  dx  - 

=  0 

20. 

y"  —  2  y  +  5y  =  5*  +  4ea:(l  +  sin  2*) 

21. 

y  +xy=  x/y 

22. 

(D  —  2)2(D2  +  9)y  =  0 

23. 

sin  9  cos  9  dr  —  sin2  9  d6  =  r  cos2  9  d9 

24. 

x(yy"  +  y'2)  =  yy'  Hint:  Let  u  =  yy ‘ 

In  Problems  25  to  28,  find  a  particular  solution  satisfying  the  given  conditions. 

25.  3x2y  dx  +  x3  dy  =  0,  y  =  2  when  x  =  1. 

26.  xy'  —  y  =  x2 ,  y  =  6  when  x  =  2 

27.  y"  +  y'  —  6y  =  6,  y  =  1,  y'  =  4  when  *  =  0 

28.  yy”  +  y'2  +  4  =  0  y  =  3,  y'  =  0  when  x  =  1 

If  10  kg  of  rock  salt  is  placed  in  water,  it  dissolves  at  a  rate  proportional  to  the 

amount  of  salt  still  undissolved.  If  2kg  dissolve  during  the  first  10  minutes,  how 
long  will  it  be  until  only  2kg  remain  undissolved? 


29. 
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30.  A  mass  m  falls  under  gravity  (force  mg)  through  a  liquid  whose  viscosity  is  decreasing 
so  that  the  retarding  force  is  —2mv/{l  +  t),  where  v  is  the  speed  of  m.  If  the  mass 
starts  from  rest,  find  its  speed,  its  acceleration,  and  how  far  it  has  fallen  (in  terms 
of  g)  when  t  —  X. 

31.  The  acceleration  of  an  electron  in  the  electric  field  of  a  positively  charged  sphere  is 
inversely  proportional  to  the  square  of  the  distance  between  the  electron  and  the 
center  of  the  sphere.  Let  an  electron  fall  from  rest  at  infinity  to  the  sphere.  What 
is  the  electron’s  velocity  when  it  reaches  the  surface  of  the  sphere? 

32.  Suppose  that  the  rate  at  which  you  work  on  a  hot  day  is  inversely  proportional  to 
the  excess  temperature  above  75°.  One  day  the  temperature  is  rising  steadily,  and 
you  start  studying  at  2  p.m.  You  cover  20  pages  the  first  hour  and  10  pages  the 
second  hour.  At  what  time  was  the  temperature  75°  ? 

33.  Compare  the  temperatures  of  your  cup  of  coffee  at  time  t 

(a)  if  you  add  cream  and  let  the  mixture  cool; 

(b)  if  you  let  the  coffee  and  cream  sit  on  the  table  and  mix  them  at  time  t. 

Hints:  Assume  Newton’s  law  of  cooling  (Problem  2.27)  for  both  coffee  and  cream 
(where  it  is  a  law  of  heating).  Combine  n'  units  of  cream  initially  at  temperature  Tg 
with  n  units  of  coffee  initially  at  temperature  To,  and  find  the  temperature  at  time  t 
in  (a)  and  in  (b)  assuming  that  the  air  temperature  remains  a  constant  Ta,  and  that 
the  proportionality  constant  in  the  law  of  cooling  is  the  same  for  both  coffee  and 
cream. 


34.  A  flexible  chain  of  length  l  is  hung  over  a  peg  with  one  end  of  the  chain  slightly  longer 
than  the  other.  Assuming  that  the  chain  slides  off  with  no  friction,  write  and  solve 
the  differential  equation  of  motion  to  show  that  y  =  yo  cosht^/2g/l,  0  <  y  <  1/2, 
where  2 y  is  the  difference  in  length  of  the  two  ends,  and  y  =  y0  when  t  =  0. 

35.  A  raindrop  falls  through  a  cloud,  increasing  in  size  as  it  picks  up  moisture.  Assume 
that  its  shape  always  remains  spherical.  Also  assume  that  the  rate  of  increase  of 
its  volume  with  respect  to  distance  fallen  is  proportional  to  the  cross-sectional  area 
of  the  drop  at  any  time  (that  is,  the  mass  increase  dm  =  pdV  is  proportional  to 
the  volume  nr2  dy  swept  out  by  the  drop  as  it  falls  a  distance  dy).  Show  that  the 
radius  r  of  the  drop  is  proportional  to  the  distance  y  the  drop  has  fallen  if  r  =  0 
when  y  =  0.  Recall  that  when  m  is  not  constant,  Newton’s  second  law  is  properly 
stated  as  ( d/dt)(mv )  =  F .  Use  this  equation  to  find  the  distance  y  which  the  drop 
falls  in  time  t  under  the  force  of  gravity,  if  y  =  y  =  0  at  t  =  0.  Show  that  the 
acceleration  of  the  drop  is  g/ 7  where  g  is  the  acceleration  of  gravity. 

36.  (a)  A  rocket  of  (variable)  mass  m  is  propelled  by  steadily  ejecting  part  of  its  mass 

at  velocity  u  (constant  with  respect  to  the  rocket).  Neglecting  gravity,  the 
differential  equation  of  the  rocket  is  m(dv/dm)  =  —u  as  long  as  »  <  c,  c  = 
speed  of  light.  Find  v  as  a  function  of  m  if  m  =  mo  when  v  =  0. 

(b)  In  the  relativistic  region  (v/c  not  negligible),  the  rocket  equation  is 


dv 

m— —  =  —u 
dm 


Solve  this  differential  equation  to  find  v  as  a  function  of  m.  Show  that  v/c  = 
(1  —  x)/(l  +  x),  where  x  =  (m/mo)2u*c  ■ 
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37.  The  differential  equation  for  the  path  of  a  planet  around  the  sun  (or  any  object  in 
an  inverse  square  force  field)  is,  in  polar  coordinates, 

1  d  (  1  dr  \  1  k 

r 2  d6  \r2  d6 )  r3  r 2  ’ 

Make  the  substitution  u  =  l/r  and  solve  the  equation  to  show  that  the  path  is  a 
conic  section. 

38.  Use  L 15  and  L31  to  find  the  Laplace  transform  of  (1  —  cos  at) /t. 

39.  Use  L32  and  L9  to  find  the  Laplace  transform  of  tsinhot.  Verify  your  result  by 
finding  its  inverse  transform  using  the  convolution  integral. 

Use  the  Laplace  transform  table  to  evaluate: 

r  oo  °°  rn+ 1 

40.  /  t3e~4t  sinh  2t  dt  41.  1)"  /  te~2t  dt 

Jo  tC  Jn 

Find  the  inverse  Laplace  transform  of: 

42.  - ^ P 2  44.  _ 1 _ 

(p  +  a)3  '  (p2_|_a2)2  (p2  T  a2)3 

45.  Prove  the  following  shifting  or  translation  theorems  for  Fourier  transforms.  If  g(a) 
is  the  Fourier  transform  of  f(x),  then 

(a)  the  Fourier  transform  of  f(x  —  a)  is  e~xaa g(a); 

(b)  the  Fourier  transform  of  exl3x  f(x)  is  g(a  —  (3). 

Compare  Problems  8.19  to  8.27. 

46.  Use  the  table  of  Laplace  transforms  to  find  the  sine  and  cosine  Fourier  transforms 
of  e~x;  of  xe~x. 

Solve  Problems  47  and  48  either  by  Laplace  transforms  and  the  convolution  integral  or  by 
Green  functions. 


47.  y"  +  y  =  sec2  t 


48.  y"  +  y  =  tsinf 
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Table  of  Laplace  Transforms 

y  =  f(t),  t  >  o 
[y  =  f(t)  =  o,  t<  o] 


Y  =  L(y)  =  F(p)  =  /  e~*m  dt 


LI 

L2 

L3 

L4 

L5 

L6 

LI 

L8 

L9 

L10 

Lll 

L12 

L13 

L14 

L15 

L16 


1 

P 

Re  p  >  0 

e~at 

1 

p  +  a 

Re  (p+a)  >0 

sin  at 

a 

p2  +  a2 

Re  p  >  Im  a 

cos  at 

P 

p2  +  a2 

Re  p  >  Im  a 

tk,  k  >  -1 

k\  r(fc  +  i) 

-  or  - 

pk~\~  1  pk-\-l 

Re  p  >  0 

k  ~at  l,  ^  _i 

k\  r(fc  +  i) 

Re  (p  +  a)  >  0 

•  o  )  rv  L 

o P+a )fc+!  °r  (p  +  a)fc+1 

e~at  -  e_ht 

1 

Re  (p  +  a)  >  0 

Re  (p  +  b)  >  0 

b  —  a 

(P  +  a)(p  +  6) 

ae~at  — 

P 

Re  (p  +  a)  >  0 

Re  (p  +  b)  >0 

a  —  b 

( P  +  a)(p+b ) 

sinh  at 

a 

p2  —  a2 

Re  p  >  Re  a 

cosh  at 

P 

p2  —  a2 

Re  p  >  Re  a 

t  sin  at 

2  ap 

( p 2  +  a2)2 

Re  p  >  Im  a 

t  cos  at 

p2-a2 
(p2  +  a2)2 

Re  p  >  |  Im  a 

e~at  sin  bt 

b 

(p  +  a)2  +  b 2 

Re  (p+a)  >  Im  b\ 

e~at  cos  bt 

p  +  a 

(p  +  a)2  +  62 

Re  (p  +  a)  >  Im  b\ 

1  —  cos  at 

a2 

p(p2  +  a2) 

Re  p  >  |  Im  a 

at  —  sin  at 

a3 

p2(p2  +  a2) 

Re  p  >  |  Im  a 

2a3 


( p 2  +  a2)2 


L17 


sin  at  —  at  cos  at 


Re  p  >  |  Im  a| 
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Table  of  Laplace  Transforms  (continued) 


y  =  f{t),  t  >  o 
[y  =  f(t )  =  o,  t<o] 


e~at(l  -  at) 


pOO 

Y  =  L(y)  =  F(p)  =  /  e~ptf(t)  dt 
J  0 


p 

(p  +  a)2 

a 

arc  tan  — 


Re  (p  +  a)  >  0 


R,e  p  >  Im  a\ 


—  sin  at  cos  bt, 
t 

a  >  0,  b  >  0 


e~at  -  e~bt 


L22  1  -  erf 


,  a  >  0 


V2  VV’ 

(See  Chapter  11,  Section  9) 
Jo(at) 

(See  Chapter  12,  Section  12) 


T  r.  A  x  Jl,  t>a>0 

L24  u(t  —  a)  =  < 

(0,  t  <  a 

(unit  step,  or  Heaviside  function) 


L25  f(t)  =  u(t  —  a)  —  u{t  —  b) 


—  arc  tan  ■ 


( p 2  +  a2)  1 12 


e~ap  -  e~bp 


arc  tan  ■ 


Re  p  >  0 

Re  (p  +  a)  >0 
Re  \p  +  b)  >  0 

Re  p  >  0 


Re  p  >  Ima| 
or  Re  p  >  0 
for  real  a  ^  0 

Re  p  >  0 


L26  f(t)  I 


-  tanh  ( ^  ap) 


Re  p  >  0 


a  2  a  3  a  4  a 

S(t  —  a),  a  >  0 
(See  Section  11) 


L28  f(t)  = 


(  g(t  —  a),  t  >  a  >  0 
(0,  t  <  a 

=  g(t  —  a)u(t  —  a) 


e~paG{p) 

[G(p)  means  L(g).] 


L29 


e~atg(t) 


G(p  +  a) 
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Table  of  Laplace  Transforms  (continued) 


V  =  f(t),  t  >  0 

[y  =  f(t )  =  o,  t  <  o] 


Y  =  L(y)  =  F(p) 


e~ptf(t )  dt 


L30 

L31 

L32 

L33 

L34 


giat),  a  >  0 


g{t) 


(if  integrable) 


tng(t) 


g(r)  dr 


-g(1) 


(-i)' 


G(u)  du 

,dnG(p) 

dpn 


I  g(t  -  T)h(r)  dr  =  f  g(r)h(t  —  r)  dr 
Jo  Jo 

(convolution  of  g  and  h,  often 

written  as  g  *  h:  see  Section  10) 


iG(p) 


G(p)H(p) 


L35  Transforms  of  derivatives  of  y  (see  Section  9) : 
L(y')  =  pY  -y0 
L(y")  =  P2Y  -py0  -  y'0 
L(ym)  =  p3y  -  p2yo  -  py'o  -  y'd,  etc. 
L(y(n))  =  pnY  -  pn~1yo  -  Pn~2y'o - Vo 
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Calculus  of  Variations 


1.  INTRODUCTION 

What  is  the  shortest  distance  between  two  points?  You  probably  laugh  at  such  a 
simple  question  because  you  know  the  answer  so  well.  Can  you  prove  it?  We  shall 
see  how  to  prove  it  shortly.  Meanwhile  we  ask  the  same  question  about  a  sphere, 
for  example,  the  earth.  What  is  the  shortest  distance  between  two  points  on  the 
surface  of  the  earth,  measured  along  the  surface?  Again  you  probably  know  that 
the  answer  is  the  distance  measured  along  a  great  circle.  But  suppose  you  were 
asked  the  same  question  about  some  other  surface,  say  an  ellipsoid  or  a  cylinder  or 
a  cone.  The  curve  along  a  surface  which  marks  the  shortest  distance  between  two 
neighboring  points  is  called  a  geodesic  of  the  surface.  Finding  geodesics  is  one  of 
the  problems  which  we  can  solve  using  the  calculus  of  variations. 

There  are  many  others.  To  understand  what  the  basic  problem  is,  think  about 
finding  maximum  and  minimum  values  of  f(x)  in  ordinary  calculus.  You  find  f'(x) 
and  set  it  equal  to  zero.  The  values  of  x  you  find  may  correspond  to  maximum 
points  ,  minimum  points  W  ,  or  points  of  inflection  with  a  horizontal  tangent  7^- 
Suppose  that  in  solving  a  given  physical  problem  you  want  the  minimum  values 
of  a  function  f(x).  The  equation  f'(x)  =  0  is  a  necessary  (but  not  a  sufficient) 
condition  for  an  interior  minimum  point.  To  find  the  desired  minimum,  you  would 
find  all  the  values  of  x  such  that  f'(x)  =  0,  and  then  rely  on  the  physics  or  on 
further  mathematical  tests  to  sort  out  the  minimum  points.  We  use  the  general 
term  stationary  point  to  mean  simply  that  f(x)  =  0  there;  that  is,  stationary 
points  include  maximum  points,  minimum  points,  and  points  of  inflection  with 
horizontal  tangent.  In  the  calculus  of  variations,  we  often  state  problems  by  saying 
that  a  certain  quantity  is  to  be  minimized.  However,  what  we  actually  always  do 
is  something  similar  to  putting  f{x)  =  0,  above;  that  is,  we  make  the  quantity 
stationary.  The  question  of  whether  we  have  a  maximum,  a  minimum,  or  neither, 
is,  in  general,  a  difficult  mathematical  problem  (see  calculus  of  variations  texts) 
so  we  shall  rely  on  the  physics  or  geometry.  Fortunately,  in  many  applications, 
“stationary”  is  all  that  is  required  (Fermat’s  principle,  Problems  1  to  3;  Lagrange’s 
equations,  Section  5). 
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Now  what  is  the  quantity  which  we  want  to  make  stationary?  It  is  an  integral 

(1.1)  I  =  J.  F(x’y’y,'>dx  (where  y  =  , 

and  our  problem  is  this:  Given  the  points  (x\,yi)  and  ( £2,2/2 )  and  the  form  of 
the  function  F  of  x,  y,  and  y ',  find  the  curve  y  =  y( x)  (passing  through  the  given 
points)  which  makes  the  integral  /  have  the  smallest  possible  value  (or  stationary 
value).  Before  we  try  to  do  this,  let  us  look  at  several  examples. 

Example  1.  Geodesics:  Find  the  equation  y  =  y(x)  of  a  curve  joining  two  points  (£1,2/1) 
and  (£2,2/2)  hi  the  plane  so  that  the  distance  between  the  points  measured  along 
the  curve  (arc  length)  is  a  minimum.  Thus  we  want  to  minimize 

(1.2)  1=  r^l  +  y-dx; 

J  X\ 

this  is  equation  (1.1)  with  F(x,y,y')  =  \J\  +  y'2.  See  Section  2  and  Example  3.4. 


Example  2.  The  famous  brachistochrone  problem  (from  the  Greek:  brachistos  =  shortest, 
chronos  =  time,  as  in  chronometer):  Find  the  shape  of  a  wire  joining  two  given 
points  so  that  a  bead  will  slide  down  under  gravity  from  one  point  to  the  other 
(without  friction)  in  the  shortest  time.  Here  we  must  minimize  f  dt.  If  ds  is  an 
element  of  arc  length,  then  the  velocity  of  the  particle  is  v  =  ds/dt.  Then  we  have 


We  shall  see  later  that  (using  the  law  of  conservation  of  energy)  we  can  find  v  as  a 
function  of  x  and  y.  Then  the  integral  which  we  want  to  minimize,  namely 


is  of  the  form  (1.1).  See  Section  4. 


Example  3.  Soap  film  problem:  Suppose  a  soap  film  is  sus¬ 
pended  between  two  circular  wire  hoops  as  shown  in 
Figure  1.1;  what  is  the  shape  of  the  surface?  It  is 
clear  from  symmetry  that  it  is  a  surface  of  revolu¬ 
tion  (neglecting  gravity),  and  it  is  known  that  the 
soap  film  will  adjust  itself  so  that  the  surface  area 
is  a  minimum.  The  surface  area  can  be  written  as 
an  integral  and  again  our  problem  is  to  minimize  an 
integral.  See  Section  4. 

There  are  many  other  examples  from  physics.  A  chain  suspended  between  two 
points  hangs  so  that  its  center  of  gravity  is  as  low  as  possible;  the  2  coordinate 
of  the  center  of  gravity  is  given  by  an  integral.  Fermat’s  principle  in  optics  says 
that  light  traveling  between  two  given  points  follows  the  path  requiring  the  least 
time.  (This  is  a  simple,  but  inaccurate,  statement;  we  should  say  that  t  =  f  dt  is 
stationary — there  are  examples  where  it  is  a  maximum!  See  Problem  3).  Various 
other  basic  principles  in  physics  are  stated  in  the  form  that  certain  integrals  have 
stationary  values. 
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PROBLEMS,  SECTION  1 


The  speed  of  light  in  a  medium  of  index  of  refraction  n  is  v  =  ds/dt  =  c/n.  Then  the 
time  of  transit  from  A  to  B  is  t  =  f  ^  dt  =  c^1  Jj3  nds.  By  Fermat’s  principle  above,  t 
is  stationary.  If  the  path  consists  of  two  straight  line  segments  with  n  constant  over  each 
segment,  then 

rB 

/  nds  =  nidi  +  n2d2, 

J  A 

and  the  problem  can  be  done  by  ordinary  calculus.  Thus  solve  the  following  problems: 

1.  Derive  the  optical  law  of  reflection.  Hint:  Let  light  go  from  the 
point  A  =  (*1,1/1)  to  B  =  (*2,2/2)  via  an  arbitrary  point  P  = 

(*,  0)  on  a  mirror  along  the  *  axis.  Set  dt/dx  =  ( n/c)dD/dx  = 

0,  where  D  =  distance  APB,  and  show  that  then  9  =  <j>. 


2.  Derive  Snell’s  law  of  refraction:  m  sin#i  =  712  sin  62  (see  figure). 


3.  Show  that  the  actual  path  is  not  necessarily  one  of 
minimum  time.  Hint:  In  the  diagram,  A  is  a  source 
of  light;  CD  is  a  cross  section  of  a  reflecting  surface, 
and  B  is  a  point  to  which  a  light  ray  is  to  be  reflected. 

APB  is  to  be  the  actual  path  and  AP' B,  AP" B 
represent  varied  paths.  Then  show  that  the  varied  p 

(a)  Are  the  same  length  as  the  actual  path  if  CD  is  an  ellipse  with  A  and  B  as 
foci. 

(b)  Are  longer  than  the  actual  path  if  CD  is  a  line  tangent  at  P  to  the  ellipse  in 
(a). 

(c)  Are  shorter  than  the  actual  path  if  CD  is  an  arc  of  a  curve  tangent  to  the 
ellipse  at  P  and  lying  inside  it.  Note  that  in  this  case  the  time  is  a  maximum ! 

(d)  Are  longer  on  one  side  and  shorter  on  the  other  if  CD  crosses  the  ellipse  at  P 
but  is  tangent  to  it  (that  is,  CD  has  a  point  of  inflection  at  P). 


2.  THE  EULER  EQUATION 

Before  we  do  the  general  problem,  let  us  first  do  the  problem  of  a  geodesic  on  a 
plane;  we  shall  show  that  a  straight  line  gives  the  shortest  distance  between  two 
points.  (The  reason  for  doing  this  is  to  clarify  the  theory;  you  will  not  do  problems 
this  way.)  Our  problem  is  to  find  y  =  y{x)  which  will  make 

/  =  /  \Jl  +  y’2  dx 
J  X\ 

as  small  as  possible.  The  y(x)  which  does  this  is  called  an  extremal.  Now  we  want 
some  way  to  represent  algebraically  all  the  curves  passing  through  the  given  end¬ 
points,  but  differing  from  the  (as  yet  unknown)  extremal  by  small  amounts.  (We 
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assume  that  all  the  curves  have  continuous 
second  derivatives  so  that  we  can  carry  out 
needed  differentiations  later.)  These  curves 
are  called  varied  curves;  there  are  infinitely 
many  of  them  as  close  as  we  like  to  the  ex¬ 
tremal.  We  construct  a  function  representing 
these  varied  curves  in  the  following  way  (Fig¬ 
ure  2.1).  Let  rj(x)  represent  a  function  of  x 
which  is  zero  at  X\  and  X2 ,  and  has  a  contin¬ 
uous  second  derivative  in  the  interval  X\  to 
#2,  but  is  otherwise  completely  arbitrary.  We 
define  the  function  Y  ( x )  by  the  equation 

(2.1)  Y(x)  =  y(x)  +  er](x), 


where  y(x)  is  the  desired  extremal  and  e  is  a 

parameter.  Because  of  the  arbitrariness  of  y(x),  Y  ( x )  represents  any  (single- valued) 
curve  (with  continuous  second  derivative)  you  want  to  draw  through  (x\,yi)  and 
( X2,yi )•  Out  of  all  these  curves  Y(x)  we  want  to  pick  the  one  curve  that  makes 


rx2  - - 

(2.2)  1=  VI  +  Y'2dx 

J  Xi 

a  minimum.  Now  I  is  a  function  of  the  parameter  e;  when  e  =  0,  Y  =  y(x),  the 
desired  extremal.  Our  problem  then  is  to  make  1(e)  take  its  minimum  value  when 
e  =  0.  In  other  words,  we  want 

(2.3)  —r  =  Q  when  e  =  0. 

de 

Differentiating  (2.2)  under  the  integral  sign  with  respect  to  the  parameter  e,  we  get 


(2.4) 


dl  _ 

rX2  i  i 

/  C)V< 

de 

Jxi  2\A  +  r'2 

1  de  ) 

Differentiating  (2.1)  with  respect  to  x,  we  get 


(2.5) 


Y'(x)  =  y'(x)  +  erj'(x). 


Then  from  (2.5)  we  have 

dY' 

(2.6)  —  =  y'(x). 

We  see  from  (2.1)  that  putting  e  =  0  means  putting  Y (x)  =  y(x).  Then  substituting 
(2.6)  into  (2.4)  and  putting  di/de  equal  to  zero  when  e  =  0,  we  get 


(2.7) 


\Zi  +  y' 


We  can  integrate  this  by  parts  (since  we  assumed  that  rj  and  y  have  continuous 
second  derivatives).  Let 


ii  =  y' /  \/l  +  y'2 ,  dv  =  rf(x)dx. 
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dx,  v  =  rj(x), 


</  ,  ,X2  fX2  M  d  (  y' 

=='n(x)  -  /  t)(x)— 

+  y’2  X1  Jx i  dx  V  n/T+T1 


The  first  term  is  zero  because  ?y(;r)  =  0  at  the  endpoints.  In  the  second  term,  recall 
that  rj(x)  is  an  arbitrary  function.  This  means  that 


for  otherwise  we  could  select  some  function  y(x)  so  that  the  integral  would  not  be 
zero.  Notice  carefully  here  that  we  are  not  saying  that  when  an  integral  is  zero, 
the  integrand  is  also  zero;  this  is  not  true  (as,  for  example  sin  x  dx  =  0  shows). 
What  we  are  saying  is  that  the  only  way  J ^  f(x)rj(x)  dx  can  always  be  zero  for 
every  y(x)  is  for  /( x)  to  be  zero.  You  can  prove  this  by  contradiction  in  the  following 
way.  If  f(x)  is  not  zero,  then,  since  y(x)  is  arbitrary,  choose  y  to  be  positive  where 
/  is  positive  and  negative  where  /  is  negative.  Then  frj  is  positive,  so  its  integral 
is  not  zero,  in  contradiction  to  the  statement  that  J  fr]  dx  =  0  for  every 
Integrating  (2.8)  with  respect  to  x,  we  get 

y ’ 

— ,  =  const. 


or  y'  =  const.  Thus  the  slope  of  y(x)  is  constant,  so  y(x)  is  a  straight  line  as  we 
expected. 

Now  we  could  go  through  this  process  with  every  calculus  of  variations  problem. 
It  is  much  simpler  to  do  the  general  problem  once  for  all  and  find  a  differential 
equation  which  we  can  use  to  solve  later  problems.  The  problem  is  to  find  the  y 
which  will  make  stationary  the  integral 


F{x,  y,  y')  dx, 


where  F  is  a  given  function.  The  y{x)  which  makes  I  stationary  is  called  an  extremal 
whether  /  is  a  maximum  or  minimum  or  neither.  The  method  is  the  one  we  have 
just  used  with  the  straight  line.  We  consider  a  set  of  varied  curves 


Y  (x)  =  y(x)  +  £Tf(x) 


just  as  before.  Then  we  have 


rx  2 

1(e)  =  /  F(x,  Y,  Y')dx, 

J  X\ 


and  we  want  (d/de)I(e)  =  0  when  e  =  0.  Remembering  that  Y  and  Y'  are  functions 
of  e,  and  differentiating  under  the  integral  sign  with  respect  to  e,  we  get 

.  dl  n  fdFdY  dF  dY'\  , 


de  JXl  \d Y  de  dY'  de 
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Substituting  (2.1)  and  (2.5)  into  (2.11),  we  have 

(2.12) 


dJ 

de 


X2  r&F  .  ,  DF  ,,  . ' 
V{x)  +  1^7jV  ix) 


dx. 


dY  7  BY' 

We  want  dl/de  =  0  at  e  =  0;  recall  that  e  =  0  means  Y  =  y.  Then  (2.12)  gives 


(2.13) 


e=0 


dF  dF, 


dx  =  0. 


Assuming  that  y "  is  continuous,  we  can  integrate  the  second  term  by  parts  just  as 
in  the  straight-line  problem: 

fX2  dF  ,  dF  ,  X2  fX2  d  fdF\  .  ,  , 

<2i4)  L  wiix)dx=iv vix)  „  -  L  ^  [w) vix) 

The  integrated  term  is  zero  as  before  because  y{x)  is  zero  at  x\  and  X2 ■  Then  we 
have 


(dl\  = 

rx2 

'dF 

d  dF' 

o 

II 

vu 

J  X\ 

dy 

dx  dy' 

(2.15) 

As  before,  since  rj{x)  is  arbitrary,  we  must  have 


rj{x)  dx  =  0. 


d  dF 

dF 

(2.16) 

dx  d  y1 

7W  =  0- 
dy 

Euler  equation 

This  is  the  Euler  (or  Euler-Lagrange)  equation. 

Any  problem  in  the  calculus  of  variations,  then,  is  solved  by  setting  up  the 
integral  which  is  to  be  stationary,  writing  what  the  function  F  is,  substituting  it 
into  the  Euler  equation,  and  solving  the  resulting  differential  equation. 

Example.  Let’s  find  the  geodesics  in  a  plane  again,  this  time  using  the  Euler  equation  as 
you  will  do  in  problems. 

We  are  to  minimize 

ryirw, 

J  X\ 

so  we  have  F  =  \/l  +  y'2.  Then 

dF  _  y' 

dv'  \/l  +  y'2  ’  dy 
and  the  Euler  equation  gives 

d  (  y' 

dx  \  y/l  +  y'2 


as  we  had  in  (2.8). 
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PROBLEMS,  SECTION  2 


Write  and  solve  the  Euler  equations  to  make  the  following  integrals  stationary.  In  solving 
the  Euler  equations,  the  integrals  in  Chapter  5,  Section  1,  may  be  useful. 


rx9. 

l.  /  v^vi  +  y/2  dx 

J  X\ 

2.  r* 

jxi  x 

rx?. 

3.  /  x\J\  —  y'2  dx 

J  X\ 

rx  2 

4.  /  xds 

J  X\ 

rx  2 

5-  /  (y'2+y2)dx 

J  X\ 

rx  2 

6-  /  ( y,2  +  y/y)dx 

J  X\ 

7.  fX2  ex  \J  1  +  y'2  dx  Hint:  In  the  last  integration,  let  u  =  ex 
and  see  Chapter  5,  Problem  1.6. 


f*  2  _ 

8.  /  x\/ y'2  +  x 2  dx 

J  X\ 


9. 


+  yy'f  dx 


10. 


x2  dx 
xy'  +  1 


3.  USING  THE  EULER  EQUATION 

Other  Variables  We  have  used  x  and  y  as  our  variables.  But  the  mathematics 
is  just  the  same  if  we  use  some  other  letters,  for  example  polar  coordinates  r  and 
9.  To  minimize  (make  stationary)  the  integral 

J  F(r,9,9')  dr  where  9'  =  d9/dr, 

we  solve  the  Euler  equation 


(3.1) 

d  fdF\  dF 

d//  [WJ  ~~d9-' 

To  minimize  f  F(t,  x, 

x)  dt  where  x  =  dx/dt ,  we  solve 

(3.2) 

d  dF  dF 

dt  dx  dx 

Notice  that  the  first  derivative  in  the  Euler  equation  [ d/dx  in  (2.16),  d/dr  in  (3.1), 
d/dt  in  (3.2)]  is  with  respect  to  the  integration  variable  in  the  integral.  The  partial 
derivatives  are  with  respect  to  the  other  variable  and  its  derivative  [y  and  y1  in 
(2.16),  9  and  9'  in  (3.1),  x  and  x  in  (3.2)]. 


Example  1.  Find  the  path  followed  by  a  light  ray  if  the  index  of  refraction  (  in  polar 
coordinates)  is  proportional  to  r~2.  We  want  to  make  stationary 


/ 


n  ds 


or 


r  2  V dr 2  +  r2  d92 


r  2  \/ 1  +  r29'2  dr. 
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The  Euler  equation  is  then  (3.1)  with  F 
have 


d  (  r-W  \ 

—  — - . =  =0  or 

dr  \  y/i +  r20'2J 


r  2\/l  +  r29'2 .  Since  dF/dO  =  0,  we 
O’ 

—  :  =  const.  =  K. 

VI  +  r20'2 


Solve  for  0'  and  integrate  (see  Chapter  5,  Problem  1.5): 


O'2  =  K2{l  +  r20'2)  so  0'\l  -  K2r2)  =  K2 , 
„,_d0__  K 

dr  Vl  ~  K2r 2  ’ 

0  =  arc  sin  Kr  +  const. 


First  Integrals  of  the  Euler  Equation  In  some  problems  the  integrand  F  in  I 
[see  equation  (1.1)]  does  not  contain  y  (that  is,  F  does  not  contain  the  dependent 
variable).  Then  dF/dy  =  0  and  the  Euler  equation  becomes 


d_dF_ 
dx  dy' 


=  0, 


dF 

——  =  const. 

dy 


This  happened  in  the  example  and  most  of  your  problems  in  Section  2.  Because 
dF/dy  was  zero,  we  were  able  to  integrate  the  Euler  equation  once;  the  equation 
dF/dy'  =  const,  is  for  this  reason  called  a  first  integral  of  the  Euler  equation. 

There  is  another  less  obvious  case  in  which  we  can  easily  find  a  first  integral 
of  the  Euler  equation.  Let  us  show  this  by  an  example  (the  soap  film  problem 
mentioned  in  Section  1). 


Example 2.  Our  problem  is  this:  Given  two  points  Pi  and  P2  (not  too  far  apart),  we  are 
going  to  draw  a  curve  joining  Pi  and  P2  and  revolve  it  about  the  x  axis  to  form  a 
surface  of  revolution.  We  want  the  equation  of  the  curve  so  that  the  surface  area 
will  be  a  minimum.  That  is,  we  want  to  minimize  I  =  f  2n yds.  We  usually  write 

ds  =  \f\  +  y'2  dx.  Instead,  let  us  write  ds  =  \J\  +  x'2  dy ,  where  x'  =  dx/dy.  Then 
I  =  J  2iry\/l  +  x'2dy.  Recall  from  (3.1)  and  (3.2)  and  the  discussion  following 
them  how  to  write  the  Euler  equation  in  various  sets  of  variables.  Here  y  is  the 
variable  of  integration,  F  =  yy/ 1  +  x'2,  and  the  Euler  equation  is 


(3.3) 


d  dF  dF 
dy  dx'  dx 


Since  dF/dx  =  0,  (3.3)  becomes 


yx' 

Vl  +  x'2 


=  0. 


d_ 

dy 
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This  is  the  simplified  equation  we  wanted.  We  integrate  once,  solve  for  x'  and 
integrate  again  (see  Chapter  5,  Problem  1.3): 


/ 

x 

X 

y 

The  graph  of  this  equation  is  called  a  catenary ;  it  is  shown  in  Figure  3.1  for  the 
special  case  Ci  =  1,  C2  =  0,  y  =  cosha:  =  \(ex  +  e~x).  The  catenary  does  not 
always  give  the  solution  to  the  soap  film  problem.  If  the  given  points  (or  the  hoops 
in  Figure  1.1)  are  too  far  apart,  the  soap  film  may  break  into  two  parts  (circular 
films  on  the  hoops).  For  further  discussion  see  Courant  and  Robbins,  Chapter  7, 
Section  11,  and  Arfken  and  Weber,  Chapter  17.  For  another  problem  involving  a 
catenary,  see  Problem  6.4. 

Observe  that  the  method  used  in  this  example  will  simplify  any  problem  in 
which  I  —  f  F(y ,  y ')  dx  does  not  have  the  independent  variable  x  in  the  integrand. 
We  change  to  y  as  the  integration  variable  making  the  substitutions 


in  I.  Then  the  integrand  is  a  function  of  y  and  x' ,  so  the  Euler  equation  [now  (3.3)] 
simplifies  since  dF/dx  =  0.  (See  also  Problem  8.1.) 


Example  3.  Find  a  first  integral  of  the  Euler  equation  to  make  stationary  the  integral 


Since  x  is  missing  in  the  integrand,  we  change  to  y  as  the  integration  variable;  then 
by  (3.4) 


I  = 


F(y,  x')  dy. 


We  see  that  dF/dx  =  0;  from  (3.3)  the  Euler  equation  is 


d_  ( dF\ 
dy  \dx' ) 
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The  first  integral  of  the  Euler  equation  is,  then, 

(3.6)  - =  const. 

\fy  \j x'2  +  1 

Example  4.  Find  the  geodesics  on  the  cone  z2  =  8 (x2+y2).  Using  cylindrical  coordinates, 
we  have  z2  =  8 r2,  z  =  T\J 8,  dz  =  dry/8,  so 

ds 2  =  dr2  +  r2d02  +  dz 2  =  dr 2  +  r2d02  +  8  dr2  =  9  dr2  +  r2d02. 


We  want  to  minimize 


I  =  J  ds  =  J  \J 9  dr2  +  r2d02  =  j  Vd  +  r29'2  dr. 

Note  that  we  use  r  as  the  integration  variable  since  the  integrand  contains  r  but 
not  9.  Then  dF/d6  =  0,  and  we  can  immediately  write  a  first  integral  of  the  Euler 
equation: 


-f-)=o 

dr  \d6'  I 


d  F 


r26‘ 


=  const.  =  K. 


89'  ^9  +  r2  e'2 

We  solve  for  9'  and  integrate  again. 

r40'2  =  K2(9  +  r29'2), 

9l2{r 4  -  K2r2)  =  9 1<2, 

fdB=  (  3Kdr  . 

J  J  r\J r2  —  K 2 

From  computer  or  tables  (or  see  Chapter  5,  Problem  1.6): 

K 

9  +  a  =  3  arc  cos  —  (a  =  const,  of  integration) 
r 


cos 


K 


or  r  cos 


=  K. 


PROBLEMS,  SECTION  3 

Change  the  independent  variable  to  simplify  the  Euler  equation,  and  then  find  a  first 
integral  of  it. 


1. 


3. 


y 3,72  ds 


V1  +  yr 


J2 


Vx'2  +  : 


:dy 


rx< 

L 

h  HyVi 

J  x\ 


dx 


,2  +  y2  dx 


Write  and  solve  the  Euler  equations  to  make  the  following  integrals  stationary.  Change 
the  independent  variable,  if  needed,  to  make  the  Euler  equation  simpler. 


5. 


\f  1  +  y2y'2  dx 


px 

5-  y 

</Xl 


yy 


l  +  yy' 


■  dx 
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rx 2  p02  / _ 

7.  /  (?/  +y2)d*  8.  /  \/ r'2  +  r2  r'=dr/dd 

Jx  1 

9.  f  \J 6'2  +  sin2  9  d<j>,  6'  =  dd/d<j>  10.  f  s^1  \A2  +  s'2  dt,  s'  =  ds/dt 

J  (f) ^  J 

Use  Fermat’s  principle  to  find  the  path  followed  by  a  light  ray  if  the  index  of  refraction  is 
proportional  to  the  given  function. 

11.  x  + 1  12.  y  1  13.  y/y  14.  r~ 1 

15.  Find  the  geodesics  on  a  plane  using  polar  coordinates. 

16.  Show  that  the  geodesics  on  a  circular  cylinder  (with  elements  parallel  to  the  2  axis) 
are  helices  az  +  bd  =  c,  where  a,  b,  c  are  constants  depending  on  the  given  endpoints. 
(Hint:  Use  cylindrical  coordinates.)  Note  that  the  equation  az  +  bd  =  c  includes  the 
circles  2  =  const,  (for  b  =  0),  straight  lines  9  =  const,  (for  a  =  0),  and  the  special 
helices  az  +  b9  =  0. 

17.  Find  the  geodesics  on  the  cone  x2  +  y2  =  z2.  Hint:  Use  cylindrical  coordinates. 

18.  Find  the  geodesics  on  a  sphere.  Hints:  Use  spherical  coordinates  with  constant 
r  =  a.  Choose  your  integration  variable  so  that  you  can  write  a  first  integral  of  the 
Euler  equation.  For  the  second  integration,  make  the  change  of  variable  w  =  cot  8. 
To  recognize  your  result  as  a  great  circle,  find,  in  terms  of  spherical  coordinates  0 
and  cf>,  the  equation  of  intersection  of  the  sphere  with  a  plane  through  the  origin. 


4.  THE  BRACHISTOCHRONE  PROBLEM;  CYCLOIDS 


We  have  already  mentioned  this  problem  in  Section  1.  We  are 
given  the  points  (x\,yi)  and  (*2,2/2);  we  choose  axes  through 
the  point  1  with  the  y  axis  positive  downward  as  shown  in  Figure 
4.1.  Our  problem  is  to  find  the  curve  joining  the  two  points, 
down  which  a  bead  will  slide  (from  rest)  in  the  least  time;  that 
is,  we  want  to  minimize  J  dt.  Let  v  =  0  initially,  and  let  y  =  0 
be  our  reference  level  for  potential  energy.  Then  at  the  point 
(x,  y)  we  have 


Figure  4.1 


.  X  o  L 

kmetic  energy  =  —mv  =  -m 

potential  energy  =  —mgy. 

The  sum  of  the  two  energies  is  zero  initially  and  therefore  zero  at  any  time  since 
the  total  energy  is  constant  when  there  is  no  friction.  Hence  we  have 

-mv2  —  mgy  =  0  or  v  =  \/2gy. 

Then  the  integral  which  we  want  to  minimize  is 


ds 


ds  _  J_  fX2  y/1  +  y'2 

\fWd  V5g  J Xx  Vv 


dt  = 


v 
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This  is  the  integral  (3.5)  in  Example  3,  Section  3.  Then  the  first  integral  of  the 
Euler  equation  is  given  by  (3.6): 


Solving  for  x',  we  get 


\fy\Jx'2  +  1 


=  v/H- 


(4.1) 


dx  I  cy 
dy  V  1  ~cy' 


This  simplifies  if  we  let  cy  =  sin2  |  =  ^(1  —  cos  9).  We  find  (Problem  1) 


(4.2) 


dx 


x 


—  sin2  -  dO  =  —  (1  —  cos  9)  d9 , 
c  2  2cy  ’ 

- 1~(9  —  sin0)  +  d . 


The  equations  for  x  and  y  as  functions  of  9  are  parametric  equations  of  the  curve 
along  which  the  particle  slides  in  minimum  time.  Since  we  have  chosen  axes  to 
make  the  curve  pass  through  the  origin,  x  =  y  =  0  must  satisfy  the  equations  of 
the  curve,  so  c'  =  0,  and  we  have 


x  =  r—(9  —  sin0), 

(4.3)  7 

y=  ^(i-cos0). 

We  shall  now  show  that  these  are  the  parametric  equations  of  a  cycloid.  Imagine 
a  circle  of  radius  a  (say  a  wheel)  in  the  (x,  y)  plane  rolling  along  the  x  axis.  Let  it 
start  tangent  to  the  x  axis  at  the  origin  O  in  Figure  4.2.  Place  a  mark  on  the  circle 
at  O.  As  the  circle  rolls,  the  mark  traces  out  a  cycloid  as  shown  in  Figure  4.3.  Let 
point  P  in  Figure  4.2  be  the  position  of  the  mark  when  the  circle  is  tangent  to  the  x 
axis  at  A ;  let  (a:,  y)  be  the  coordinates  of  P.  Since  the  circle  rolled,  OA  =  PA  =  a9 
with  9  in  radians.  Then  from  Figure  4.2  we  have 


x  =  OA  —  PB  =  a9  —  a  sin  9  =  a{9  —  sin  9) , 
y  =  AB  =  AC  —  BC  =  a  —  a  cos  9  —  a(  1  —  cos  9) . 


Figure  4.2 
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Equations  (4.4)  are  the  parametric  equations  of  a  cycloid.  Comparing  (4.3),  we 
see  that  the  brachistochrone  is  a  cycloid  as  we  claimed.  Note  that,  since  we  have 
taken  the  y  axis  positive  down  (Figures  4.1  and  4.4),  the  circle  which  generates  the 
brachistochrone  rolls  along  the  under  side  of  the  x  axis. 

From  either  (4.3)  or  (4.4),  we  see  that 
all  cycloids  are  similar;  that  is  they  differ 
from  each  other  only  in  size  (determined 
by  a  or  c)  and  not  in  shape.  Figure  4.4  is 
a  sketch  of  a  cycloid  for  arbitrary  a.  If  the 
given  endpoints  for  the  wire  along  which 
the  bead  slides  are  O  and  P3,  we  see  that 
the  particle  slides  down  to  P2  and  back  up  Figure  4  4 

to  P3  in  minimum  time!  At  point  P2  the 

circle  has  rolled  halfway  around  so  OA  =  ^  •  2ira  =  ira.  For  any  point  Pl  on  arc 
OP2,  Pi  is  below  the  line  OP2,  and  the  coordinates  (*1,2/1)  of  Pi  have 


2/i  P2A  2  a  2 
x\  AO  ira  7 r 


or  Xi/yi  <  7t/2.  For  points  like  P3  on  P2P,  *3/2/3  >  7t/2,  whereas  at  P2,  we  have 
*2/2/2  =  7r/ 2  (Problem  2).  Then  if  the  right-hand  endpoint  is  (*,  y)  and  the  origin  is 
the  left-hand  endpoint,  we  can  say  that  the  bead  just  slides  down,  or  slides  down  and 
back  up,  depending  on  whether  x/y  is  less  than  or  greater  than  7r/2  (Problem  2). 


PROBLEMS,  SECTION  4 


1.  Verify  equations  (4.2). 

2.  Show,  in  Figure  4.4,  that  for  a  point  like  P3,  *3/2/3  >  7i/2  and  for  P2,  *2/2/2  =  tt/2. 

3.  In  the  brachistochrone  problem,  show  that  if  the  particle  is  given  an  initial  velocity 
Vo  =£■  0,  the  path  of  minimum  time  is  still  a  cycloid. 

4.  Consider  a  rapid  transit  system  consisting  of  frictionless  tun¬ 
nels  bored  through  the  earth  between  points  A  and  B  on  the 
earth’s  surface  (see  figure).  The  unpowered  passenger  trains 
would  move  under  gravity.  Using  polar  coordinates,  set  up  f  dt 
to  be  minimized  to  find  the  path  through  the  earth  requiring 
the  least  time.  See  Chapter  6,  Problem  8.21,  for  the  potential 
inside  the  earth.  Find  a  first  integral  of  the  Euler  equation.  Evaluate  the  constant 
of  integration  using  dr/dd  =  0  when  r  =  tq  (where  r 0  is  the  deepest  point  of  the 
tunnel — see  figure).  Now  solve  for  9'  =  dd/dr  as  a  function  of  r.  Substitute  this 
into  the  integral  for  t  and  evaluate  the  integral  to  show  that  the  transit  time  is 


T  =  n 


R2  -  r02 
gR 


Hint:  Find  2  f  dt. 

J  r=r  q 


Evaluate  T  for  ro  =  0  (path  through  the  center  of  the  earth — see  Chapter  8,  Problem 
5.35);  for  ro  =  0.991?..  [For  more  detail,  see  Am.  J.  Phys.  34  701-704  (1966).] 


In  Problems  5  to  7,  use  Fermat’s  principle  to  find  the  path  followed  by  a  light  ray  if  the 
index  of  refraction  is  proportional  to  the  given  function. 

5.  *“1/2  6.  (2/  —  1)-1/2  7.  (2*  +  5)-1/2 
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5.  SEVERAL  DEPENDENT  VARIABLES;  LAGRANGE'S  EQUATIONS 


It  is  not  necessary  to  restrict  ourselves  to  problems  with  one  dependent  variable  y. 
Recall  that  in  ordinary  calculus  problems  the  necessary  condition  for  a  minimum 
point  on  2  =  z( x)  is  dz/dx  =  0;  for  a  function  of  two  variables  z  =  z{x ,  y),  we  have 
the  two  conditions  dz/dx  =  0  and  dz/dy  =  0.  We  have  a  somewhat  analogous 
situation  in  the  calculus  of  variations.  Suppose  that  we  are  given  an  F  which  is  a 
function  of  y ,  z,  dy/dx,  dz/dx,  and  x,  and  we  want  to  find  two  curves  y  =  y(x) 
and  2  =  z(x)  which  make  I  =  f  F  dx  stationary.  Then  the  value  of  the  integral  I 
depends  on  both  y(x)  and  z{x)  and  you  might  very  well  guess  that  in  this  case  we 
would  have  two  Euler  equations,  one  for  y  and  one  for  z,  namely 


(5.1) 


d_  / dF\ 
dx  \dy' ) 
d_  / dF\ 
dx  \dz' ) 


By  carrying  through  calculations  similar  to  those  we  used  in  deriving  the  single 
Euler  equation  for  the  one  dependent  variable  case  you  can  show  (Problem  1(a)) 
that  this  guess  is  correct.  If  there  are  still  more  dependent  variables  (but  one 
independent  variable),  then  we  write  an  Euler  equation  for  each  dependent  variable. 
It  is  also  possible  to  consider  a  problem  with  more  than  one  independent  variable 
(see  Problem  1(b))  or  with  F  depending  on  y"  as  well  as  x,  y,  y'  (see  Problem  1(c)). 

There  is  a  very  important  application  of  equations  like  (5.1)  to  mechanics.  In 
elementary  physics,  Newton’s  second  law  F  =  ma  is  a  fundamental  equation.  In 
more  advanced  mechanics,  it  is  often  useful  to  start  from  a  different  assumption 
(which  can  be  proved  equivalent  to  Newton’s  law;  see  mechanics  text  books.)  This 
assumption  is  called  Hamilton’s  principle.  It  says  that  any  particle  or  system  of 
particles  always  moves  in  such  a  way  that  /  =  Jt*2  L  dt  is  stationary,  where  L  =  T—V 
is  called  the  Lagrangian\  T  is  the  kinetic  energy,  and  V  is  the  potential  energy  of 
the  particle  or  system. 


Example  1.  Use  Hamilton’s  principle  to  find  the  equations  of  motion  of  a  single  particle 
of  mass  m  moving  (near  the  earth)  under  gravity. 

We  first  write  the  formulas  for  the  kinetic  energy  T  and  the  potential  energy  V 
of  the  particle.  (It  is  convenient  to  use  a  dot  to  mean  a  derivative  with  respect  to 
t  just  as  we  use  a  prime  to  indicate  a  derivative  with  respect  to  x;  thus  dx/dt  =  x, 
dy/dt  =  y,  dy/dx  =  y',  d2x/dt2  =  x,  etc.)  The  equations  for  T,  V,  and  L  =  T  —V, 
are: 

T  =  ^tou2  =  +  y2  +  z2), 

(5.2)  V  =  mgz, 

L  =  T  —  V  =  -m{x2  +  y2  +  z2)  —  mgz. 

Here  t  is  the  independent  variable;  x,  y,  and  3  are  the  dependent  variables,  and 
L  corresponds  to  what  we  have  called  F  previously.  Then  to  make  I  =  J)*2  L  dt 
stationary,  we  write  the  corresponding  Euler  equations.  There  are  three  Euler 
equations,  one  for  x,  one  for  y,  and  one  for  z.  The  Euler  equations  are  called 
Lagrange’s  equations  in  mechanics  [see  (5.3)  next  page]. 
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(5.3) 


d  dL  dL 

dt  dx  dx 

d  dL  dL  n 

—  — - —  =  0,  Lagrange  s  equations 

dt  ay  ay 

±dL_dL 

dt  dz  dz 


Substituting  L  in  (5.2)  into  Lagrange’s  equations  (5.3),  we  get 


(5.4) 


—  ( mx )  =  0 

■  £(mp)  =  0 

—  ( mz )  +  mq  =  0 
dV  ’ 


{x  —  const., 
y  =  const., 
z  =  ~9- 


These  are  just  the  familiar  equations  obtained  from  Newton’s  law;  they  say  that 
in  the  gravitational  field  near  the  surface  of  the  earth,  the  horizontal  velocity  is 
constant  and  the  vertical  acceleration  is  —g.  In  this  problem  you  may  say  that  it 
would  have  been  simpler  just  to  write  the  equations  from  Newton’s  law  in  the  first 
place!  This  is  true  in  simple  cases,  but  in  more  complicated  problems  it  may  be 
much  simpler  to  find  one  scalar  function  (that  is,  L)  than  to  find  six  functions  (that 
is,  the  components  of  the  two  vectors,  force  and  acceleration).  For  example,  the 
acceleration  components  in  spherical  coordinates  are  quite  complicated  to  derive  by 
elementary  methods  (see  mechanics  text  books),  but  you  should  have  no  trouble 
deriving  the  equations  of  motion  in  polar,  cylindrical  or  spherical  coordinates  using 
the  Lagrangian.  Let’s  do  some  examples. 


Example  2.  Use  Lagrange’s  equations  to  find  the  equations  of  motion  of  a  particle  in 
terms  of  the  polar  coordinate  variables  r  and  6. 

The  element  of  arc  length  in  polar  coordinates  is  ds  where 

(5.5)  ds2  =  dr2  +  r2  d02 . 


The  velocity  of  a  moving  particle  is  ds/dt ;  from  (5.5)  we  get 
(5.6) 


u2  = 


=r2+r2e2. 


dt 


The  kinetic  energy  is  |m»2,  so  we  have 

T  =  —m(r2  +  r2d2), 

L  =  T  —  V  =  ^m(r2  +  r2d 2)  —  V(r,&), 


(5.7) 
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where  V (r,  0)  is  the  potential  energy  of  the  particle.  Lagrange’s  equations  in  the 
variables  r,  9  are: 


d  dL  dL 

/c  or  dt  dr  dr 

^  ±_d±_d± 

dt  d9  d6 

Substituting  L  from  (5.7) 

(5.9) 

The  r  equation  of  motion 

(5.10) 

The  0  equation  is 

or,  dividing  by  r, 

(5.11) 

Now  the  quantities  —dV/dr  and  — (1  /r)(dV/dO)  are  the  components  of  the  force 
(F  =  -W)  on  the  particle  in  the  r  and  0  directions.  (See  Chapter  6.)  Then 
equations  (5.10)  and  (5.11)  are  just  the  components  of  ma  =  F;  the  acceleration 
components  are  then 

ar  —  r  —  rO2, 
ag  =  rO  +  2  ft). 

The  second  term  in  ar  is  a  familiar  one;  it  is  just  the  centripetal  acceleration  v2/r 
when  v  =  rO  (the  minus  sign  indicates  that  it  is  toward  the  origin).  The  second 
term  in  ag  is  called  the  Coriolis  acceleration. 

We  show  by  an  example  another  important  point  about  Lagrange’s  equations. 


into  (5.8),  we  get 

d  .  -2  dV 

—  vmr)  —  mrO  +  — —  =  0, 
dt  dr 

d  .  2  • ,  dV 

-(mr  0)  +  W  =  0. 

is,  then, 

A2x  9V 
m(r  —  r0~)  =  — — . 

dr 


dV 

m(r20  +  2  rfO)  =  — , 
dO 


ldV 

mirO  +  2  rO)  = - — . 

r  dO 


Example  3.  A  mass  mi  moves  without  friction  on  the  surface 
of  the  cone  shown  (Figure  5.1).  Mass  m2  is  joined  to 
mi  by  a  string  of  constant  length;  m2  can  move  only 
vertically  up  and  down.  Find  the  Lagrange  equations  of 
motion  of  the  system. 

Let’s  use  spherical  coordinates  p,  0 ,  <j)  for  mi,  and 
coordinate  2  for  m-2-  Then  for  mi,  v 2  =  ( ds/dt )2  = 
p 2  +  p202  +  p2  sin2  0  (j>2  [Chapter  5,  equation  (4.20)],  and 
for  m2,  v 2  =  z2.  The  potential  energy  mgh  of  mi  is 
migpcosO  and  of  m2  is  m.232.  Note  that  we  have  used 


Figure  5.1 
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four  variables:  p,  9,  </>,  z;  however,  there  are  not  four  Lagrange  equations  We  must 
use  the  equation  of  the  cone  (9  =  30°)  and  the  equation  p  +  \z\  —l  (string  of 
constant  length)  to  eliminate  9  and  either  p  or  z.  The  Lagrangian  L  must  always 
be  written  using  the  smallest  possible  number  of  variables  (we  say  that  we  eliminate 
the  constraint  equations).  Then,  with  9  =  30°,  sin#  =  |,  cos 9  =  ^V3,  9  =  0,  and 
z  =  —  |  z  |  =  —(l  —  p),  we  find  L  in  terms  of  p  and  <f>: 

L  =  ^toi(p2 3 4  +  p2<j>2/ 4)  +  ^m2p2  -  ^ m1gpVs  +  m2g{l  -  p). 

Thus  the  Lagrange  equations  are 

■^(mip  +  m2p)  -  mipcj)2 /A  +  ^migVz  +  m2g  =  0, 

^(r?zip2</>/4)  =0  or  p2(j)  =  const. 

PROBLEMS,  SECTION  5 

Consider  the  case  of  two  dependent  variables.  Show  that  if  F  =  F(x,  y,  z,  y' ,  z') 
and  we  want  to  find  y(x)  and  z(x)  to  make  I  =  f*2  F  dx  stationary,  then  y  and 
z  should  each  satisfy  an  Euler  equation  as  in  (5.1).  Hint:  Construct  a  formula 
for  a  varied  path  Y  for  y  as  in  Section  2  [Y  =  y  +  er/(x)  with  r/(x)  arbitrary] 
and  construct  a  similar  formula  for  z  [let  Z  =  z  +  e£(x),  where  £(x)  is  another 
arbitrary  function].  Carry  through  the  details  of  differentiating  with  respect 
to  e,  putting  e  =  0,  and  integrating  by  parts  as  in  Section  2;  then  use  the  fact 
that  both  g{x)  and  C,{x)  are  arbitrary  to  get  (5.1). 

Consider  the  case  of  two  independent  variables.  You  want  to  find  the  function 
u(x,y )  which  makes  stationary  the  double  integral 

rV2  r&2 

/  /  F(u,x,y,ux,uy)dxdy. 

Jy  i  Jxi 

Hint:  Let  the  varied  U(x,y)  =  u(x,y)  +  ey(x,y)  where  g(x,y)  =  0  at  x  =  xi, 
x  =  X2,  y  =  yi,  y  =  y2,  but  is  otherwise  arbitrary.  As  in  Section  2,  differentiate 
with  respect  to  e,  set  t  =  0,  integrate  by  parts,  and  use  the  fact  that  g  is 
arbitrary.  Show  that  the  Euler  equation  is  then 

d  dF  d  dF  dF 
dx  dux  dy  duy  du 

(c)  Consider  the  case  in  which  F  depends  on  x,  y,  y\  and  y" .  Assuming  zero 
values  of  the  variation  g(x)  and  its  derivative  at  the  endpoints  xi  and  x2,  show 
that  then  the  Euler  equation  becomes 

dF 

dx2  dy"  dx  dy'  +  dy 

2.  Set  up  Lagrange’s  equations  in  cylindrical  coordinates  for  a  particle  of  mass  m  in  a 
potential  field  V(r,  9,z).  Hint:  v  =  ds/dt;  write  ds  in  cylindrical  coordinates. 

3.  Do  Problem  2  in  spherical  coordinates. 

4.  Use  Lagrange’s  equations  to  find  the  equation  of  motion  of  a  simple  pendulum.  (See 
Chapter  7,  Problem  2.13.) 


1.  (a) 


(b) 
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5.  Find  the  equation  of  motion  of  a  particle  moving  along  the  x  axis  if  the  potential 
energy  is  V  =  ^kx2 .  (This  is  a  simple  harmonic  oscillator.) 

6.  A  particle  moves  on  the  surface  of  a  sphere  of  radius  a  under  the  action  of  the  earth’s 
gravitational  field.  Find  the  9,  (j>  equations  of  motion.  ( Comment :  This  is  called  a 
spherical  pendulum.  It  is  like  a  simple  pendulum  suspended  from  the  center  of  the 
sphere,  except  that  the  motion  is  not  restricted  to  a  plane.) 

7.  Prove  that  a  particle  constrained  to  stay  on  a  surface  f(x,  y,  z)  =  0,  but  subject 
to  no  other  forces,  moves  along  a  geodesic  of  the  surface.  Hint:  The  potential 
energy  V  is  constant,  since  constraint  forces  are  normal  to  the  surface  and  so  do 
no  work  on  the  particle.  Use  Hamilton’s  principle  and  show  that  the  problem  of 
finding  a  geodesic  and  the  problem  of  finding  the  path  of  the  particle  are  identical 
mathematics  problems. 

8.  Two  particles  each  of  mass  m  are  connected  by  an  (in- 
extensible)  string  of  length  l.  One  particle  moves  on  a 
horizontal  table  (assume  no  friction),  The  string  passes 
through  a  hole  in  the  table  and  the  particle  at  the  lower 
end  moves  up  and  down  along  a  vertical  line.  Find  the 
Lagrange  equations  of  motion  of  the  particles.  Hint:  Let 
the  coordinates  of  the  particle  on  the  table  be  r  and  6,  and  let  the  coordinate  of  the 
other  particle  be  z.  Eliminate  one  variable  from  L  (using  r  +  \z\  =  l)  and  write  two 
Lagrange  equations. 

9.  A  mass  m  moves  without  friction  on  the  surface  of  the  cone  r  =  z  under  gravity 
acting  in  the  negative  z  direction.  Here  r  is  the  cylindrical  coordinate  r  =  \/x2  +  y2. 
Find  the  Lagrangian  and  Lagrange’s  equations  in  terms  of  r  and  9  (that  is,  elimi¬ 
nate  z). 


10. 


11. 


Do  Example  3  above,  using  cylindrical  coordinates  for  mi.  Hint:  Use  zi  and  Z2  for 
the  z  coordinates  of  mi  and  m2.  What  is  the  equation  of  the  cone  in  terms  of  r  and 
zi?  Note  that  r  7^  p,  and  9  in  cylindrical  coordinates  is  not  the  same  as  in  spherical 
coordinates  (see  Chapter  5,  Figures  4.4  and  4.5). 


A  yo-yo  (as  shown)  falls  under  gravity.  Assume  that  it  falls 
straight  down,  unwinding  as  it  goes.  Find  the  Lagrange  equation 
of  motion.  Hints:  The  kinetic  energy  is  the  sum  of  the  trans¬ 
lational  energy  imi2  and  the  rotational  energy  |/02  where  I  is 
the  moment  of  inertia.  What  is  the  relation  between  z  and  91 
Assume  the  yo-yo  is  a  solid  cylinder  with  inner  radius  a  and  outer 


radius  b. 


12.  Find  the  Lagrangian  and  Lagrange’s  equations  for  a  simple  pendulum  (Problem  4) 
if  the  cord  is  replaced  by  a  spring  with  spring  constant  k.  Hint:  If  the  unstretched 
spring  length  is  ro,  and  the  polar  coordinates  of  the  mass  m  are  (r,9),  the  potential 
energy  of  the  spring  is  ^k(r  —  ro)2. 

13.  A  particle  moves  without  friction  under  gravity  on  the  surface  of  the  paraboloid 
z  =  x2  +  y2-  Find  the  Lagrangian  and  the  Lagrange  equations  of  motion.  Show  that 
motion  in  a  horizontal  circle  is  possible  and  find  the  angular  velocity  of  this  motion. 
Use  cylindrical  coordinates. 
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14.  A  hoop  of  mass  M  and  radius  a  rolls  without  slipping  down  an  inclined  plane  of 
angle  a.  Find  the  Lagrangian  and  the  Lagrange  equation  of  motion.  Hint:  The 
kinetic  energy  of  a  body  which  is  both  translating  and  rotating  is  a  sum  of  two 
terms:  the  translational  kinetic  energy  ^ Mv 2  where  v  is  the  velocity  of  the  center 
of  mass,  and  the  rotational  kinetic  energy  \luj2  where  u>  is  the  angular  velocity  and 
I  is  the  moment  of  inertia  around  the  rotation  axis  through  the  center  of  mass. 

15.  Generalize  Problem  14  to  any  mass  M  of  circular  cross  section  and  moment  of  inertia 
I.  Consider  a  hoop,  a  disk,  a  spherical  shell,  a  solid  spherical  ball;  order  them  as  to 
which  would  first  reach  the  bottom  of  the  inclined  plane.  (For  moments  of  inertia, 
see  Chapter  5,  Section  4.) 

16.  Find  the  Lagrangian  and  the  Lagrange  equation  for 
the  pendulum  shown.  The  vertical  circle  is  fixed.  The 
string  winds  up  or  unwinds  as  the  mass  m  swings 
back  and  forth.  Assume  that  the  unwound  part  of 
the  string  at  any  time  is  in  a  straight  line  tangent  m 
to  the  circle.  Let  l  be  the  length  of  unwound  string 
when  the  pendulum  hangs  straight  down. 

17.  A  simple  pendulum  (Problem  4)  is  suspended  from  a  mass  M  which  is  free  to  move 
without  friction  along  the  x  axis.  The  pendulum  swings  in  the  xz  plane  and  gravity 
acts  in  the  negative  z  direction.  Find  the  Lagrangian  and  Lagrange’s  equations  for 
the  system. 

18.  A  hoop  of  mass  m  in  a  vertical  plane  rests  on  a  frictionless  table.  A  thread  is  wound 
many  times  around  the  circumference  of  the  hoop.  The  free  end  of  the  thread 
extends  from  the  bottom  of  the  hoop  along  the  table,  passes  over  a  pulley  (assumed 
weightless),  and  then  hangs  straight  down  with  a  mass  m  (equal  to  the  mass  of  the 
hoop)  attached  to  the  end  of  the  thread.  Let  x  be  the  length  of  thread  between  the 
bottom  of  the  hoop  and  the  pulley,  let  y  be  the  length  of  thread  between  the  pulley 
and  the  hanging  mass  m,  and  let  9  be  the  angle  of  rotation  of  the  hoop  about  its 
center  if  the  thread  unwinds.  What  is  the  relation  between  x,  y,  and  91  Find  the 
Lagrangian  and  Lagrange’s  equations  for  the  system.  If  the  system  starts  from  rest, 
how  does  the  hoop  move? 

For  the  following  problems,  use  the  Lagrangian  to  find  the  equations  of  motion  and  then 

refer  to  Chapter  3,  Section  12. 

19.  For  small  vibrations,  find  the  characteristic  frequencies  and  the  characteristic  modes 
of  vibration  of  the  coupled  pendulums  shown.  All  motion  takes  place  in  a  sin¬ 
gle  vertical  plane.  Assume  the  spring  unstretched 
when  both  pendulums  hang  vertically,  and  take 
the  spring  constant  as  k  =  mg /l  to  simplify  the 
algebra.  Hints:  Write  the  kinetic  and  potential 
energies  in  terms  of  the  rectangular  coordinates 
of  the  masses  relative  to  their  positions  hanging 
at  rest.  Don’t  forget  the  gravitational  potential 
energies.  Then  write  the  rectangular  coordinates 
x  and  y  in  terms  of  9  and  </>,  and  for  small  vibra¬ 
tions  approximate  sin  9  =  9 ,  cos  9  =  1  —  92 / 2,  and 
similar  equations  for  <j>. 
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20.  Do  Problem  19  if  the  spring  constant  is  k  =  3mg/l.  '///////I 

21.  Find  the  Lagrangian  and  Lagrange’s  equations  for 
the  double  pendulum  shown.  All  motion  takes 
place  in  a  single  vertical  plane.  Hint :  See  the  hint 
in  Problem  19. 

22.  Do  Problem  21  if  the  two  masses  are  different.  Let 
m  be  the  lower  mass  and  let  M  be  the  sum  of  the 
two  masses. 

23.  For  small  oscillations  of  the  double  pendulum  in  Problem  22,  let  M  =  4 m  and  find 
the  characteristic  frequencies  and  characteristic  modes  of  vibration. 

24.  Do  Problem  23  if  M/m  =  9/4 

25.  Do  Problem  23  in  general,  that  is,  in  terms  of  the  ratio  M/m.  Hint :  You  may  find 
it  helpful  to  use  a  single  letter  to  represent  \Jm/M,  say  a 2  =  m/M. 


6.  ISOPERIMETRIC  PROBLEMS 

Recall  that  in  ordinary  calculus  we  sometimes  want  to  maximize  a  quantity  subject 
to  a  condition  (for  example,  find  the  volume  of  the  largest  box  you  can  make  with 
given  surface  area).  Also  recall  that  the  method  of  Lagrange  multipliers  was  useful 
in  such  problems  (see  Chapter  4,  Section  9).  There  are  similar  problems  in  the 
calculus  of  variations.  The  original  question  which  gave  this  class  of  problems  its 
name  was  this:  Of  all  the  closed  plane  curves  of  given  perimeter  (isoperimetric  = 
same  perimeter),  which  one  incloses  the  largest  area?  To  solve  this  problem,  we 
must  maximize  the  area,  J  y  dx,  subject  to  the  condition  that  the  arc,  /  ds ,  is  the 
given  length  l.  In  other  words,  we  want  to  maximize  an  integral  subject  to  the 
condition  that  another  integral  has  a  given  (constant)  value;  any  such  problem  is 
called  an  isoperimetric  problem.  Let 

rx  2 

1=  F(x,  y,  y')  dx 

J  X\ 

be  the  integral  we  want  to  make  stationary;  at  the  same  time, 

rx  2 

J=  /  G(x,y,y')dx, 

J  Xi 


with  the  same  integration  variable  and  the  same  limits,  is  to  have  a  given  constant 
value.  (This  means  that  the  allowed  varied  paths  must  be  paths  for  which  J  has 
the  given  value.)  By  using  the  Lagrange  multiplier  method,  it  can  be  shown  that 
the  desired  condition  is  that 


(F  +  AG)  dx 


should  be  stationary,  that  is,  that  F  +  AG  should  satisfy  the  Euler  equation.  The 
Lagrange  multiplier  A  is  a  constant.  It  will  appear  in  the  solution  y(x)  of  the  Euler 
equation;  having  found  y(x),  we  can  substitute  it  into  f**G(x,y,y')dx  =  const, 
and  so  find  A  if  we  like.  However,  for  many  purposes  we  do  not  need  to  find  A. 
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Example  1.  Given  two  points  x\  and  X2  on  the  x  axis,  and  an  arc  length  l  >  X2  —  xi,  find 
the  shape  of  the  curve  of  length  l  joining  the  given  points  which,  with  the  x  axis, 
incloses  the  largest  area. 

We  want  to  maximize  I  =  fx ^  ydx  subject  to  the  condition  J  =  Jx  2  ds  =  l. 
Here  F  =  y  and  G  =  \J\  +  y'2  so 


(6.1)  F  +  XG  =  y  +  \\J  1  +  y’2- 

We  want  the  Euler  equation  for  F  +  AG.  Since 

A(F  +  AG)  =  _^_  and  |-(F  +  AG)  =  1, 
dy  y/1  +  y'2  dy 

the  Euler  equation  is 


(6.2) 


d_ 

dx 


Ay' 


.Vi- 


-1  =  0. 


The  solution  of  (6.2)  is  (Problem  7): 

(6.3)  (x  +  c)2  +  (y  +  c')2  =  A2 


We  see  that  the  answer  to  our  problem  is  an  arc  of  a  circle  passing  through  the  two 
given  points,  and  the  Lagrange  multiplier  A  is  the  radius  of  the  circle.  The  center 
and  radius  of  the  circle  are  determined  by  the  given  points  X\  and  x%,  and  the  given 
arc  length  l  (Problem  7). 


PROBLEMS,  SECTION  6 

In  Problems  1  and  2,  given  the  length  l  of  a  curve  joining  two  given  points,  find  the 
equation  of  the  curve  so  that: 

1.  The  surface  of  revolution  formed  by  rotating  the  curve  about  the  x  axis  has  minimum 
area. 

2.  The  plane  area  between  the  curve  and  a  straight  line  joining  the  points  is  a  maximum. 

3.  Given  10  cc  of  lead,  find  how  to  form  it  into  a  solid  of  revolution  of  height  1  cm  and 
minimum  moment  of  inertia  about  its  axis. 

4.  A  uniform  flexible  chain  of  given  length  is  suspended  at  given  points  (*1,3/1)  and 
(*2,3/2).  Find  the  curve  in  which  it  hangs.  Hint:  It  will  hang  so  that  its  center  of 
gravity  is  as  low  as  possible. 

5.  A  curve  y  =  y{x),  joining  two  points  x\  and  *2  on  the  x  axis,  is  revolved  around 
the  x  axis  to  produce  a  surface  and  a  volume  of  revolution.  Given  the  surface  area, 
find  the  shape  of  the  curve  y  =  y[x)  to  maximize  the  volume.  Hint:  You  should 
find  a  first  integral  of  the  Euler  equation  of  the  form  yf(y,  x' ,  A)  =  C.  Since  y  =  0 
at  the  endpoints,  C  =  0.  Then  either  y  =  0  for  all  x.  or  /  =  0.  But  y  =  0  gives  zero 
volume  of  the  solid  of  revolution,  so  for  maximum  volume  you  want  to  solve  /  =  0. 

6.  I11  Problem  5,  given  the  volume,  find  the  shape  of  the  curve  y  =  y(x)  to  minimize 
the  surface  area.  Hint:  See  the  hint  in  Problem  5. 

7.  Integrate  (6.2),  simplify  the  result  and  integrate  again  to  get  (6.3)  where  c  and  d 
are  constants  of  integration.  If  xi  =  —  \/3,  *2  =  \/3,  and  l  =  47t/3,  show  that  the 
center  and  radius  of  the  circle  are  (0,  —1)  and  A  =  radius  =  2. 
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7.  VARIATIONAL  NOTATION 


The  symbol  5  was  used  in  the  early  days  of  the  development  of  the  calculus  of  vari¬ 
ations  to  indicate  what  we  have  called  differentiation  with  respect  to  the  parameter 
e.  It  is  just  like  the  symbol  d  in  a  differential  except  that  it  warns  you  that  e  and 
not  x  is  the  differentiation  variable.  The  6  notation  is  not  used  much  any  more 
in  mathematics,  but  you  will  find  it  in  applications  and  so  should  understand  its 
meaning.  The  quantity  51  is  just  the  differential 

51  =  ^-de, 
de 

where  di/de  is  evaluated  for  e  =  0.  The  symbol  5  (read  “the  variation  of”)  is  also 
treated  as  a  differential  operator  acting  on  F,  y ,  and  y'\  we  shall  define  5y,  5y' ,  and 
5F  in  terms  of  our  previous  notation.  We  had  in  Section  2: 

Y(x,e)  =  y{x)  =  ey(x), 

Y'{x,e)  =  y'{x)  +  erj'(x). 


Then  the  meaning  of  5y  is 
(7.2) 


BY 


de  =  r](x)de; 


e=0 


this  is  just  like  a  differential  dY  if  e  is  the  variable.  The  meaning  of  5y'  is 

(BY1' 


(7.3) 


5y'  = 


\  de 


de  =  rf'(x)de. 


e=0 


This  is  identical  with 

(7.4)  ^(fy)  =  -^[y(x)de]  =  7)\x)de 

since  x  and  e  are  independent  variables;  in  other  words,  d  and  5  commute.  The 
meaning  of  5F  is 


(7.5) 


dF .  BF  s  , 

sf  =  -^-dv  +  ^-;sy ; 

dy  dy’ 


this  is  just  a  total  differential  dF  =  (dF/de)e=ode  of  the  function  F[x,  Y(x,  e),  Y'(x,  e)] 
at  e  =  0  with  e  considered  the  only  variable.  Then  the  variation  in  I  is 


(7.6) 


rx 2  px2 

51  =  5  Fdx=  5F  dx 


OF  .  8F  .  , 

^Sv+ajSy 


dx 


dF 

dy 


dF 


rj(x)de  +  —ri'(x)d> 


dx. 


If  you  compare  (7.6)  with  (2.13),  you  find  that  the  following  two  statements  about 
/  =  f  F(x,y,y')dx  mean  the  same  thing: 

(a)  I  is  stationary;  that  is,  dl/de  =  0  at  e  =  0  as  in  (2.13). 


(b)  The  variation  of  I  is  zero;  that  is,  51  =  0  as  in  (7.6). 
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8.  MISCELLANEOUS  PROBLEMS 

1.  (a)  In  Section  3,  we  showed  how  to  obtain  a  first  integral  of  the  Euler  equation 

when  F  =  F(y,y').  There  is  an  alternative  method  of  handling  this  case.  You 
can  show  that  if  F  =  F(y,y'),  then  F  —  y'dF/dy1  =  const.  To  prove  this, 
differentiate  the  left-hand  side  with  respect  to  x,  and  show  that  the  result  is 
zero  if  F  satisfies  the  Euler  equation.  Note  that  what  you  have  is  a  first  integral 
of  the  Euler  equation. 

(b)  Use  the  method  of  (a)  to  do  the  problems  at  the  end  of  Section  3. 

(c)  Consider  the  motion  of  a  particle  along  the  x  axis;  then  L  =  T  —  V  = 
\mx2  —  V(x).  Note  that  L  does  not  contain  the  independent  variable  t;  this 
corresponds  to  the  case  F  =  F(y,y')  in  (a).  Show  that  the  first  integral  found 
in  (a)  is  just  the  equation  of  conservation  of  energy  for  the  mechanics  problem. 


Find  a  first  integral  of  the  Euler  equation  to  make  stationary  the  integrals  in  Problems  2 
to  4. 


2. 


fb  x2  dy 

3. 

rb  yy'2  dx 

4. 

f 

Ja  Vl  +  X'2 

Ja  \/l  +  y'2 

•J  OL 

'2  +  r4  d8 


Write  and  solve  the  Euler  equations  to  make  stationary  the  integrals  in  Problems  5  to  7. 

+  1  dx  6.  [  ^  +  y  dx  7.  f  VT  +  x\/ 1  +  x'2  dy 

V2  Ja  1  +  y  Ja 

8.  Find  the  geodesics  on  the  cylinder  r  =  1  +  cos  6. 

9.  Find  the  geodesics  on  the  cone  z  =  r  cot  a,  where  r2  =  x2  +  y2. 

10.  Find  the  geodesics  on  the  parabolic  cylinder  y  =  x2 . 


In  Problems  11  to  18,  use  Fermat’s  principle  to  find  the  path  of  a  light  ray  through  a 
medium  of  index  of  refraction  proportional  to  the  given  function. 

11.  r“1/2  12.  ey  13.  (2a; +  3)-1  14.  (y  +  2)1/2 


15.  x 1/3  Hint:  In  the  last  integration,  let  x  =  u3. 

16.  r  Hint:  In  the  last  integration,  let  u  =  r2 . 

17.  r_1  In?’  Hint:  In  the  last  integration,  let  u  =  lnr. 

18.  ( x  +  y  j1/2  Hint:  Make  the  change  of  variables  (45°  rotation) 


X„^  +  y), 


Y  =  —(x 

V2 


y);  what  is  dX2  +  dY 2? 


19.  Find  Lagrange’s  equations  in  polar  coordinates  for  a  particle  moving  in  a  plane  if 
the  potential  energy  is  V  =  ^ kr 2 . 

20.  Repeat  Problem  19  if  V  =  —K/r. 

21.  Write  Lagrange’s  equations  in  cylindrical  coordinates  for  a  particle  moving  in  the 
gravitational  Held  V  =  mgz. 

22.  In  spherical  coordinates,  find  the  8  Lagrange  equation  for  a  particle  moving  in  the 
potential  field  V  =  V{r,6,<j>).  What  is  the  8  component  of  the  acceleration?  Hint: 
The  8  Lagrange  equation  is  the  8  component  of  m a  =  F  =  -  W;  for  components 
of  W,  see  Chapter  6,  end  of  Section  6,  or  Chapter  10,  Section  9. 


Section  8 


Miscellaneous  Problems  495 


23. 

24. 


A  particle  slides  without  friction  around  a  vertical  circle  under  the  force  of  gravity. 
Set  up  the  Lagrange  equation  of  motion. 

Write  and  simplify  the  Euler  equation  to  make  stationary  the  integral 

rb 


/  [P{x,y)  +  Q(x,y)y']dx. 


Show  that  if  the  Euler  equation  is  satisfied,  the  integral  has  the  same  value  for  all 
paths  joining  a  and  b.  (See  Problem  1.3.  Also  see  Chapter  6,  Section  9,  Example  2.) 

25.  Find  the  shape  of  a  curve  of  minimum  length  which  will  inclose  a  given  area  A  lying 
in  the  (*,  y)  plane.  Find  the  length  in  terms  of  A. 

26.  A  wire  carrying  a  uniform  distribution  of  positive  charge  lies  in  the  ( x ,  y )  plane  and 
joins  two  given  points.  Find  its  shape  to  minimize  the  electrostatic  potential  at  the 
origin. 

27.  Find  a  first  integral  of  the  Euler  equation  for  Problem  26  if  the  length  of  the  wire  is 
given. 

28.  Write  the  9  Lagrange  equation  for  a  particle  moving  in  a  plane  if  V  =  V(r)  (that 
is,  a  central  force).  Use  the  9  equation  to  show  that: 

(a)  The  angular  momentum  r  X  mv  is  constant. 

(b)  The  vector  r  sweeps  out  equal  areas  in  equal  times  (Kepler’s  second  law). 
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Tensor  Analysis 


1.  INTRODUCTION 


You  already  know  something  about  tensors  although  you  may  not  have  used  the 
term  tensor.  Tensors  of  rank  (or  order)  zero  are  just  scalars  and  tensors  of  rank 
one  are  just  vectors;  you  are  already  familiar  with  these.  In  3-dimensional  space  a 
scalar  has  3°  =  1  components  and  a  vector  has  31  =  3  components;  a  second-rank 
tensor  has  32  =  9  components;  and  in  general  a  tensor  of  rank  n  has  3n  components. 
After  scalars  and  vectors,  second-rank  tensors  are  the  ones  you  are  most  likely  to 
find  in  applications,  so  let’s  consider  an  example  of  such  a  tensor. 

Example  1.  Think  of  a  beam  carrying  a  load;  there  are  stresses  and  strains  in  the  material 
of  the  beam.  If  we  imagine  cutting  the  beam  in  two  by  a  plane  perpendicular  to 
the  x  direction,  we  realize  that  there  is  a  force  per  unit  area  exerted  by  the  material 
on  one  side  of  our  imaginary  cut  on  the  other  side.  This  is  a  vector,  so  it  has  three 
components  Pxx,  Pxy,  Pxz,  where  the  first  subscript  x  is  to  emphasize  that  this  is 
a  force  across  a  plane  perpendicular  to  the  x  direction.  Similarly,  if  we  consider  a 
plane  perpendicular  to  the  y  direction,  there  is  a  force  per  unit  area  across  this  plane 
with  components  Pyx,  Pyy,  Pyz',  and  finally  across  a  plane  perpendicular  to  the  z 
direction  there  is  a  force  per  unit  area  with  components  Pzx,  Pzy ,  Pzz ■  At  a  point 
in  the  material,  then,  we  have  a  set  of  nine  quantities  which  could  be  displayed  as 
a  matrix: 


(1.1) 


This  is  a  second-rank  tensor  known  as  the  stress  tensor.  The  forces  (per  unit  area) 
Pxx,  Pyy,  Pzz  are  pressures  or  tensions;  the  others  are  shear  forces  (per  unit  area). 
For  example  Pzy  is  a  force  per  unit  area  in  the  y  direction  acting  across  a  plane 
perpendicular  to  the  z  direction;  this  force  tends  to  shear  the  beam. 

So  far,  we  have  simply  indicated  the  number  of  components  that  tensors  of  the 
various  ranks  have.  This  is  not  the  whole  story.  To  see  what  else  is  required,  let  us 
talk  about  first-rank  tensors,  that  is  vectors,  which  are  already  familiar  to  you.  In 
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elementary  work  a  vector  is  usually  defined  either  as  a  magnitude  and  a  direction,  or 
as  a  set  of  three  components.  To  see  that  we  need  to  give  a  more  careful  definition, 
consider  this  example. 


Example  2.  We  can  draw  an  arrow  to  represent  a  given  rotation  of  a  rigid  body  in  the 
following  way.  Draw  the  arrow  along  the  axis  of  rotation,  make  its  length  equal 
to  the  rotation  angle  in  radians,  and  let  its  sense  be  given  by  the  right-hand  rule. 
Then,  apparently,  a  rotation  is  a  vector  according  to  the  magnitude  and  direction 
definition.  But  this  is  not  so!  Take  a  book  and  rotate  it  90°  about  the  x  axis,  then 
90°  about  the  y  axis.  (See  Chapter  3,  Problem  7.31.)  Repeat,  rotating  this  time 
first  about  the  y  axis  and  then  about  the  x  axis.  The  final  positions  of  the  book  are 
different.  But  the  sum  of  the  two  vectors  does  not  depend  on  the  order  in  which 
they  are  added  (in  mathematical  language,  vector  addition  is  commutative).  The 
arrows  associated  with  rotations  are  not  vectors. 


Example  3.  Now  let  us  consider  the  idea  of  a  vector  as  a  set  of  three  components.  In 
order  to  talk  about  components,  we  must  have  a  coordinate  system.  There  are 
infinitely  many  coordinate  systems — even  for  rectangular  axes  (x,  y ,  z)  there  are 
infinitely  many  sets  of  rotated  axes.  Thus  we  must  say  that  a  vector  consists  of  a 
set  of  three  components  in  each  coordinate  system.  If  the  components  of  a  vector 
relative  to  one  set  of  axes  are  given,  we  know  from  elementary  vector  analysis  that 
the  component  of  the  vector  in  any  direction,  or  its  components  relative  to  any 
rotated  set  of  axes,  can  be  found  by  taking  projections.  Then  the  new  components 
are  definite  combinations  of  the  old  components.  This  fact  allows  us  to  decide 
whether  a  physical  quantity  is  really  a  vector  or  not.  There  is  a  similar  requirement 
for  tensors,  for  example  the  second-rank  stress  tensor  we  have  described.  We  could 
imagine  cutting  the  beam  by  a  plane  oriented  in  any  given  direction  and  ask  for 
the  force  per  unit  area  acting  across  this  plane.  It  can  be  shown  (see  Section  7) 
that  each  component  of  this  force  is  a  certain  combination  of  the  nine  components 
of  the  stress  tensor  (1.1).  Thus  the  components  of  the  stress  tensor  in  any  other 
coordinate  system  are  definite  combinations  of  the  nine  components  of  the  tensor 
relative  to  the  ( x,y,z )  axes.  In  other  words,  tensors  of  all  ranks,  like  vectors,  have 
a  physical  meaning  which  is  independent  of  the  reference  coordinate  system  and 
there  are  definite  mathematical  laws  which  relate  their  components  in  two  systems. 

You  may  wonder  why  we  cannot  make  just  any  set  of  components  (3  for  a 
vector,  9  for  a  second-rank  tensor,  etc.),  given  in  one  coordinate  system,  a  tensor 
by  defining  its  components  in  other  systems  by  the  correct  transformation  laws. 
Mathematically,  we  could!  But  for  a  physical  entity,  we  are  not  free  to  define  its 
components  in  various  coordinate  systems;  they  are  determined  by  physical  fact. 
We  merely  give  a  mathematical  description  of  the  entity  and  identify  it  as  a  scalar, 
a  vector,  a  second-rank  tensor,  etc.  (or  perhaps  none  of  these).  We  can  see  again 
now  why  an  arrow  associated  with  a  rotation  is  not  a  vector.  If  we  treat  the  arrow 
as  a  vector  and  take  components  of  it,  these  component  vectors  do  not  represent 
rotations  which  can  be  combined  to  give  the  original  rotation.  Thus  a  vector  which 
looks  superficially  like  the  arrow  we  have  defined  is  not  a  correct  mathematical 
representation  of  the  physical  entity  (a  rotation)  we  are  trying  to  describe. 
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What  is  the  relationship  between  the  vectors  we  are  going  to  define  here  and  the 
vectors  of  a  linear  vector  space  (Chapter  3,  Sections  10  and  14)?  The  ideas  of  an 
abstract  vector  space  grew  out  of  the  geometry  of  three-dimensional  displacement 
vectors.  A  change  of  coordinate  system  (for  example,  rotation  of  axes)  corresponds 
to  a  change  of  basis  in  a  vector  space.  Because  the  definitions  of  a  vector  space 
are  set  up  to  parallel  the  geometry,  displacement  vectors  are  vectors  in  the  vector 
space  sense.  Whether  other  physical  entities  (force,  temperature,  stress,  etc.)  are 
properly  modeled  as  vectors  then  depends  on  whether  they  transform  under  a  change 
of  coordinate  system  (that  is,  change  of  basis)  in  the  same  way  that  displacement 
vectors  do.  Here  we  want  the  word  “vector”  to  refer  to  all  physical  quantities  which 
transform  properly.  Thus  we  shall  find  the  transformation  law  for  a  displacement 
vector,  and  then  define  a  vector  as  any  entity  which  obeys  the  same  law. 

A  tensor  which  transforms  properly  under  a  rotation  of  rectangular  (a;,  y1  z) 
axes  is  called  a  Cartesian  tensor;  we  will  study  these  in  some  detail.  Things  become 
a  little  more  complicated  when  we  consider  transformations  to  other  coordinate 
systems  such  as  spherical  coordinates;  we  will  consider  this  at  the  end  of  the  chapter. 
But  for  Sections  1  to  7,  the  term  tensor  will  mean  Cartesian  tensor. 


2.  CARTESIAN  TENSORS 


In  Chapter  3,  Section  7,  we  considered  the  effect  of  rotations  on  vectors,  and  empha¬ 
sized  active  rotations  (vector  rotated,  axes  fixed).  Now  we  want  to  consider  passive 
rotations  (vector  fixed,  axes  rotated),  in  order  to  find  how  the  components  of  a  dis¬ 
placement  vector  in  one  coordinate  system  are  related  to 
its  components  in  a  rotated  system.  Let  ( x ,  y ,  z )  be  a  set 
of  rectangular  axes  and  (x' ,  y' ,  z')  another  set  obtained  by 
rotating  the  axes  in  any  manner  keeping  the  origin  fixed 
(Figure  2.1).  In  the  table  (2.1),  we  list  the  cosines  of  the 
nine  angles  between  the  (x,y,z)  axes  and  the  (a :',y',z') 
axes. 


(2.1) 


X 

y 

z 

x' 

h 

mi 

ni 

y' 

h 

m2 

n2 

z' 

h 

m3 

n3 

In  the  table,  I2  means  the  cosine  of  the  angle  between  the  x  axis  and  the  y'  axis, 
etc.  A  vector  r  (Figure  2.1)  has  components  x,  y,  z  or  x' ,  y' ,  z'  relative  to  the 
two  coordinate  systems;  we  want  to  find  the  relations  between  the  two  sets  of 
components. 

Example  1.  Let  i,  j,  k  be  unit  basis  vectors  along  the  ( x,y,z )  axes  and  i',  j',  k'  be  unit 
basis  vectors  along  the  (x',y',z')  axes.  Then  the  vector  r  can  be  written  in  terms 
of  either  set  of  components  and  basis  vectors  as  follows: 

(2.2)  r  =  ix  +  jy  +  kz  =  \'x'  -I-  ]'y'  +  k! z  . 

Taking  the  dot  product  of  this  equation  with  i',  we  get 

(2.3) 


i  •  i'x  +  j  •  My  +  k  •  M z  =  x' 
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(since  i7  •  i7  =  1,  and  i7  •  j7  =  i7  -  k'  =  0).  Now  i  •  i7  is  the  cosine  of  the  angle  between 
i  and  i',  that  is,  between  the  x  and  x'  axes,  since  i  and  i7  are  unit  vectors;  thus 
i  •  i7  =  l\  from  the  table  (2.1).  Similarly,  j  •  i7  =  m\  and  k  •  i7  =  ni  and  (2.3)  becomes 

(2.4)  x'  =  l\x  +  miy  +  n\z. 

Similarly,  dotting  r  into  j7  nd  k',  and  using  (2.1)  we  get 

y' =  l2x  +  m2y  +  n2z, 
z  =  l3x  +  m3y  +  n3z. 

The  equations  (2.4)  and  (2.5)  are  called  the  transformation  equations  from  the 
coordinate  system  (x,y,z)  to  (x',y',z'). 


Example  2.  In  the  same  way,  dotting  r  with  i,  j,  k  in  turn,  we  get  equations  for  x,  y ,  z 
in  terms  of  x'  y1,  z'\ 

x  =  hx'  +  l2y'  +  l3z', 

(2.6)  y  =  mix'  +  m2y'  +  m3z\ 

z  =  n\x'  +  n2y'  +  n3z' . 

These  transformation  equations  may  be  written  more  concisely  in  matrix  notation. 
Equations  (2.4)  and  (2.5)  become  the  matrix  equation: 

(x'  \  (  l\  mi  n\z  \  /  x  \ 

y'  =  (  h  rn2  n2z  or  r7  =  Ar, 

z'  J  \  h  rn3  n3z  )  \  z  ) 

where  r7,  A,  and  r  stand  for  the  matrices  in  (2.7).  [Compare  Chapter  3,  equation 
(7.13)  for  the  two-dimensional  case.]  Similarly,  (2.6)  becomes 

(2.8)  r  =  ATr' 

where  AT  is  the  transpose  of  A.  Recall  from  Chapter  3,  Sections  7  and  9,  that  a 
rotation  matrix  is  an  orthogonal  matrix,  and  for  an  orthogonal  matrix,  AT  =  A-1. 
Also  see  Problems  3  and  4. 

Equations  (2.7)  or  (2.8)  tell  us  how  displacement  vectors  in  a  rectangular  co¬ 
ordinate  system  transform  under  a  rotation  of  axes.  We  now  use  this  result  to 
define  Cartesian  vectors,  that  is,  vectors  which  transform  in  the  same  way  that 
displacement  vectors  do  under  rotations  of  rectangular  (Cartesian)  axes.  We  will 
then  generalize  this  to  define  Cartesian  tensors  of  other  ranks. 


Definition  of  Cartesian  Vectors  A  Cartesian  vector  V  consists  of  a  set  of  three 
numbers  (components)  in  every  rectangular  coordinate  system;  if  14,  Vy ,  14  are  the 
components  in  one  system  and  V^,  Vy7,  V'z  are  the  components  in  a  rotated  system, 
these  two  sets  of  components  are  related  by  an  equation  similar  to  (2.7),  namely, 


(2.9) 


or  V7  =  AV, 


where  A  is  the  rotation  matrix  in  (2.7).  Alternatively,  we  could  use  (2.8)  and  require 
that  V  =  ATV7. 
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We  can  simplify  our  notation  by  making  the  following  changes. 


Replace  x,y,z 

by 

Xl,X-2,  X3 

Replace  x' .  xj .  z' 

by 

X\,X2,  x3 

Replace  Vx,Vy,  Vz 

by 

v1}v2,v3 

Replace  V'x ,  Vj  V'z 

by 

v;xx 

(  CLu  a±2 

Ol3 

Replace  A  in  (2.7) 

by 

I  021  022 

V  031  032 

023 

033 

(2.10) 


In  this  notation  (2.7)  and  (2.9)  become  (2.11)  and  (2.12): 

3 


(2.11) 

xi  —  'y  1  aij  xj ) 
j= 1 

*  =  1,2,3, 

3 

(2.12) 

Vi  =  y  J  aij  Vj , 

*  =  1,2,3. 

J= 1 

Alternatively,  we  could  solve  (2.11)  for  the  x  coordinates  in  terms  of  the  x'  coor- 

3 


dinates  as  in  (2.8),  to  get,  in  the  summation  form:  Xi 
companion  formula  to  (2.12),  namely 


and  a  similar 
o= i 


3 

(2.13)  Vi  =  Y,a»Vi- 

j= i 


Since  we  will  occasionally  want  the  transformation  formula  for  a  Cartesian  vector 
solved  for  the  unprimed  components  as  in  (2.13),  you  should  be  sure  you  understand 

(2.13).  Compare  carefully  the  indices  in  (2.12)  and  (2.13).  In  matrix  form  (2.12)  is 
V'  =  AV  and  (2.13)  is  V  =  ATV'  [see  equations  (2.7)  and  (2.8)].  Now  element  i,  j 
of  AT  is  the  same  as  element  j,  i  of  A,  so  the  coefficients  in  (2.13)  are  urj  instead  of 
ciij  as  they  were  in  (2.12).  It  is  now  straightforward  to  define  tensors. 


Definition  of  Cartesian  Tensors  A  tensor  of  rank  zero  has  one  component 
which  is  unchanged  by  a  rotation  of  axes;  it  is  called  an  invariant  or  a  scalar. 
Simple  examples  are  the  length  of  a  vector,  or  the  dot  product  of  two  vectors.  A 
first  rank  tensor  is  just  a  vector.  A  tensor  of  second  rank  has  nine  components  (in 
three  dimensions)  in  every  rectangular  coordinate  system.  If  we  call  the  components 
in  one  system  the  components  in  a  rotated  coordinate  system  are  given  by 

(2.14),  where  the  a’s  are  the  direction  cosines  in  the  rotation  matrix  A. 


3  3 

(2.14)  n*  =  EE  Q'ki&ijTij ,  k,  l —  1,2,3. 

i=l  j= 1 
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Direct  Product  We  can  give  a  very  simple  example  of  a  second-rank  tensor. 


Example  3.  Let  U  and  V  be  vectors;  we  form  the  following  array  (in  each  coordinate 
system)  from  the  components  Ui,  U2,  U3  and  V) ,  V2,  V3  of  U  and  V  (in  that 
coordinate  system): 


U\V\ 

U1V2 

U1V3 

(2.15) 

U2v1 

U2V2 

U2V3 

u3v1 

U3V2 

U3V3 

We  can  show  that  these  nine  quantities  are  the  components  of  a  second-rank  tensor 
which  we  shall  denote  by  UV.  Note  that  this  is  not  a  dot  product  or  a  cross 
product;  it  is  called  the  direct  product  of  U  and  V  (or  outer  product  or  tensor 
product).  Since  U  and  V  are  vectors,  their  components  in  a  rotated  coordinate 
system  are,  by  (2.12): 

3  3 

(2.16)  U'k  =  Y,akiui,  Vj^aijVj. 

i=  1  1=1 

Hence  the  components  of  the  second-rank  tensor  UV  are 

33  3 

(2.17)  U'kV{  =  ^akiUi^aijVj  =  ^  akiaijUiVj, 

i=l  j= 1  i,j= 1 

which  is  just  (2.14)  with  Ti:j  =  UiVj  and  T'kJ  =  U'kV/ . 

Equation  (2.14)  generalizes  immediately.  For  example,  a  4th-rank  Cartesian  ten¬ 
sor  is  defined  as  a  set  of  34  or  81  components  Tijki ,  in  every  rectangular  coordinate 
system,  which  transform  to  a  rotated  coordinate  system  by  the  equations 


(2.18) 

Ta/3~/8  —  ^  ^  &a.i&(3jQ’rykQ’8irrijkli 

where  i,  j,  k ,  l  take  the  values  1,  2,  3.  Note  that  a  4th-rank  tensor  has  4  indices  and 
requires  four  a’s  in  its  definition.  Similarly,  an  nth-rank  tensor  has  n  indices  and 
requires  n  a’s  in  its  definition.  Also  we  can  generalize  (2.17)  to  show,  for  example, 
that  the  direct  product  of  a  vector  and  a  3rd-rank  tensor  produces  a  4th-rank  tensor, 
and,  in  general,  the  direct  product  of  tensors  of  ranks  n  and  m  is  a  tensor  of  rank 
771  +  n  (see  Problem  7). 

PROBLEMS,  SECTION  2 

1.  Verify  equations  (2.6). 

2.  Show  that  the  sum  of  the  squares  of  the  direction  cosines  of  a  line  through  the  origin 
is  equal  to  1  Hint:  Let  (a,  b,  c )  be  a  point  on  the  line  at  distance  1  from  the  origin. 
Write  the  direction  cosines  in  terms  of  (a,  b,  c). 

3.  Consider  the  matrix  A  in  (2.7)  or  (2.10).  Think  of  the  elements  in  each  row  (or  col¬ 
umn)  as  the  components  of  a  vector.  Show  that  the  row  vectors  form  an  orthonormal 
triad  (that  is  each  is  of  unit  length  and  they  are  all  mutually  orthogonal),  and  the 
column  vectors  form  an  orthonormal  triad. 
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4.  Any  rotation  of  axes  in  three  dimensions  can  be  described  by  giving  the  nine  direction 
cosines  of  the  angle  between  the  ( x,y,z )  axes  and  the  (x',y',z')  axes.  Show  that 
the  matrix  A  of  these  direction  cosines  in  (2.7)  or  (2.10)  is  an  orthogonal  matrix. 
Hint:  See  Chapter  3,  Section  9.  Find  AAT  and  use  Problem  3. 

5.  Write  equations  (2.12)  out  in  detail  and  solve  the  three  simultaneous  equations  (say 
by  determinants)  for  xi,  X2,  X3  in  terms  of  xj,  x'2,  x'3  to  verify  equations  (2.13).  Use 
your  results  in  Problem  4. 

6.  Write  the  transformation  equation  for  a  3rd-rank  tensor;  for  a  5th-rank  tensor. 

7.  Following  what  we  did  in  equations  (2.14)  to  (2.17),  show  that  the  direct  product 
of  a  vector  and  a  3rd-rank  tensor  is  a  4th-rank  tensor.  Also  show  that  the  direct 
product  of  two  2nd-rank  tensors  is  a  4th-rank  tensor.  Generalize  this  to  show  that 
the  direct  product  of  two  tensors  of  ranks  m  and  n  is  a  tensor  of  rank  m  +  n. 

8.  Write  the  equations  in  (2.16)  and  so  in  (2.17)  solved  for  the  unprimed  components 
in  terms  of  the  primed  components. 

3.  TENSOR  NOTATION  AND  OPERATIONS 

Summation  Convention  As  you  may  have  noticed  in  the  last  section,  tensor 
equations  use  a  lot  of  summation  signs — it  would  be  a  simplification  if  we  could 
get  along  without  them.  Using  the  summation  convention  (or  Einstein  summation 
convention),  we  omit  the  summation  signs  in  equations  like  (2.11)  to  (2.14),  and 
(2.16)  to  (2.18),  and  simply  understand  a  summation  over  any  index  which  appears 
exactly  twice  in  one  term.  Here  are  some  examples  using  summation  convention  (in 
three  dimensions). 


Examples. 

0 : j  or  cbjj  or  0.3, 3 ,  etc. 

means 

On  +  0,22  +  033; 

XiXi  or  xaxa,  etc. 

means 

2  ,  2  ,  2 
xx  +x2  +x3; 

O'ij  bj  k 

means 

anbik  +  ai2b2k  +  «i3^3fe; 

(3.1) 

du  dxj 

means 

du  dxx  du  dx2  du  dx* 

_ _  1 _ _  1 _ _ 

dxj  dx\ 

dx  1  dx\  dx2  dx\  dx 3  dx\ 

TijkiSijVkUi 

means 

EEEET^dW; 

i  j  k  l 


and  so  on.  The  repeated  index  (which  is  summed  over)  is  called  a  dummy  index; 
like  an  integration  variable  in  a  definite  integral,  it  does  not  matter  what  letter  is 
used  for  it.  An  index  which  is  not  repeated  is  called  a  free  index. 

When  summation  convention  is  being  used,  we  are  not  warned  by  a  summation 
sign  what  letters  to  sum  over;  we  just  have  to  inspect  the  indices  and  see  which 
ones  appear  twice.  In  writing  terms  using  the  summation  convention,  we  must  be 
careful  not  to  re-use  an  index.  For  example,  if  we  already  have  two  i  subscripts 
indicating  a  sum  over  i,  and  we  want  another  sum  in  the  same  term,  we  must  use 
a  different  dummy  index,  say  j  or  m  or  a ,  etc.  In  the  following  discussion  we  will 
use  summation  convention;  watch  carefully  for  the  repeated  dummy  indices. 

Contraction  The  transformation  equations  for  a  4th-rank  tensor  are  [see  (2.18)] 
(3.2)  —  dai&fSjd'ykQ'SlTijkl- 

(Note  the  sums  over  i,  j ,  k ,  and  l). 
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Example  1.  Now  suppose  we  put  8  =  (3  which,  by  summation  convention,  means  a  further 
sum  over  (3.  Then  we  have 

(3-3)  r^'af3'yf3  —  &ociQj0jQn/kd0lTijki. 

Now  apjdpi  (summed  over  (3)  is  the  dot  product  of  columns  j  and  l  of  the  rotation 
matrix  A  [see  Problem  (2.3)].  This  dot  product  is  1  if  j  =  l,  and  0  otherwise.  In 
other  words  =  Sji  [see  Chapter  3,  equation  (9.4)].  Then  SjiTjki  becomes 

Tijkj  since  Sji  is  zero  unless  j  and  l  are  equal.  (The  repeated  dummy  index  could 
be  either  j  or  l  or  anything  else  except  the  dummy  indices  i  and  k  which  are  already 
used,  and  the  free  indices  a  and  7).  Thus  we  have 

(3.4)  Tap, y/3  —  daid'yk^jlTijkl  —  C  a /,.  7  )j  fcj 

Now  (3.4)  says  that  are  the  components  of  a  2nd-rank  tensor  since  there  are 
two  free  indices  and  two  a  factors  are  required  [compare  equation  (2.14)].  This 
process  of  setting  two  indices  of  a  tensor  equal  to  each  other  and  then  summing 
is  called  contraction.  Contraction  reduces  the  rank  of  a  tensor  by  2.  Note  that  in 
(3.2)  we  started  with  a  4th-rank  tensor  and  after  contracting  we  have  a  tensor  of 
rank  2  in  (3.4). 

It  is  interesting  to  observe  that  the  dot  (or  scalar  or  inner)  product  of  two  vectors 
in  elementary  vector  analysis  is  an  example  of  contraction.  In  Section  2  we  showed 
that  the  direct  product  of  two  vectors  [see  (2.17)]  is  a  2nd-rank  tensor.  If  we  contract 
UiVj  to  get  UiVi  we  have  the  dot  product  of  vectors  U  and  V,  which  is  a  scalar. 
Again  note  that  contraction  has  reduced  the  rank  of  a  tensor  by  2  (a  scalar  is  a 
tensor  of  rank  zero). 


Tensors  and  Matrices  The  components  of  first  or  second  rank  tensors  can  be 
displayed  as  matrices  and  this  is  often  useful.  We  have  frequently  (see  Chapter  3) 
written  the  components  of  a  vector  (lst-rank  tensor)  as  a  column  or  row  matrix. 
The  components  X)j  of  a  2nd-rank  tensor  can  be  written  as  the  elements  of  a  square 
matrix  (see  inertia  matrix,  Section  4).  Then  note  that  in  the  tensor  equation, 
Ui  =  TijVj ,  the  contraction  (sum  on  j)  corresponds  exactly  to  row  times  column 
multiplication  for  matrices. 


Symmetric  and  Antisymmetric  Tensors  A  2nd-rank  tensor  T%3  is  called  sym¬ 
metric  if  Tij  =  Tji,  and  antisymmetric  (or  skew  symmetric )  if  Ty  =  —  Tji.  Note  that 
these  agree  with  the  corresponding  definitions  for  matrices  [Chapter  3,  (9.2)].  Any 
2nd-rank  tensor  can  be  written  as  a  sum  of  a  symmetric  tensor  and  an  antisymmetric 
tensor  as  in  (3.5)  (Problem  13). 

(3-5)  =  —  (Tij  +  Tji )  +  -(Tij  -  T3i). 

For  tensors  of  higher  rank,  similar  terminology  is  used.  If  an  exchange  of  two  indices 
leaves  the  tensor  component  unchanged,  we  say  that  the  tensor  is  symmetric  with 
respect  to  those  two  indices.  If  an  exchange  of  two  indices  changes  the  tensor 
component  to  its  negative,  we  say  that  the  tensor  is  antisymmetric  with  respect  to 
those  two  indices. 
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Combining  tensors  The  sum  or  difference  (in  fact  linear  combination)  of  two 
tensors  of  rank  n  is  a  tensor  of  rank  n  (Problems  6  and  7).  For  example,  T^j  + 
is  a  tensor  of  rank  2.  Note  the  summation  convention  and  the  contraction  which 
makes  RijkVk  also  a  tensor  of  rank  2  so  that  we  can  add  it  to  Ty.  (Addition  is  not 
defined  for  tensors  of  different  ranks.) 


Quotient  Rule  Let  us  suppose  we  know  that,  for  every  vector  Vj,  the  quantities 
Ui  =  TijVj  are  the  components  of  a  non-zero  vector  and  that  this  holds  true  in 
all  rotated  coordinate  systems.  Then  we  can  prove  that  the  quantities  are  the 
components  of  a  2nd-rank  tensor.  This  is  an  example  of  the  quotient  rule. 


Example  2.  To  prove  this,  we  need  the  following  equations: 


(3.6) 


TLpVg  =  K, 
Ua  — 

TJ .  =  T-V 

l  -  -*■  IJ  v  J  5 

Vj  =  apjVg, 


given  equation  in  rotated  system; 
U  is  a  vector; 
given  equation; 

V  is  a  vector;  see  equation  (2.13). 


Now,  putting  this  all  together  we  have 


(3.7)  TapVp  =  u'a  =  aaiUi  =  aaiTijVj  =  aaiTijagjVg. 

Factoring  out  Vg  from  the  first  and  last  steps,  we  have 

(3.8)  (Ta/3  ~  aaiapjTij)Vp  =  0  for  all  vectors  V'. 

Since  V'  is  arbitrary,  the  parenthesis  in  (3.8)  is  equal  to  zero  (Problem  8).  Thus 
we  have 


(3.9)  Tag  —  aaiagjTij. 

Now  (3.9)  is  the  transformation  equation  for  a  2nd-rank  tensor  [compare  (2.14)],  so, 
as  claimed,  the  quantities  Tij  are  the  components  of  a  2nd-rank  tensor. 

The  quotient  rule  is  useful  in  determining  whether  some  given  quantities  are 
the  components  of  a  tensor.  [As  an  example  of  this,  see  (4.1).]  Suppose  X  is  a  set 
of  3"  components  (the  right  number  for  a  tensor  of  rank  n  in  3  dimensions).  The 
quotient  rule  says  that  if  the  product  of  X  and  an  arbitrary  tensor  is  a  non-zero 
tensor,  then  X  is  a  tensor.  The  product  may  be  either  a  direct  product  or  a  direct 
product  combined  with  one  or  more  contractions.  We  have  proved  the  quotient  rule 
for  one  case  but  the  proof  of  any  case  follows  this  same  pattern.  Given  XA  =  B. 
where  A  is  an  arbitrary  tensor  and  B  is  a  non-zero  tensor,  we  use  the  transformation 
equations  for  A  and  B,  and  the  fact  that  A  is  arbitrary,  to  find  the  transformation 
equations  for  X  (see  Problems  9  to  12). 
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PROBLEMS,  SECTION  3 

1.  Write  equations  (2.11),  (2.12),  (2.13),  (2.14),  (2.16),  (2.17),  and  (2.18)  using  sum¬ 
mation  convention. 

2.  Show  that  the  fourth  expression  in  (3.1)  is  equal  to  du/d x[.  By  equations  (2.6)  and 
(2.10),  show  that  dxj/dx[  =  dij,  so 

du  _  du  dxj  _  du 
dx'i  dxj  dx\  dxj ' 

Compare  this  with  equation  (2.12)  to  show  that  V«  is  a  Cartesian  vector.  Hint: 
Watch  the  summation  indices  carefully  and  if  it  helps,  put  back  the  summation  signs 
or  write  sums  out  in  detail  as  in  (3.1)  until  you  get  used  to  summation  convention. 

3.  As  we  did  in  (3.3),  show  that  the  contracted  tensor  Taj  is  a  first-rank  tensor,  that 
is,  a  vector. 

4.  Show  that  the  contracted  tensor  TijkVk  is  a  2nd-rank  tensor. 

5.  Show  that  TijkimSim  is  a  tensor  and  find  its  rank  (assuming  that  T  and  S  are  tensors 
of  the  rank  indicated  by  the  indices). 

6.  Show  that  the  sum  of  two  3rd-rank  tensors  is  a  3rd-rank  tensor.  Hint:  Write  the 
transformation  law  for  each  tensor  and  then  add  your  two  equations.  Divide  out  the 
a  factors  to  leave  the  result  +  S'al3l  =  aociapja.ykiTijk  +  Sijk)  using  summation 
convention. 

7.  As  in  problem  6,  show  that  the  sum  of  two  2nd-rank  tensors  is  a  2nd-rank  tensor; 
that  the  sum  of  two  4th-rank  tensors  is  a  4th-rank  tensor. 

8.  Show  that  (3.9)  follows  from  (3.8).  Hint:  Give  a  proof  by  contradiction.  Let  Sap  be 
the  parenthesis  in  (3.8);  you  may  find  it  useful  to  think  of  the  components  written 
as  a  matrix.  You  want  to  prove  that  all  9  components  of  Sap  are  zero.  Suppose  it 
is  claimed  that  S 12  is  not  zero.  Since  V/  is  an  arbitrary  vector,  take  it  to  be  the 
vector  (0, 1,0),  and  observe  that  SapV/  is  then  not  zero  in  contradiction  to  (3.8). 
Similarly  show  that  all  components  of  Sap  are  zero  as  (3.9)  claims. 

Prove  the  quotient  rule  in  each  of  the  following  problems,  that  is,  given  XA  =  B  where  A 
is  any  arbitrary  tensor  and  B  is  a  non-zero  tensor,  show  that  X  is  a  tensor.  Hints:  Follow 
the  general  method  in  (3.6)  to  (3.9).  See  the  last  sentence  of  the  section. 


9. 

XiAij  —  Bj 

10. 

XiAj  —  Bij 

11. 

XijAfo  —  Bijk 

12. 

XijkiAki  =  Bij 

13. 

Show  that  the  first  parenthesis  in 
thesis  is  antisymmetric. 

(3.5)  is  a 

symmetric  tensor  and  the  second  paren- 

4.  INERTIA  TENSOR 

Inertia  tensor  If  a  rigid  body  is  rotating  about  a  fixed  axis,  then  from  elemen¬ 
tary  mechanics  we  know  that  t  =  dL / dt  where  t  is  the  torque  and  L  is  the  angular 
momentum  about  the  rotation  axis.  The  angular  velocity  uj  and  the  angular  mo¬ 
mentum  L  are  related  by  the  equation  L  =  Iuj  where  I  is  the  moment  of  inertia 
of  the  body  about  the  rotation  axis.  For  rotation  about  a  fixed  axis,  L  and  uj  are 
parallel  vectors,  and  I  is  a  scalar.  But  if  the  rotation  axis  is  not  fixed,  the  angular 
velocity  and  the  angular  momentum  may  not  be  parallel. 
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Example  1.  Try  the  following  experiment.  Take  a  small  book  bound  by  a  rubber  band, 
hold  it  by  one  corner  and  toss  it  upward  giving  it  a  spin.  As  it  falls  observe  that 
it  tumbles,  that  is,  the  angular  velocity  u  about  the  center  of  mass  is  not  fixed  in 
direction.  However,  by  definition  of  the  center  of  mass,  the  gravitational  torque 
r  about  the  center  of  mass  is  zero  so  r  =  dL/dt  =  0.  (We  are  neglecting  air 
resistance.)  Thus  L  is  a  constant  vector,  and  a  constant  L  and  a  changing  uj  are 
not  parallel.  Then  if  the  equation  L  =  Iuj  is  to  be  true,  I  cannot  be  a  scalar. 

We  have  seen  this  situation  before;  look  at  the  discussion  of  the  quotient  rule  in 
Section  3  and  the  proof  of  the  case  we  have  here  in  (3.5)  to  (3.8).  Since  L  and  u>  are 
vectors,  we  see  by  the  quotient  rule  that  (when  L  and  ui  are  not  parallel)  the  scalar 
I  must  be  replaced  by  a  2nd-rank  tensor  with  components  Ijk ■  Then  in  component 
form  we  have 

(4.1)  Lj  =  Ijk^k 

Example  2.  Next  we  want  to  find  the  components  of  the  inertia  tensor.  For  simplicity, 
first  consider  a  point  mass  m  at  the  tip  of  a  vector  r  with  tail  at  the  origin  O. 
From  Chapter  6,  end  of  Section  3,  the  angular  momentum  of  m  about  the  origin 
is  L  =  mr  X  (lj  X  r)  where  u>  is  the  angular  velocity  of  the  mass  m  about  O. 
(See  Chapter  6,  Figures  2.6  and  3.8.)  We  can  expand  the  triple  vector  product  [see 
Chapter  6,  equation  (3.8)]  to  get 

(4.2)  L  =  mr  X  (u  X  r)  =  m[r2uj  —  (r  •  u>)r]  =  m[r2tu  —  {xu>x  +  ycuv  +  zu>z) r]. 

Next  we  write  the  components  of  L  in  terms  of  the  components  of  uj.  For  example, 
taking  the  x  component  of  (4.2),  we  find 

(4.3)  Lx  =  m[r2u)x  —  (xu>x  +  yuj  v  +  zcoz)x\  =  m[(r2  —  x2)u>x  —  xytoy  —  xzcoz]. 
Thus  three  components  of  the  inertia  tensor  are 

(4.4)  Ixx  =  m(r2  -  x2)  =  m(y2  +  z2),  Ixy  =  - mxy ,  Ixz  =  -mxz. 

The  other  6  components  can  be  found  similarly  by  taking  the  y  and  z  components 
of  (4.2)  (Problem  1). 

Example  3.  If,  instead  of  a  single  mass,  we  have  a  set  of  masses  or  an  extended  body,  then 
the  expressions  for  the  components  of  the  inertia  tensor  become  sums  or  integrals. 

Ixx  =  '^2’ml(y2  +  z2)  or 

(4.5) 

IXy  =  ~  y~]  rriiXiyi  or 

i 

It  is  useful  to  write  (4.1)  as  a  matrix  equation  (see  discussion  in  Section  3 
about  contraction).  Then  the  inertia  tensor  components  form  a  square  matrix. 
This  matrix  is  symmetric  and  so  we  know  from  Chapter  3,  Section  11,  that  it 
can  be  diagonalized  by  an  orthogonal  similarity  transformation.  The  new  axes  are 
called  the  principal  axes  of  inertia  and  the  three  eigenvalues  are  called  the  principal 
moments  of  inertia.  We  see  that  the  equations  of  motion  are  simpler  relative  to  the 
principal  axes. 


{yz  +  z2)  dm, 


J  xy  dm,  etc.  (Problem  1.) 
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Example  4.  Find  the  inertia  tensor  about  the  origin  for  the  mass  distribution  consisting 
of  a  mass  1  at  (0, 1, 1)  and  a  mass  2  at  (1,  —1,  0).  Find  the  principal  moments  of 
inertia  and  the  principal  axes. 

Substituting  (aq,  y\,  zi)  =  (0, 1, 1),  mi  =  1,  and  (x2, 2/2,  z2)  =  (1,  -1,  0),  m2  =  2 
into  (4.5),  we  find  Ixx  =  (l2  +  l2)  +  2(-l)2  =  4,/X!/  =  Iyx  =  -0  -  2(-l)  =  2. 
Continuing  in  the  same  way,  we  can  find  the  rest  of  the  components  (Problem  2) 
and  write  them  as  an  inertia  matrix 


Either  by  hand  or  by  computer  we  find  that  the  eigenvalues  of  the  matrix  I  are  6  and 
3  ±  \/3;  these  are  the  principal  moments  of  inertia.  The  corresponding  eigenvectors 
are  (1, 1,  —1),  (—1  —  -\/3, 2  +  V3, 1),  (—1  +  v^,  2  —  x/3, 1);  these  are  vectors  along 
the  principal  axes  of  inertia. 


Example  5.  Find  the  inertia  tensor  about  the  origin  for  a  mass  of  uniform  density  =  1, 
inside  the  part  of  the  unit  sphere  in  the  first  octant,  that  is,  x  >  0,  y  >  0,  z  >  0. 

We  will  write  the  integrals  for  the  components  of  the  inertia  tensor  first  in 
rectangular  coordinates  and  then  switch  to  spherical  coordinates  [see  Chapter  5, 
equation  (4.5)]  to  evaluate  them  since  the  limits  are  then  simpler.  Satisfy  yourself 
that  in  order  to  cover  the  required  volume,  the  limits  are:  r  from  0  to  1,  9  from  0 
to  7t/2,  and  </>  from  0  to  7t/2.  Then 

Ixx  =  JJJ (r2  -  x2)  dV  = 

1  /■  tt/2  /*7t/2  ^ 

/  /  (r2  —  r2  sin2  0cos2  (j))r2  sin  0  dr  dd  d(f>  =  — . 

=0  Je= 0  J(j>= 0  15 


IXy  — 


(- xy )  dV  = 

1  /»-7t/2  /*7t/2 

r= 0  J0= 0  Jd>= 0 


(r2  sin2  9  cos  </>  sin  <^)r2  sin  9  dr  d9  d(j> 


1 

15' 


Similarly,  the  other  integrals  can  be  written  and  evaluated  (Problem  3).  Alterna¬ 
tively,  it  may  be  clear  that  by  symmetry  the  three  diagonal  components  are  all  the 
same,  and  all  the  off-diagonal  components  are  the  same.  Then  the  inertia  matrix  is 


As  in  Example  4,  we  find  (Problem  3): 

Principal  moments  of  inertia:  ~  2'  ”  +  ^  ^ 

15 

Principal  axes  of  inertia:  (1, 1, 1),  and  any  two  orthogonal  vectors  in  the 

plane  x  +  y  +  z  =  0,  for  example,  (1,  —1,  0)  and 

(1,1, -2). 
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PROBLEMS,  SECTION  4 

1.  As  in  (4.3)  and  (4.4),  find  the  y  and  z  components  of  (4.2)  and  the  other  6  com¬ 
ponents  of  the  inertia  tensor.  Write  the  corresponding  components  of  the  inertia 
tensor  for  a  set  of  masses  or  an  extended  body  as  in  (4.5). 

2.  Complete  Example  4  to  verify  the  rest  of  the  components  of  the  inertia  tensor  and 
the  principal  moments  of  inertia  and  principal  axes.  Verify  that  the  three  principal 
axes  form  an  orthogonal  triad. 

3.  As  in  Problem  2,  complete  Example  5. 

4.  Find  the  inertia  tensor  about  the  origin  for  a  mass  of  uniform  density  =  1,  inside 
the  part  of  the  unit  sphere  where  x  >  0,  y  >  0,  and  find  the  principal  moments  of 
inertia  and  the  principal  axes.  Note  that  this  is  similar  to  Example  5  but  the  mass 
is  both  above  and  below  the  ( x ,  y)  plane.  Warning  hint:  This  time  don’t  make  the 
assumptions  about  symmetry  that  we  did  in  Example  5. 

For  the  mass  distributions  in  Problems  5  to  7,  find  the  inertia  tensor  about  the  origin,  and 
find  the  principal  moments  of  inertia  and  the  principal  axes. 

5.  Point  masses  1  at  (1, 1, 1)  and  at  (—1, 1, 1). 

6.  Point  masses  1  at  (1, 1,  —2)  and  2  at  (1, 1, 1). 

7.  Mass  of  uniform  density  =  1,  bounded  by  the  coordinate  planes  and  the  plane 
x  +  y  +  z  =  1. 

8.  For  the  point  mass  m  we  considered  in  (4.2)  to  (4.4),  the  velocity  is  v  =  u  X  r  so  the 

kinetic  energy  is  T  =  \mv2  =  im(u  X  r)  ■  (u  X  r).  Show  that  T  can  be  written  in 
matrix  notation  as  T  =  where  I  is  the  inertia  matrix,  ui  is  a  column  matrix, 

and  cuT  is  a  row  matrix  with  elements  equal  to  the  components  of  lj. 

5.  KRONECKER  DELTA  AND  LEVI-CIVITA  SYMBOL 

The  Kronecker  <5  is  defined  in  Chapter  3,  equation  (9.3)  but  let’s  repeat  it  here  for 
convenience. 


(5.1) 


1  if  *  =  3, 
0  otherwise. 


The  definition  of  the  Levi-Civita  symbol  (or  permutation  symbol)  is 


(  1 

if  i,  j,  k  =  1, 2,3  or  2, 3, 1  or  3, 1,  2; 

(5.2) 

^ijk  —  \  1 

if  i,j,  k  =  3,  2, 1  or  2, 1,  3  or  1, 3,  2; 

l  o 

if  any  indices  are  repeated. 

Note  in  (5.2)  that  if  you  read  the  indices  i,j,k ,  cyclically  (as  if  they  were  written 
around  a  circle  so  you  can  start  anywhere),  then  if  the  indices  read  in  the  direction 
1,2,3, 1,2,3, 1,  ■  ■  -  ,  the  result  is  +1;  if  the  indices  read  in  the  opposite  direction  the 
result  is  —1. 
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We  can  say  (5.2)  in  another  way  which  is  sometimes  useful.  Start  with  the 
fact  that  ei23  =  +1.  Now  if  we  exchange  any  two  indices,  we  change  the  sign;  for 
example  6321  =  —  1  (we  exchanged  1  and  3).  If  we  now  continue  this  process  and 
exchange  1  and  2  in  £321  =  —1,  we  have  £312  =  +1.  [Try  a  few  more  and  compare 
with  (5.2).]  The  result  of  an  even  number  of  exchanges  in  123  is  called  an  even 
permutation  of  123  and  the  result  of  an  odd  number  of  exchanges  is  called  an  odd 
permutation  of  123.  Thus  we  could  replace  (5.2)  by  the  definition 


(5.3) 

eb'fc  —  i 

if  i,j,  k  =  an  even  permutation  of  1, 2,  3; 
if  i,j,  k  =  an  odd  permutation  of  1, 2,  3; 

l  0 

if  any  indices  are  repeated. 

We  say  that  e^k  is  totally  antisymmetric  (see  Section  3),  that  is,  it  is  antisymmet¬ 
ric  with  respect  to  every  pair  of  indices,  since  each  exchange  of  indices  produces  a 
change  in  sign. 


Isotropic  Tensors  A  Cartesian  isotropic  tensor  means  a  tensor  which  has  the 
same  components  in  all  rotated  coordinate  systems.  The  definitions  (5.1)  to  (5.3) 
are  general,  independent  of  any  reference  system.  Thus  to  show  that  <5^  and  Cijk  are 
isotropic  tensors ,  we  need  to  show  that  a  tensor  transformation  simply  reproduces 
the  tensor  we  start  with,  that  is,  S'  =  S  and  e'  =  e.  In  this  section  we  shall  show 
this  and  develop  some  useful  formulas. 


Kronecker  delta  To  show  that  6ij  is  an  isotropic  2nd-rank  tensor,  we  write  the 
tensor  transformation  to  a  rotated  system  and  show  that  it  gives  S '  =  S. 


(5.4) 


8 rnn  —  Q'mi®nj8ij  —  Q'mjQ'nj 


s 


mn  • 


Remember  summation  convention  and  follow  carefully  the  sums  in  (5.4).  Note  in 
the  second  step  that  ami  becomes  amj  because  1 %  is  zero  unless  i  =  j.  (We  could 
just  as  well  change  anj  to  ani  and  sum  on  i.)  In  the  last  step,  amjanj  (or  amiani ) 
is  the  dot  product  of  rows  m  and  n  of  the  rotation  matrix  and  this  is  8mn  (see 
Problem  2.3).  Thus  the  Kronecker  S  is  a  2nd-rank  isotropic  Cartesian  tensor. 


Determinants  We  can  write  a  useful  formula  for  the  value  of  a  3-by-3  determi¬ 
nant  using  the  Levi-Civita  symbol: 

(5.5)  det  A  =  aua2ja3keijk- 

It  is  straightforward  to  show  (Problem  1)  that  (5.5)  is  equivalent  to  a  Laplace 
development.  Another  useful  formula  is 

(5.6)  £a/37  det  A  —  Q’aiQ’f3jQjryk€ijk  • 

Again  it  is  straightforward  (although  lengthy)  to  show  that  this  is  equivalent  to  a 
Laplace  development  (Problem  2). 
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Levi-Civita  Symbol  To  show  that  etjk  is  an  isotropic  tensor,  we  write  the  trans¬ 
formation  equation  to  a  rotated  system.  We  find 

(5-7)  ^a/37  —  Q‘aiQ,0j&~/k£ijk  — 

In  the  last  step  we  used  (5.6)  with  det  A  =  1  (recall  from  Chapter  3,  Section  7,  that 
if  A  is  a  rotation  matrix,  det  A  =  1).  Thus  eQ(37  is  a  3rd-rank  isotropic  Cartesian 
tensor  (assuming  only  rotated  coordinate  systems;  for  reflections  see  Section  6). 


Products  of  Isotropic  Tensors  We  can  find  other  isotropic  tensors  from  direct 
products  of  the  two  we  have,  or  from  direct  products  followed  by  contraction.  Recall 
from  Sections  2  and  3  that  the  direct  product  of  two  tensors  of  ranks  n  and  m  is 
a  tensor  of  rank  n  +  m  and  that  each  contraction  produces  another  tensor  of  rank 
smaller  by  2.  If  the  tensors  you  multiply  are  isotropic,  the  products  are  also  isotropic 
(Problems  3  and  4). 

To  simplify  products  of  two  Levi-Civita  tensors,  the  following  formula  is  useful. 


Both  sides  of  (5.8)  are  4th-rank  tensors  (contracted  6th-rank  on  the  left)  with  free 
indices  j,  k,  m,  n.  We  want  to  see  that  (5.8)  is  true  for  any  choice  of  these  four 
indices.  Most  choices  will  just  give  0  =  0;  let’s  consider  what  is  required  for  the 
product  of  e’s  to  be  different  from  zero. 

Example  1.  Remember  that  an  e  is  zero  unless  its  three  indices  are  all  different.  Since 
the  first  index  is  the  same  in  e^fc  and  €imn,  the  product  is  different  from  zero  only 
if  the  other  two  indices  (j,  k  and  to,  n)  are  the  same  pair  in  both  e’s.  (For  example, 
if  i  =  1,  then  j ,  k  and  to,  n  must  be  2,  3  or  3,  2.)  This  means  that  either  (1)  j  =  m 
and  k  =  n,  or  (2)  j  =  n  and  k  =  to.  In  case  (1),  the  two  e’s  are  the  same  (both 
=  +1  or  both  =  —1)  so  the  product  is  +1;  this  is  the  same  as  5jmSkn  on  the  right 
side  of  (5.8).  In  case  (2),  the  indices  on  the  two  e’s  are  ijk  and  ikj  so  one  of  them 
is  an  even  permutation  of  1,  2,  3  and  the  other  is  an  odd  permutation.  Thus  the 
product  of  the  two  e’s  is  —1,  and  this  is  the  same  as  —5jn5kn  on  the  right  side  of 

(5.8) .  Note  that,  given  j,  k,  to,  ti  satisfying  either  j,  k  =  m,n  or  j,  k  =  n,  to,  only 
one  term  in  the  sum  over  i  is  different  from  zero,  that  is,  the  term  with  i  different 
from  either  j  or  k.  Also  see  Problem  5. 

Now  that  we  have  (5.8),  it  is  easy  to  write  a  similar  formula  with  the  contraction 
(sum)  over  a  different  pair  of  indices.  Suppose  we  want  eabc£pqb ■  Recall  that  an  e  is 
not  changed  by  cyclic  permutation  of  its  indices  [see  discussion  after  (5.2)].  Thus 
£abc  =  f-bca  and  epqb  —  ebpq  [we  have  cyclically  permuted  the  indices  so  that  the 
summation  index  b  appears  as  the  first  index  for  each  e  as  it  does  in  (5.8)].  Now, 
this  is  the  same  pattern  as  in  (5.8),  with  the  sum  over  the  first  index  of  each  e  [i  in 

(5.8) ,  b  in  the  second  step  in  (5.9)],  so  we  have 

(5.9)  ^abc^pqb  —  tbca^bpq  —  ^cp^aq  ^cq^ap 

It  may  be  helpful  in  writing  this  to  repeat  what  we  said  in  getting  (5.8).  In  (5.9), 
the  product  Cbca^bpq  is  zero  unless  either  c,a  =  p,q,  or  c,  a  =  q,p,  as  indicated  by 
the  right  side  of  (5.9).  For  practice,  do  Problem  7. 
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We  can  further  contract  (5.8)  to  get  (Problem  8). 


(5.10) 


{^ijk^-ijn  — 

—  0. 


Vector  Identities  The  familiar  formulas  in  vector  analysis  can  be  written  in 
tensor  form  using  Sij  and  e^fc.  [See  Am.  J.  Phys.  34,  503-507  (1966).]  We  have 
already  commented  (Section  3)  that  the  dot  product  A  •  B  is  the  contracted  direct 
product,  AiBi .  Now  let’s  show  that  the  components  of  the  cross  product  of  two 
vectors  can  be  written  as 


(5.11) 


(B  x  C)^  —  eijkBjCk- 


To  see  that  this  is  correct  we  look  at  one  component  at  a  time  and  compare  the 
result  with  Chapter  3,  equation  (4.19),  replacing  x,  y ,  z  by  1,  2,  3.  To  find  the 
first  component  of  B  X  C  in  (5.11),  we  let  i  =  1.  Then  the  only  nonzero  terms  on 
the  right  side  of  (5.11)  are  the  two  with  j,  k  =  2,3  or  3, 2,  so  we  find  that  the  first 
component  of  B  X  C  is  (B2C3  —  B3C2),  in  agreement  with  Chapter  3.  Similarly 
the  other  components  agree  with  the  vector  analysis  definition  of  a  cross  product 
(Problem  9a). 


Example  2.  Now  let’s  use  (5.11)  to  write  a  triple  vector  product  in  tensor  form,  and  then 
use  (5.8)  or  (5.9)  to  simplify  it  (Problem  9b). 

(5.12)  [A  x  (B  x  C)]  n  —  ^nip  Ai( B  x  C)p  —  ^nip  A-i  [tpj k Bj  Ck\ 

—  ^-nip  £pj  k  Ai  B j  Ck  —  tpni^-pjk-A-iBjCk 

=  (. SnjSik  -  SnkSiiiAiBjCk  =  BniAiCi)  -  CniAiBi) 

=  components  of  B(A  •  C)  —  C(A  •  B). 

We  recognize  the  final  step  as  the  formula  [Chapter  6,  (3.8)]  for  the  triple  vector 
product;  we  have  just  derived  it  in  tensor  form.  Similarly  we  can  prove  other  vector 
formulas  (see  Problems  10  to  13). 

Recall  from  Chapter  6  that  we  treated  V  as  if  it  were  “almost”  a  vector.  Here 
we  can  similarly  treat  it  as  a  first  rank  tensor,  always  remembering  that  it  is  also  a 
differential  operator.  The  components  of  V  are  d/dxi ,  so  as  in  (5.11)  we  write 


d 

(5.13) 

^  =  eijk~Q^r 

Then  following  the  method  of  (5.12),  we  next  find  the  components  of  curl  curl  V  in 
tensor  form  [compare  part  (e)  in  the  table  at  the  end  of  Chapter  6]. 
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(5.14) 


[V  X  (V  X  V)]„  =  eniP— — (V  X  V)p 


d_ 

'dXi 

d 


~€nipdXi  [epjkdXjVk] 

A  At/ 

-  zPmePjkdxi  gxj  vk 

d  d 

=  i^nj $ik  $nk$ij)  q  ^  q  ~ 

=  J-(±v^-±±vn 

dXn  \dXi  )  dXi  da u 
=  components  of  V(V  •  V)  —  V2V. 


Dual  tensors  Let  T,j  be  an  antisymmetric  2nd-rank  tensor,  that  is,  =  — T)j. 
If  we  display  the  components  Ty  as  elements  of  a  matrix,  it  looks  like  this  (see 
Problem  14). 

/  0  Ti2  -T31  \ 

(5.15)  T  =  -Tia  0  T23 

\  T3i  — X23  0  ) 


Observe  that  there  are  just  3  independent  nonzero  components,  just  enough  to  be 
the  components  of  a  vector.  (Note  that  this  happens  only  in  3  dimensions — see 
Problem  15.)  If  we  define 

(5.16)  Vi  =  ^£ijkTjk 

then  we  find  (Problem  16) 


(5.17) 


Vi  =  X23,  P2  =  T3i,  V;j  =  T12. 


Since  and  Tjk  are  tensors  and  Vi  is  a  contracted  direct  product  of  them,  we  are 
assured  that  V)  is  a  first  rank  tensor,  that  is,  a  vector  (but  see  Section  6).  Thus  the 
three  quantities  in  (5.17)  can  be  considered  as  the  three  independent  components 
of  an  antisymmetric  2nd-rank  tensor  Ty ,  or  as  the  three  components  of  a  vector  Vk 
called  the  dual  of  .  We  can  also  start  with  a  vector  Vk  and  define  Ty  in  terms 
of  it  (Problem  16). 

(5.18)  Tij  =  eijkVk 

Now  suppose  Aj  and  Bk  are  vectors.  Then  Tjk  =  AjBk  —  AkBj  is  a  2nd-rank 
antisymmetric  tensor,  and  the  three  independent  components  of  Tjk  are  just  the 
components  of  A  X  B  (Problem  17).  Thus  we  see  that  the  vector  product  can  be 
considered  as  either  a  vector  or  a  2nd-rank  antisymmetric  tensor. 


PROBLEMS,  SECTION  5 

1.  Verify  that  (5.5)  agrees  with  a  Laplace  development,  say  on  the  first  row  (Chapter 
3,  Section  3).  Hints:  You  will  find  6  terms  corresponding  to  the  6  non-zero  values 
of  tijt  ■  First  let  t  —  1 ;  then  j,  k  can  be  2,  3  or  3,  2.  These  two  terms  give  you  an 
times  its  cofactor.  Next  let  i  =  2  with  j,  k  =  1,3  and  3, 1,  and  show  that  you  get 
ai2  times  its  cofactor.  Finally  let  i  =  3.  Watch  all  the  signs  carefully. 
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2.  Verify  for  a  few  representative  cases  that  (5.6)  gives  the  same  results  as  a  Laplace 
development.  First  note  that  if  a,/3, 7  =  1,2,3,  then  (5.6)  is  just  (5.5).  Then  try 
letting  a,/3, 7  =  an  even  permutation  of  1,2,3,  and  then  try  an  odd  permutation, 
to  see  that  the  signs  work  out  correctly.  Finally  try  a  case  when  ea0-y  =  0  (that  is 
when  two  of  the  indices  are  equal)  to  see  that  the  right  hand  side  of  (5.6)  is  zero 
because  you  are  evaluating  a  determinant  which  has  two  identical  rows. 

3.  Show  that  Sijtkim  is  an  isotropic  tensor  of  rank  5.  Hint:  Combine  equations  (5.4) 
and  (5.7). 

4.  Generalize  Problem  3  to  see  that  the  direct  product  of  any  two  isotropic  tensors  (or 
a  direct  product  contracted)  is  an  isotropic  tensor.  For  example  show  that  tijktimn 
is  an  isotropic  tensor  (what  is  its  rank?)  and  tijktimnSjn  is  an  isotropic  tensor  (what 
is  its  rank?). 

5.  Let  Tjkmn  be  the  tensor  in  (5.8).  This  is  a  4th-rank  tensor  and  so  has  34  =  81 
components.  Most  of  the  components  are  zero.  Find  the  nonzero  components  and 
their  values.  Hint:  See  discussion  after  (5.8). 


6. 

Evaluate: 

(a) 

Sij  SjkSkmSim 

(b) 

djkSjk 

(c) 

Cjk2tk2j 

(d) 

^3jk(-kj3 

(e) 

t23ii-2i3 

(f) 

tk31f-3kl 

7. 

Write  in  terms  of  S’ s  as  in  (5.8)  and  (5.9): 

(a) 

(b) 

tabc^pqc 

8.  Show  that  the  equations  (5.10)  are  correct.  Hints:  You  can  do  these  by  further 
contracting  (5.8).  You  can  also  do  them  by  direct  argument  as  follows:  In  the  first 
equation,  why  must  k  =  nl  If  k  =  n,  then  how  many  choices  are  there  for  i  and  p. 
In  the  second  equation,  in  how  many  ways  can  you  arrange  the  three  numbers  1,  2, 
3,  and  for  each  arrangement,  what  is  the  product  of  the  e’s? 

9.  (a)  Finish  the  work  of  showing  that  the  cross  product  components  are  correctly 

given  by  (5.11).  Hints:  Follow  the  text  discussion  just  after  (5.11).  For  the 
second  component,  let  i  =  2;  etc. 

(b)  Go  through  the  sums  in  (5.12)  carefully  to  verify  each  step.  Hints:  Use  (5.11) 
twice  being  careful  about  repeated  indices,  and  look  at  the  discussion  after 
equation  (5.4). 

(c)  Similarly  check  (5.14). 

10.  (a)  Write  the  triple  scalar  product  A  •  (B  X  C)  in  tensor  form  and  show  that  it  is 

equal  to  the  determinant  in  Chapter  6,  equation  (3.2).  Hint:  See  (5.5). 

(b)  Write  equation  (3.2)  of  Chapter  6  in  tensor  form  to  show  the  equivalence  of 
the  various  expressions  for  the  triple  scalar  product.  Hint:  Change  the  dummy 
indices  as  needed. 

11.  Using  problem  10,  write  A  •  (B  X  A)  in  tensor  notation  and  show  that  it  is  =  0. 
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12.  Write  and  prove  in  tensor  notation: 

(a)  Chapter  6,  Problem  3.13. 

(b)  Chapter  6,  Problem  3.14. 

(c)  Lagrange’s  identity:  (A  X  B)  •  (C  X  D)  =  (A  •  C)(B  •  D)  -  (A  •  D)(B  •  C). 

(d)  (A  x  B)  x  (C  x  D)  =  (ABD)C  -  (ABC)D,  where  the  symbol  (XYZ)  means 
the  triple  scalar  product  of  the  three  vectors. 

13.  Write  in  tensor  notation  and  prove  the  following  vector  operator  identities  in  the 
table  at  the  end  of  Chapter  6:  parts  (b),  (d),  (f),  (g),  (h),  (k). 

14.  Show  that  the  diagonal  elements  of  an  antisymmetric  tensor  are  zero  and  that  (5.15) 
is  a  correct  display  of  the  components  of  an  antisymmetric  2nd-rank  tensor  in  3 
dimensions. 

15.  Write  a  4-by-4  antisymmetric  matrix  to  show  that  there  are  6  different  components, 
not  the  4  components  of  a  vector  in  4  dimensions. 

16.  Verify  that  (5.16)  gives  (5.17).  Also  verify  that  (5.18)  gives  (5.17). 

17.  Write  out  the  components  of  Tjk  =  AjBk  —  AkBj  to  show  that  Tjk  is  a  2nd-rank 
antisymmetric  tensor  with  elements  which  are  the  components  of  A  X  B. 

6.  PSEUDOVECTORS  AND  PSEUDOTENSORS 

So  far  we  have  considered  only  rotations  of  rectangular  coordinate  systems  in  our 
definitions  of  tensors.  Recall  that  an  orthogonal  transformation  includes  both  ro¬ 
tations  and  reflections  (Chapter  3,  Sections  7  and  11).  Now  we  want  to  consider 
how  the  entities  we  have  called  tensors  behave  under  reflections.  Remember  that 
the  determinant  of  an  orthogonal  matrix  is  +1  for  a  rotation  (sometimes  called  a 
“proper”  rotation)  and  the  determinant  is  —1  if  a  reflection  is  involved  (sometimes 
called  an  “improper”  rotation). 

When  detA  =  — 1,  at  least  one  eigenvalue  of  matrix  A  is  —1  (see  Chapter  3, 
Section  11) .  The  —  1  eigenvalue  corresponds  to  the  reversal  of  one  principal  axis,  that 
is,  a  reflection  through  the  plane  perpendicular  to  the  axis  [for  example  a  reflection 
through  the  (x,y)  plane  which  reverses  the  z  axis].  The  other  two  eigenvalues 
correspond  to  a  rotation  [see  Chapter  3,  equation  (7.19)];  this  includes  the  case  of 
a  180°  rotation  which  is  equivalent  to  reversal  of  the  other  two  axes  (see  Problems 
1  and  2).  So  in  thinking  about  reflections,  we  can  think  of  reversing  all  three  axes 
(called  an  inversion)  or  reversing  just  one,  since  a  rotation  doesn’t  affect  the  sign  of 
det  A.  It  is  important  to  realize  that  reversing  either  one  or  all  three  axes  changes 
the  coordinate  system  from  a  right-handed  to  a  left-handed  coordinate  system. 

Example  1.  Let’s  look  at  a  simple  example  of  something  we  usually  think  of  as  a  vector 
(namely  a  cross  product)  which  doesn’t  obey  the  vector  transformation  laws  under 
reflections.  Let  U  and  V  be  displacement  vectors.  Recall  (Section  2)  that,  by 
definition,  a  vector  transforms  the  way  displacement  vectors  do.  Also  remember 
that  we  are  considering  passive  transformations:  vectors  remain  fixed  in  space  while 
the  axes  are  changed  (rotated  or  reflected) .  Now  if  the  z  axis  is  reversed  [reflected 
through  the  ( x ,  y)  plane],  then  the  z  components  of  the  displacement  vectors  U  and 
V  change  signs;  this  is  then  a  requirement  for  all  vectors.  But  the  z  component  of 
U  X  V  (which  is  UxVy  —  UyVx)  does  not  change  sign  (Problems  3  and  4).  Thus 
U  X  V  is  not  a  vector  under  reflections.  We  call  U  X  V  a  pseudovector.  We  will 
discover  other  pseudovectors  as  we  continue. 
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Levi-Civita  symbols  We  want  to  use  (5.6)  when  the  matrix  A  is  the  matrix 
of  an  orthogonal  transformation.  Remember  (Chapter  3,  Section  7)  that  if  A  is 
orthogonal,  det  A  =  ±1  so  (det  A)2  =  1.  Multiply  (5.6)  by  det  A  to  get  the  equation 
ea/ 37  =  (det  A)aOLiapja1k£ijk-  Then  the  transformation  which  gives  e'  =  e  (see 
isotropic  tensors  in  Section  5)  is 

(6.1)  ^a/?7  —  (det  A ')&aiQ,0jQl~/k€ijk  —  Capry. 

Now  this  is  not  the  right  transformation  equation  for  a  3-rank  tensor — the  factor 
det  A  would  not  be  there  for,  say,  the  direct  product  of  three  displacement  vectors. 
Of  course,  we  got  away  with  calling  eijk  a  3rd-rank  tensor  in  Section  5  because  we 
were  discussing  just  rotations  and  det  A  =  1  if  A  is  a  rotation  matrix.  But  now 
we  are  dealing  with  general  orthogonal  transformations,  and  when  det  A  =  —  1  (re¬ 
flection)  there  is  an  extra  factor  —1  in  the  transformation  equation.  We  call  e^k 
a  3rd-rank  pseudotensor.  A  pseudovector  or  pseudotensor  obeys  the  tensor  trans¬ 
formation  equations  under  rotations  (that  is,  det  A  =  1),  but  if  the  transformation 
includes  a  reflection  (that  is,  det  A  =  —1),  then  the  transformation  equation  con¬ 
tains  an  extra  factor  of  —1.  If  we  have  a  direct  product  of  two  pseudotensors  (or 
such  a  product  contracted),  this  will  be  a  tensor  because  the  product  of  the  two 
det  A  factors  is  (det  A)2  =  1.  (Problem  5). 


Polar  and  Axial  Vectors  If  a  vector  (under  rotations)  also  satisfies  the  vector 
transformation  equations  (that  is,  behaves  like  a  displacement  vector)  under  re¬ 
flections,  it  is  called  a  polar  vector  (or  true  vector  or  just  a  vector).  If  there  is  a 
change  in  sign  when  det  A  =  —1,  it  is  called  an  axial  vector  (or  pseudovector).  In 
Example  1,  U  and  V  were  polar  vectors  and  U  X  V  was  an  axial  vector. 

In  order  to  understand  pseudotensors  we  need  to  discuss  left-handed  coordinate 
systems.  These  are  relatively  unfamiliar  in  elementary  work  and  for  good  reason. 
When  we  define  a  cross  product  or  specify  a  vector  to  represent  a  rotation,  the  right 
hand  rule  is  a  part  of  our  definition.  It  would  be  confusing  to  deal  with  this  in  a 
left-handed  system  so  you  are  always  warned  to  use  right-handed  systems.  But  we 
are  now  considering  the  general  case  of  orthogonal  transformations  which  includes 
reflections  and  so  produces  left-handed  reference  systems  which  we  must  learn  to 
cope  with. 

Let’s  consider  the  physics  and  geometry  of  this  by  comparing  linear  velocity  and 
angular  velocity,  both  vectors  under  rotations.  Is  there  a  difference  when  we  consider 
reflections  and  so  have  a  left-handed  coordinate  system?  The  linear  velocity  vector 
indicates  a  path  along  which  something  moves;  it  has  a  direct  physical  meaning,  and 
under  passive  transformations,  it  stays  fixed  in  space.  In  the  case  of  angular  velocity, 
the  physical  motion  is  taking  place  in  the  plane  perpendicular  to  the  angular  velocity 
vector,  say  a  wheel  rotating,  or  a  mass  or  charge  moving  in  a  circle.  The  angular 
velocity  “vector”  is  something  we  choose  via  the  right  hand  rule  to  represent  the 
motion.  We  might  guess  (correctly)  that  linear  velocity  is  a  vector  (polar  vector)  and 
angular  velocity  is  a  pseudovector  (axial  vector).  Remember  that  in  Example  1  we 
found  that  the  cross  product  (defined  using  the  right  hand  rule)  is  a  pseudovector. 
As  we  continue,  watch  for  this;  when  the  right  hand  rule  is  used  in  the  definition  of 
a  vector,  you  suspect  that  it  is  a  pseudovector. 
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Cross  Product  In  Example  1,  we  found  that  the  cross  product  of  two  displace¬ 
ment  vectors  does  not  satisfy  the  vector  transformation  equations  under  reflections. 
Now  we  want  to  write  a  formula  to  show  exactly  how  a  cross  product  transforms 
under  a  general  orthogonal  transformation.  By  (5.11),  we  write 


(6.2) 


(U  X  V)^  —  ^ijkUjVk- 


Then  using  (6.1),  (6.2),  and  the  vector  transformation  equations  for  the  displace¬ 
ment  vectors  U  and  V,  we  find 

(6.3)  (U'  X  V%  =  e'aMv; 

—  (det  A)aaiapja~tkeijkaf3mUma.ypVp 
=  (det:  A  jdaifijrn  ^kp^ijk  Cm  l), 

=  (det  A )aai(eijkUjVk)  =  (detA)aai(U  x  Vfy 


If  det  A  =  1  (no  reflection,  just  a  rotation),  then  (6.3)  is  the  transformation  equation 
for  a  vector.  If  det  A  =  —1  (reflection)  then  the  transformation  has  an  extra  — 1 
factor.  Thus  the  vector  product  of  two  polar  vectors  is  a  pseudovector,  as  we  have 
seen  before  and  as  we  guessed  from  the  fact  that  the  right  hand  rule  is  used  in 
defining  cross  product. 

Example  2.  Find  the  triple  scalar  product  of  3  polar  vectors. 

Here  we  have  one  det  A  factor  (from  the  cross  product),  so  the  triple  scalar 
product  of  3  polar  vectors  is  a  pseudoscalar  (Problem  7). 

Example  3.  What  is  the  tensor  character  of  W  X  S  if  W  is  a  polar  vector  and  S  is  a 
pseudovector? 

In  the  transformation  equation  for  W  X  S,  there  is  one  factor  of  det  A  for  S, 
and  another  det  A  for  the  cross  product  as  in  (6.3).  The  two  minus  signs  cancel,  so 
W  X  S  is  a  polar  vector  (Problem  8). 

Example  4.  Show  that  acceleration  a  and  force  F  are  polar  vectors. 

By  definition,  the  displacement  r  is  a  polar  vector  (we  define  vectors  as  quantities 
which  transform  the  way  displacements  do).  Then  the  velocity  v  =  dv/dt  and  the 
acceleration  a  =  d2r/dt2  are  vectors  (since  time  t  is  a  scalar)  and  F  =  ma  is  a 
vector  since  m  is  a  scalar. 

Example  5.  Find  the  tensor  character  of  each  symbol  in  v  =  w  X  r. 

By  Example  4,  v  is  a  vector  sou  X  r  must  be  a  vector  (both  sides  of  a  tensor 
equation  must  have  the  same  tensor  character).  Then  uj  must  be  a  pseudovector 
so  that  there  are  two  det  A  factors,  one  from  the  cross  product  and  one  from  uj. 
Recall  that  we  predicted  this  because  the  right  hand  rule  is  used  in  defining  angular 
velocity. 
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PROBLEMS,  SECTION  6 

1.  Show  that  in  2  dimensions  (say  the  x ,  y  plane),  an  inversion  through  the  origin  (that 
is,  x'  =  — x,  y'  =  —y)  is  equivalent  to  a  180°  rotation  of  the  ( x,y )  plane  about  the 
z  axis.  Hint:  Compare  Chapter  3,  equation  (7.13)  with  the  negative  unit  matrix. 

2.  In  Chapter  3,  we  said  that  any  3-by-3  orthogonal  matrix  with  determinant  =  —  1  can 
be  written  in  the  form  (7.19).  Use  this  and  Problem  1  to  show  that  in  3  dimensions, 
an  inversion  (that  is  a  reflection  through  the  origin  so  that  all  three  axes  are  reversed) 
is  equivalent  to  a  reflection  through  a  plane  combined  with  a  rotation  about  the  line 
perpendicular  to  the  plane  [say  a  reflection  through  the  ( x ,  y)  plane — that  is,  a 
reversal  of  the  2  axis — and  a  rotation  of  the  ( x,y )  plane  about  the  z  axis].  Hint: 
Consider  the  matrix  B  in  Chapter  3,  (7.19). 

3.  For  Example  1,  write  out  the  components  of  U,  V,  and  U  X  V  in  the  original  right- 
handed  coordinate  system  S  and  in  the  left-handed  coordinate  system  S'  with  the 
z  axis  reflected.  Show  that  each  component  of  U  X  V  in  S'  has  the  “wrong”  sign 
to  obey  the  vector  transformation  laws. 

4.  Do  Example  1  and  Problem  3  if  the  transformation  to  a  left-handed  system  is  an 
inversion  (see  Problem  2). 

5.  Write  the  tensor  transformation  equations  for  tijkf-mnp  to  show  that  this  is  a  (rank 
6)  tensor  ( not  a  pseudotensor).  Hint:  Write  (6.1)  for  each  e  and  multiply  them, 
being  careful  not  to  re-use  a  pair  of  summation  indices. 

6.  Write  the  transformation  equations  to  show  that  V  X  V  is  a  pseudovector  if  V  is  a 
vector.  Hint:  See  equations  (5.13),  (6.2)  and  (6.3). 

7.  Write  the  transformation  equations  for  the  triple  scalar  product  W-  (U  X  V)  remem¬ 
bering  that  now  det  A  =  —  1  if  the  transformation  involves  a  reflection.  Thus  show 
that  the  triple  scalar  product  of  three  polar  vectors  is  a  pseudoscalar  as  claimed  in 
Example  2.  Hint:  Use  the  result  in  (6.3). 

8.  Write  the  transformation  equations  for  W  X  S  to  verify  the  results  of  Example  3. 

In  the  physics  formulas  of  Problems  9  to  14,  identify  each  symbol  as  a  vector  (polar  vector) 
or  a  pseudovector  (axial  vector).  Use  results  from  the  text  and  the  fact  that  both  sides  of 
an  equation  must  have  the  same  tensor  character.  The  definition  of  the  symbols  used  is: 
r  =  displacement,  t  —  time,  m  =  mass,  q  =  electric  charge,  v  =  velocity,  F  =  force,  u  = 
angular  velocity,  r  =  torque,  L  =  angular  momentum,  T  =  kinetic  energy,  E  =  electric 
field,  B  =  magnetic  field.  Assume  that  t,  m,  and  q  are  scalars.  Note  that  we  are  working 
in  3  dimensional  physical  space  and  assuming  classical  (that  is  nonrelativistic)  physics. 


9. 

F 

E  =  — 

Q 

10. 

L  =  mr  X  v  =  mr  X  (u  X  r) 

11. 

t  =  r  X  F 

12. 

F  =  q(E  +  v  X  B) 

13. 

<9B 

w--VxE 

14. 

T  =  im(u  X  r)  •  (w  X  r) 

15. 

In  equation  (5.12),  find  whether  A  X  (B  X  C)  is  a  vector  or  a  pseudovector  assuming 

(a)  A,  B,  C  are  all  vectors; 

(b)  A,  B,  C  are  all  pseudovectors; 

(c)  A  is  a  vector  and  B  and  C  are  pseudovectors. 

Hint:  Count  up  the  number  of  det  A  factors  from  pseudovectors  and  cross  products. 

16.  In  equation  (5.14),  is  V  X  (V  X  V)  a  vector  or  a  pseudovector? 

17.  In  equation  (5.16),  show  that  if  Tjk  is  a  tensor  (that  is,  not  a  pseudotensor),  then 
Vi  is  a  pseudovector  (axial  vector).  Also  show  that  if  Tjk  is  a  pseudotensor,  then  Vi 
is  a  vector  (true  or  polar  vector).  You  know  that  if  Vi  is  a  cross  product  of  polar 
vectors,  then  it  is  a  pseudovector.  Is  its  dual  Tjk  a  tensor  or  a  pseudotensor? 
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7.  M0REAB0UT APPLICATIONS 


Stress  Tensor  We  started  our  discussion  of  tensors  with  a  description  of  the 
stress  tensor  (you  may  want  to  review  this  in  Section  1).  Now  let’s  show  that  the 
nine  quantities  Pij  displayed  in  the  matrix  (1.1)  really  are  the  components  of  a 
2nd-rank  tensor.  For  simplicity  in  notation  (and  to  use  summation  convention),  we 
make  the  replacements  indicated  in  (2.10);  we  also  replace  i,  j,  k  by  ei,  e2,  e3  and 
i',  j',  k'  by  e'l5  e'2,  e'3.  Our  problem  is  to  write  the  components  P'ag  relative  to  a 
rotated  coordinate  system  in  terms  of  the  components 
Pij  to  show  that  P'aa  =  aaia/3jPij  as  in  (2.14)  or  (3.9).  J 


Figure  7.1  shows  the  unprimed  axes  and  one  of  the 
rotated  axes.  (With  a  =  1,2,  3,  the  x'a  axis  represents 
any  one  of  the  rotated  axes.)  We  draw  a  slanted  plane, 
as  shown,  perpendicular  to  the  x'a  axis,  and  consider 
the  forces  on  the  small  volume  element  dV  bounded  by 
the  unprimed  coordinate  planes  and  the  slanted  plane. 
Recall  (Section  1)  that  pressure  is  force  per  unit  area,  so 
the  force  acting  across  a  face  is  the  pressure  times  the 
area  of  the  face.  Let  the  area  of  the  slanted  face  (call  it 
face  a)  be  dS.  Then  the  area  of  the  face  perpendicular 
to  the  Xi  axis  (call  it  face  i)  is  aai  dS  where  aa;  [see 
(2.10)]  is  the  cosine  of  the  angle  between  the  x'a  and  Xi 


Figure  7.1 

axes  (Problem  1). 


(7.1) 


Area  of  face  i  is  equal  to  aai  dS. 


The  pressure  across  face  i  is  Pij&j  (note  the  sum  on  j  and  see  Problem  2).  Mul¬ 
tiplying  this  by  (7.1)  (force  =  pressure  times  area  of  face)  and  summing  on  i,  we 
find  that  the  total  force  acting  on  the  material  in  the  volume  element  dF,  across 
the  three  faces  in  the  unprimed  coordinate  planes  is 


(7.2)  (Pijej)aaidS. 

For  equilibrium,  the  sum  of  these  three  forces  must  be  equal  to  the  force  acting 
across  face  a  on  the  neighboring  material.  This  force  is 

(7.3)  Kpe'pdS 

Setting  (7.2)  and  (7.3)  equal,  taking  the  dot  product  of  both  sides  with  e^,  and 
canceling  dS ,  we  have  (Problem  3) 

(7.4)  Pap  =  aaiapjPij 

Thus  we  see  that  the  stress  Pij  is,  as  claimed,  a  2nd-rank  tensor. 


Example  1.  Suppose  the  following  matrix  is  a  display  of  the  elements  of  a  stress  tensor. 

I  1  3 

P  =  3  -2 

\0  -1 

We  note  that  P  is  symmetric  (this  is  true  of  stress  tensors)  so  we  can  diagonalize  P 
by  an  orthogonal  transformation.  In  Chapter  3,  Section  12,  Example  2,  we  found 
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that  the  eigenvalues  of  this  matrix  are  1.  —4,  3.  Thus  a  rotation  of  axes  (matrix  C 
in  the  Chapter  3  example)  produces  a  stress  tensor  P'  with  stress  components  only 
along  the  principal  axes.  The  positive  eigenvalues  are  tensions  and  the  negative  are 
compressions.  Relative  to  the  principal  axes  there  are  no  shear  forces. 


Strain  and  Stress;  Hooke’s  Law  The  strain  tensor  specifies  the  deformation 
of  a  solid  body  under  stress.  For  a  simple  case  such  as  a  wire  supporting  a  weight, 
strain  (change  in  length  per  unit  length)  and  stress  (force  per  unit  cross  sectional 
area)  are  proportional  (Hooke’s  Law).  But  for  a  3  dimensional  problem,  stress  is 
a  2nd-rank  tensor  Pjj  (as  we  have  seen  above),  and  strain  is  also  a  2nd-rank  tensor 
Sij .  If  the  components  of  P  are  linear  combinations  of  the  components  of  S ,  then 
we  can  write 


(7.5) 


Pij  —  Cijkm^km 


By  the  quotient  rule,  Cijkm  is  a  4th-rank  tensor  (Problem  5).  The  components 
of  Cijkm  depend  on  the  kind  of  material  under  stress  and  are  called  the  elastic 
constants  of  the  material  (see  Problem  6). 


Inertia  Tensor  Revisited  In  Section  4  we  considered  the  inertia  tensor  using 
vector  notation.  Now  let’s  look  at  it  using  the  tensor  form  for  vector  identities  that 
we  discussed  in  Section  5. 


Example 2.  In  (4.2)  we  had  L  =  mr  X  (<*>  X  r).  Using  (5.12)  with  A  =  C  =  r  and  B  =  a>, 
we  find 

(7.6)  Ln  =  m[ r  x  (w  x  r)]„  =  m(SnjSik  -  5nk5ij)xiLOjXk- 

Now  sum  over  i  and  k  to  get  5njSikXiXk  =  SnjXkXk  =  Snjr 2  and  SnkSijXiXk  =  XjXn. 
Thus  we  have  [compare  (4.2)] 

(7.7)  Ln  =  m(5njr2  -  xnXj)ujj. 

The  coefficient  of  ujj  is  then  the  component  Inj  of  the  inertia  tensor. 


(7.8) 


Inj  —  7/7. ( Snj  T  X'fy  Xj  )  . 


We  can  easily  verify  that  these  components  are  the  same  as  we  found  in  Section 
4.  For  example  [compare  (4.4)]: 

(7.9)  In  =  m{r2  -  x\ ),  Ii2  =  —mx ix2,  h 3  =  -mxix3, 

and  similarly  for  the  other  components  (Problem  7). 
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Other  Applications  In  your  study  of  electric  fields  in  matter,  you  will  find  the 
equation  P  =  yE;  this  relates  the  electric  field  E  applied  to  a  dielectric  and  the 
resulting  polarization  P  of  the  dielectric.  For  some  materials  it  may  be  true  that  P 
and  E  are  parallel  vectors  with  x  =  scalar,  but  for  other  materials  P  and  E  are  not 
parallel.  Now  this  should  remind  you  of  our  work  in  Section  4  with  the  equation 
L  =  Iu>  when  we  realized  that  L  and  uj  are  not  always  parallel.  Just  as  we  replaced 
the  scalar  I  by  a  2nd-rank  tensor,  so  we  replace  x  by  a  2nd-rank  tensor.  In  the 
equation  Pj  =  XijEj,  the  quotient  rule  (see  Section  3)  tells  us  that  Xij  is  a  2nd-rank 
tensor.  You  will  find  other  equations  of  this  sort  in  various  applications. 

Tensor  Fields  Recall  from  Chapter  6  that  a  scalar  field  (temperature,  for  exam¬ 
ple)  means  a  single  number  at  each  point,  that  is,  a  single  function  f(x,y,z).  A 
vector  field  (such  as  the  electric  field)  means  a  set  of  three  numbers  at  each  point, 
that  is,  a  set  of  three  functions  Vj{x,  y7  z).  Similarly,  a  2nd-rank  tensor  field  means  a 
set  of  9  numbers  at  each  point,  that  is,  a  set  of  9  functions  Tij(x,  y ,  z).  Think  of  our 
discussion  of  stress  and  strain.  At  every  point  in  the  material  under  stress,  we  can 
think  of  three  vectors  giving  the  force  per  unit  area  across  the  three  perpendicular 
planes  through  the  point,  that  is,  a  set  of  9  functions.  The  4th-rank  tensor  Cijkm  in 
(7.5)  is  then  a  set  of  34  =  81  functions,  and  so  on.  (Of  course,  in  order  to  be  tensors 
these  sets  must  transform  properly  under  rotations  as  discussed  in  this  chapter.) 

PROBLEMS,  SECTION  7 

1.  Verify  (7.1).  Hints:  In  Figure  7.1,  consider  the  projection  of  the  slanted  face  of 
area  dS  onto  the  three  unprimed  coordinate  planes.  In  each  case,  show  that  the 
projection  angle  is  equal  to  an  angle  between  the  x'a  axis  and  one  of  the  unprimed 
axes.  Find  the  cosine  of  the  angle  from  the  matrix  A  in  (2.10). 

2.  Write  out  the  sums  Pij e,  for  each  value  of  i  and  compare  the  discussion  of  (1.1). 
Hint:  For  example,  if  i  =  2  [or  y  in  (1.1)],  then  the  pressure  across  the  face  per¬ 
pendicular  to  the  X2  axis  is  P2iei  +  P22&22  +  ^2363,  or,  in  the  notation  of  (1.1), 
Pyx  i  +  Pyyi  +  Pyz  k. 

3.  Carry  through  the  details  of  getting  (7.4)  from  (7.2)  and  (7.3).  Hint:  You  need  the 
dot  product  of  e'g  and  ej .  This  is  the  cosine  of  an  angle  between  two  axes  since  each 
e  is  a  unit  vector.  Identify  the  result  from  matrix  A  in  (2.10). 

4.  Interpret  the  elements  of  the  matrices  in  Chapter  3,  Problems  11.18  to  11.21,  as 
components  of  stress  tensors.  In  each  case  diagonalize  the  matrix  and  so  find  the 
principal  axes  of  the  stress  (along  which  the  stress  is  pure  tension  or  compression). 
Describe  the  stress  relative  to  these  axes.  (See  Example  1.) 

5.  Show  by  the  quotient  rule  (Section  3)  that  Cijkm  in  (7.5)  is  a  4  -rank  tensor. 

6.  If  P  and  S  are  2nd-rank  tensors,  show  that  92  =  81  coefficients  are  needed  to  write 
each  component  of  P  as  a  linear  combination  of  the  components  of  S.  Show  that 
81  =  34  is  the  number  of  components  in  a  4th-rank  tensor.  If  the  components  of 
the  4th-rank  tensor  are  Cijkm ,  then  equation  (7.5)  gives  the  components  of  P  in 
terms  of  the  components  of  S.  If  P  and  S  are  both  symmetric,  show  that  we  need 
only  36  different  non-zero  components  in  Cijkm-  Hint:  Consider  the  number  of 
different  components  in  P  and  S  when  they  are  symmetric.  Comment:  The  stress 
and  strain  tensors  can  both  be  shown  to  be  symmetric.  Further  symmetry  reduces 
the  36  components  of  C  in  (7.5)  to  21  or  less. 

7.  In  (7.9)  we  have  written  the  first  row  of  elements  in  the  inertia  matrix.  Write  the 
formulas  for  the  other  6  elements  and  compare  with  Section  4. 

8.  Do  Problem  4.8  in  tensor  notation  and  compare  the  result  with  your  solution  of  4.8. 
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8.  CURVILINEAR  COORDINATES 


Before  we  discuss  non-Cartesian  tensors  we  need  to  talk  about  some  properties  of 
curvilinear  coordinate  systems  such  as  spherical  or  cylindrical  coordinates.  To  make 
the  discussion  concrete,  we  shall  illustrate  the  ideas  involved  by  using  two  familiar 
coordinate  systems — rectangular  coordinates  ( x ,  y,  z)  and  cylindrical  coordinates 
(r,  9,  z).  The  elements  of  arc  length  in  these  two  systems  are  given  by 

ds 2  =  dx2  +  dy 2  +  dz2  (rectangular  coordinates) 

ds2  =  dr 2  +  r 2  dO2  +  dz2  (cylindrical  coordinates) 

These  expressions  for  ds  are  called  line  elements ;  they  have  much  greater  signifi¬ 
cance  than  just  their  use  in  computing  arc  lengths.  First  consider  how  we  can  find 
ds 2  for  a  given  coordinate  system.  In  the  case  of  a  well-known  coordinate  system, 
the  answer  may  be  obvious  from  the  geometry.  For  example  in  polar  coordinates  in 
the  plane  we  have  (from  Figure  8.1  and  the  Pythagorean  theorem) 


(8.2)  ds2  =  dr2  +  r2  d92 . 

For  an  unfamiliar  or  complicated  change  of  variables, 
however,  we  need  a  systematic  method  of  finding  ds ; 
we  illustrate  the  method  by  finding  the  value  of  ds 2 
for  cylindrical  coordinates  as  given  in  (8.1).  From  the 
equations 

x  =  rcos0, 

(8.3)  y  =  rsinO, 
z  =  z, 


we  get 


dx  =  cos  9  dr  —  r  sin  9  d9 , 
(8.4)  dy  =  sin  0  dr  +  r  cos  9  d91 

dz  =  dz. 


Squaring  each  equation  in  (8.4)  and  adding  the  results,  we  find 
(8.5)  ds2  =  dx2  +  dy2  +  dz 2  =  dr2  +  r2  d92  +  dz2. 

Notice  particularly  here  that  all  the  cross  products  ( dr  d9 ,  etc.)  canceled  out. 
This  will  not  always  happen,  but  it  often  does;  when  it  does  we  call  the  coor¬ 
dinate  system  orthogonal.  Such  coordinate  systems 
have  some  particularly  simple  and  useful  properties. 

Geometrically,  an  orthogonal  system  means  that  the 
coordinate  surfaces  are  mutually  perpendicular.  For 
the  cylindrical  system  (Figure  8.2),  the  coordinate  sur¬ 
faces  are  r  =  const,  (set  of  concentric  cylinders),  9  = 
const,  (set  of  half-planes) ,  and  z  =  const,  (set  of  planes). 

The  three  coordinate  surfaces  through  a  given  point 
intersect  at  right  angles.  The  three  curves  of  intersec¬ 
tion  of  the  coordinate  surfaces  in  pairs  intersect  at  right 


Figure  8.2 
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angles;  these  curves  are  called  the  coordinate  “lines”  or  directions.  We  draw  unit 
basis  vectors  tangent  to  the  coordinate  directions;  for  the  cylindrical  system  (Figure 
8.2)  we  might  call  them  er,  eg,  e-  (e2  is  identical  to  k).  These  unit  vectors  form 
an  orthogonal  triad  like  i,  j,  k.  We  refer  to  such  coordinate  systems  as  curvilinear 
coordinate  systems  when  the  coordinate  surfaces  (or  some  of  them)  are  not  planes 
and  the  coordinate  lines  (or  some  of  them)  are  curves  rather  than  straight  lines. 
We  shall  be  principally  interested  in  orthogonal  curvilinear  coordinate  systems. 

Scale  Factors  and  Basis  Vectors  In  the  rectangular  system,  if  x,  y ,  z ,  are  the 
coordinates  of  a  particle,  and  x  changes  by  dx  with  y  and  z  constant,  then  the 
distance  the  particle  moves  is  ds  =  dx.  However,  in  the  cylindrical  system,  if  6 
changes  by  dd  with  r  and  z  constant,  the  distance  the  particle  moves  is  not  dd,  but 
ds  =  r  dd.  Factors  like  the  r  in  r  dd  which  must  multiply  the  differentials  of  the 
coordinates  to  get  distances  are  known  as  scale  factors  and  are  very  important  as 
we  shall  see.  A  straightforward  way  to  get  them  is  to  calculate  ds2  as  we  did  in 

(8.5) ;  if  the  transformation  is  orthogonal,  then  the  scale  factors  can  be  read  off  from 
ds2.  (Note  that  the  coefficients  in  ds2  are  the  squares  of  the  scale  factors.)  From 

(8.5) ,  we  see  that  the  scale  factors  for  cylindrical  coordinates  are  1,  r,  1. 

It  is  also  useful  to  consider  a  vector  ds  which  (in  cylindrical  coordinates)  has 
components  dr,  rdd,  dz  in  the  coordinate  directions  er,  eg,  ez: 

(8.6)  ds  =  er  dr  +  eg  r  dd  +  ez  dz. 

Then  ds2  =  ds  •  ds  which  gives  (8.1),  since  the  e  vectors  are  orthonormal. 

We  can  write  the  unit  basis  vectors  of  a  curvilinear  coordinate  system  (er,  eg, 
ez  in  cylindrical  coordinates)  in  terms  of  i,  j,  k.  This  is  useful  when  we  want  to 
differentiate  a  vector  which  is  expressed  in  terms  of  the  curvilinear  coordinate  basis 
vectors.  The  unit  vectors  i,  j,  k  are  constant  in  magnitude  and  direction,  but  er 
and  eg  are  not  fixed  in  direction,  so  their  derivatives  are  not  zero.  We  illustrate 
an  algebraic  method  of  finding  the  relation  between  two  sets  of  basis  vectors  by 
finding  them  for  the  cylindrical  system.  (Compare  the  geometrical  method  shown 
in  Chapter  6,  Section  4.) 

Example  1.  Using  (8.4)  and  collecting  coefficients  of  dr,  dd,  and  dz,  we  find 
ds  =  i  dx  +  j  dy  +  k  dz 

(8.7)  =  i(cos  d  dr  —  r  sin  9  dd)  +  j  (sin  0  dr  +  r  cos  9  dd)  +  k  dz 
=  (icos  d  +  j  sin  d)  dr  +  (— ir  sin  9  +  j  r  cos  9)  dd  +  k  dz. 

Comparing  (8.7)  with  (8.6),  we  have 

er  =  i  cos  9  +  j  sin  d 

(8.8)  reg  =  —  irsind  +  j  rcosd 

ez  =  k. 

Notice  that  er  is  already  a  unit  vector  since  sin2  d  +  cos2  9  =  1,  but  reg  must  be 
divided  by  the  scale  factor  r  to  get  the  unit  vector  eg.  It  is  often  convenient  to 
use  basis  vectors  which  we  shall  call  ar  and  a g  (which  are  not  necessarily  of  unit 
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length),  given  by  the  coefficients  of  dr  and  d6  in  (8.7).  Then  we  just  have  to  divide 
each  a  vector  by  its  magnitude  to  get  the  corresponding  e  vector.  Thus  from  (8.7) 


ar  =  er  is  already  a  unit  vector, 
g)  &g  =  —  ir  sin  9  +  j  r  cos  9  has  magnitude  r,  so 

eg  =  -a g  =  —  i  sin  9  +  j  cos  9 
r 

We  can  use  these  formulas  to  find  the  velocity  and  acceleration  of  a  particle 
in  cylindrical  coordinates,  and  similar  formulas  for  any  coordinate  system.  The 
displacement  of  a  particle  from  the  origin  at  time  t  is,  in  cylindrical  coordinates 
(Figure  8.3), 

s  =  rer  +  zez. 


Then 

By  (8.8), 
d 


ds  dr  d  dz 

It  =  dter  +  rJt{er)+MGz- 


.  .  d9  d9  d9 

s(er)  =  -,sm*-+Jcos((-=e,-, 


SO 


(8.10) 


ds  .  ■ 

—  =  rer  +  r9eg  +  zez. 
dt 


By  differentiating  again  with  respect  to  t  and  using  (8.8)  to  find  ( d/dt)(eg ),  we  can 
find  the  acceleration  d2s/dt2  in  cylindrical  coordinates  (Problem  2). 


General  Curvilinear  Coordinates  In  general,  let  xi,  X2,  23  be  the  set  of  vari¬ 
ables  or  coordinates  we  are  considering  (for  example,  in  cylindrical  coordinates, 
x\  =  r,  x-2  =  9 ,  23  =  z).  Then  the  three  sets  of  coordinate  surfaces  are  x\  = 
const.,  X2  =  const.,  X3  =  const.  The  three  coordinate  surfaces  through  a  given 
point  intersect  in  three  coordinate  lines. 


Example  2.  Given  x,  y ,  z  as  functions  of  x±,  22,  ^3:  we  can  find  ds  and  the  a  vectors  as 
we  did  for  cylindrical  coordinates  in  (8.7)  and  (8.9). 


(8.11) 


ds  =  i  dx  +  j  dy  +  k  dz 

.  dx  .  dy  ,  dz 

=  17 -  dxn  +  J- -  dxn  +  k- -  dxn 

UX-n  OXyi  C/Xn 

=  ai  dx  1  +  a2  dx 2  +  a3  dx 3  =  a„  dxn, 


where 


(8.12) 


3-n  — 


dx. 


.  dx  .  dy 

■s  =  1- - bj- 


dx„ 


dXr, 


dz 

Ldxn 


Now  defining  g,j  =  a,  •  a  j,  we  can  write  ds 2  =  ds  •  ds  in  matrix  form  as  follows: 


311 

312 

313  \ 

/  dx  1 

<721 

322 

323 

dx2 

<731 

332 

333  / 

\  dx3 

(8.13) 


ds2  =  (  dx  1  dx 2  dx 3  ) 
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Note  that  g-j-j  is  symmetric  since  the  dot  product  of  two  vectors  is  the  same  in  either 
order.  In  simpler  form  using  summation  convention  (8.13)  becomes 

(8.14)  ds2  =  gij  dxi  dxj. 

We  will  see  later  (Section  10)  that  the  (jij  are  the  components  of  a  tensor  known  as 
the  metric  tensor. 

If  the  coordinate  system  is  orthogonal,  that  is,  if  the  basis  vectors  (e  or  a)  form 
an  orthogonal  triad,  then  ds  and  ds2  can  be  written  in  terms  of  the  scale  factors  as 
follows: 

(8.15)  ds  =  e\h\  dx i  +  62/12  dx 2  +  63/13  dx 3, 

/  h\  0  0  \  /  dx  1  \ 

(8.16)  ds2  =  (  dxi  dx 2  dx 3  )  (  0  h\  0  dx 2  J  . 

\  0  0  h\  )  \  dx3  ) 

Also  note  that  the  volume  element  in  an  orthogonal  system  is  /11A2/13  dx  1  dx 2  dx^ 
(volume  of  a  small  rectangular  parallelepiped  with  edges  h\  dx  1,  /12  dx 2,  /13  dx 3).  For 
example,  in  cylindrical  coordinates,  the  volume  element  is  dr  ■  r  dO  ■  dz  =  r  dr  d6  dz. 


PROBLEMS,  SECTION  8 

1.  Find  ds2  in  spherical  coordinates  by  the  method  used  to  obtain  (8.5)  for  cylindrical 
coordinates.  Use  your  result  to  find  for  spherical  coordinates,  the  scale  factors,  the 
vector  ds,  the  volume  element,  the  basis  vectors  ar,  ag,  and  the  corresponding 
unit  basis  vectors  er,  eg,  e^.  Write  the  g,j  matrix. 

2.  Observe  that  a  simpler  way  to  find  the  velocity  ds/dt  in  (8.10)  is  to  divide  the 
vector  ds  in  (8.6)  by  dt.  Complete  the  problem  to  find  the  acceleration  in  cylindrical 
coordinates. 

3.  Use  the  results  of  Problem  1  to  find  the  velocity  and  acceleration  components  in 
spherical  coordinates.  Find  the  velocity  in  two  ways:  starting  with  ds  and  starting 
with  s  =  rer. 

4.  In  the  text  and  problems  so  far,  we  have  found  the  e  vectors  for  various  coordinate 
systems  in  terms  of  i  and  j  (or  i,  j,  k  in  three  dimensions).  We  can  solve  these 
equations  to  find  i  and  j  in  terms  of  the  e  vectors,  and  so  express  a  vector  given  in 
rectangular  form  in  terms  of  the  basis  vectors  of  another  coordinate  system.  Carry 
out  this  process  to  express  in  cylindrical  coordinates  the  vector  V  =  y\  —  xj  +  k. 
Hint:  Use  matrices  (as  in  Chapter  3)  to  solve  the  set  of  equations  for  i  and  j. 

5.  Using  the  results  of  Problem  1,  express  the  vector  in  Problem  4  in  spherical  coordi¬ 
nates. 

As  in  Problem  1,  find  ds2,  the  scale  factors,  the  vector  ds,  the  volume  (or  area)  element, 
the  a  vectors,  and  the  e  vectors  for  each  of  the  following  coordinate  systems. 


6.  Parabolic  cylinder  coordinates  u,  v,  z\ 


y  =  uv, 
Z  —  z. 


7.  Elliptic  cylinder  coordinates  u,  v,  z\ 

x  =  a  cosh  u  cos  v, 
y  =  a  sinh  u  sin  v, 
z  —  z. 
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8.  Parabolic  coordinates  u,  v,  (f>:  9.  Bipolar  coordinates  u,  v: 


10. 


X  =  UV  COS  4>, 

y  =  uv  sin  4>, 


a  sinh  u 
cosh  u  +  cos  v  ’ 
a  sin  v 

cosh  u  +  cos  v 


Sketch  or  computer  plot  the  coordinate  surfaces  in  Problems  6  to  9. 


Using  the  expression  you  have  found  for  ds,  and  for  the  e  vectors,  find  the  velocity  and 
acceleration  components  in  the  coordinate  systems  indicated. 


11.  Parabolic  cylinder  12.  Elliptic  cylinder 

13.  Parabolic  14.  Bipolar 

15.  Let  x  =  u  +  v,  y  =  v.  Find  ds,  the  a  vectors,  and  ds2  for  the  it,  v  coordinate  system 
and  show  that  it  is  not  an  orthogonal  system.  Hint:  Show  that  the  a  vectors  are  not 
orthogonal,  and  that  ds2  contains  dudv  terms.  Write  the  gtj  matrix  and  observe 
that  it  is  symmetric  but  not  diagonal.  Sketch  the  lines  u  =  const,  and  v  =  const, 
and  observe  that  they  are  not  perpendicular  to  each  other. 


9.  VECTOR  OPERATORS  IN  ORTHOGONAL  CURVILINEAR  COORDINATES 

We  have  previously  (Chapter  6,  Sections  6  and  7)  defined  the  gradient  (Vu),  the 
divergence  (V  •  V),  the  curl  (V  X  V),  and  the  Laplacian  (V2u)  in  rectangular 
coordinates  x,  y ,  z.  Since  in  many  practical  problems  it  is  better  to  use  some  other 
coordinate  system  (cylindrical  or  spherical,  for  example),  we  need  to  see  how  to 
express  the  vector  operators  in  terms  of  general  orthogonal  coordinates  x±,  £2,  £3- 
(We  consider  only  orthogonal  coordinate  systems  here;  see  Section  10  for  the  more 
general  case.)  We  shall  outline  proofs  of  the  formulas;  some  of  the  details  of  the 
proofs  are  left  to  the  problems. 


Gradient,  Vu.  In  Chapter  6,  Section  6,  we  showed  that  the  directional  derivative 
du/ds  in  a  given  direction  is  the  component  of  Vu  in  that  direction. 


Example  1.  In  cylindrical  coordinates,  if  we  go  in  the  r  direction  (9  and  2  constant), 
then  by  (8.5)  ds  =  dr.  Thus  the  r  component  of  Vu  is  du/ds  when  ds  =  dr,  that 
is,  du/dr.  Similarly,  the  6  component  of  Vu  is  du/ds  when  ds  =  rdO,  that  is, 
(l/r)(du/d9).  Thus  Vu  in  cylindrical  coordinates  is 


(9.1) 


Vu  =  er 


du  1  du  du 
'fr+e0rd9+ezd^- 
In  general  orthogonal  coordinates  x±,  X2,  £3,  the  component  of  Vu  in  the  £1 
direction  (£2  and  £3  constant)  is  du/ds  if  ds  =  h\dx\  [from  (8.11)];  that  is,  the 
component  of  Vu  in  the  direction  ei  is  (l/hi)(du/ dx\).  Similar  formulas  hold  for 
the  other  components  and  we  have 


526  Tensor  Analysis 


Chapter  10 


Divergence,  V  •  V  Let 

(9.3)  V  =  ei  V\  +  e2V2  +  e3V3 


be  a  vector  with  components  Vi,  V2,  V3  in  an  orthogonal  system.  We  can  prove 
(Problem  1)  that 


(9.4) 


=  0. 


Let  us  write  (9.3)  as 

(9.5)  V  =  -p-(h2h3V \)  +  j^—{h\h3V2)  +  ^-(hlh2V3). 

Ail  #2-3  All  Ai2 


We  find  V  •  V  by  taking  the  divergence  of  each  term  on  the  right  side  of  (9.5). 
Using  (7.6)  of  Chapter  6,  namely 


(9.6) 


V  •  (4>v)  =  v  •  (V0)  +  0V  •  v, 


with  cj)  =  h2h3Vi  and  v  =  ei/h2h3,  we  find  that  the  divergence  of  the  first  term  on 
the  right  side  of  (9.5)  is 


(9.7) 


V- 


2  A. 3  Li 


ei 

h2h3 


ei 

h2h3 


•  V ( h2h3V\ )  +  h2h3V\  V  • 


By  (9.4),  the  last  term  in  (9.7)  is  zero.  In  the  first  term  on  the  right  side  of  (9.7), 
the  dot  product  of  ei  with  V f/12 h3 V[ )  is  the  first  component  of  S7(h2h3Vi).  By 
(9.2),  this  is 


1  d 

7—^ — (h2h3Vi). 
h\  ox  1 


Calculating  the  divergence  of  the  other  terms  of  (9.5)  in  a  similar  way,  we  get 


VV  = 


1  1  d 

h2h3  hi  dx\ 


(h2h3Vi)  + 


lid 
h\h3  h2  dx2 


{hih3V2)  + 


lid 
h\h2  h3  dx 3 


(■ h\h2V3 ) 


or 


Example  2.  In  cylindrical  coordinates,  hi  =  1,  h2  =  r,  h3  =  1.  By  (9.8),  the  divergence 
in  cylindrical  coordinates  is 


V- V=  - 

r 


-  i/u 

r  dr  r  r^+  r  d6  +  dz  ' 


(9.9) 
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Laplacian,  V2u.  Since  V2w  =  V  •  Va  we  can  find  V2u  by  combining  (9.2)  and 
(9.8)  with  V  =  V«.  We  get 


(9.10) 


d  ( h,2h3  du\  d  f  h\h3  du  \  d  ( /ii/i2  du 

dxi  \  hi  dxi )  +  dx2  \  h2  dx2 )  dx3  \  h3  dx3 


Example  3.  In  cylindrical  coordinates,  the  Laplacian  is  then 


9  1 

V2u=  - 
r 


d_ 

dr 


ld_ 

r  dr 


du\ 

du 
dr 


d_  fldu ' 
h  dd  \r~dO y 
1  d2u  d2u 

r2  dd2  dz2 


d_ 

dz 


du 

dz 


Curl,  V  X  V.  By  methods  similar  to  those  used  in  finding  V  •  V  we  can  find 
V  X  V  (Problem  2).  The  result  is 


(9.11) 


V  x  V  = 


h3ei 

h2e  2 

h3e3 

1 

d 

d 

d 

hih2h3 

dxi 

dx2 

dx  3 

hiVi 

h2V2 

h3V3 

ei 

h2h3 

+ 


n  o 

j^(h3V3)  - —(h2V2) 

dx  2  dx  3 


e2 


+ 


hih3 

e3 

hi 


^-(hiVi)  -  ^-(h3V3) 

OX  3  OX  i 

^-(h2V2)  -  ^-{hiVi) 

OX  i  OX  2 


Example  4.  In  cylindrical  coordinates,  we  find 


Gr 

reg 

ez 

d 

d 

d 

dr 

dO 

dz 

K 

rVg 

Vz 

(1  dvz 

dVg 

\r 

dO 

dz 

PROBLEMS,  SECTION  9 

1.  Prove  (9.4)  in  the  following  way.  Using  (9.2)  with  u  =  xi,  show  that  V*i  =  ei/hi. 
Similarly,  show  that  Vt2  =  e2//i2  and  V®3  =  e3/h3.  Let  ei,  e2,  e3  in  that  order 
form  a  right-handed  triad  (so  that  ei  X  e2  =  e3,  etc.)  and  show  that  Vxi  X  V.X2  = 
e3/(hih2).  Take  the  divergence  of  this  equation  and,  using  the  vector  identities  (h) 
and  (b)  in  the  table  at  the  end  of  Chapter  6,  show  that  V  •  ( e3/h\h2 )  =  0.  The 
other  parts  of  (9.4)  are  proved  similarly. 


528  Tensor  Analysis 


Chapter  10 


2.  Derive  the  expression  (9.11)  for  curlV  in  the  following  way.  Show  that  Van  =  ei/hi 
and  V  X  (Vii)  =  V  X  ( ei/hi )  =  0.  Write  V  in  the  form 

V  =  +  ^(fcaVa)  +  ^-(h3V3) 

ni  n 2  h3 

and  use  vector  identities  from  Chapter  6  to  complete  the  derivation. 

3.  Using  cylindrical  coordinates  write  the  Lagrange  equations  for  the  motion  of  a  par¬ 
ticle  acted  on  by  a  force  F  =  —  VV,  where  V  is  the  potential  energy.  Divide  each 
Lagrange  equation  by  the  corresponding  scale  factor  so  that  the  components  of  F 
(that  is,  of  -  VL)  appear  in  the  equations.  Thus  write  the  equations  as  the  com¬ 
ponent  equations  of  F  =  raa,  and  so  find  the  components  of  the  acceleration  a. 
Compare  the  results  with  Problem  8.2. 

4.  Do  Problem  3  in  spherical  coordinates;  compare  the  results  with  Problem  8.3. 

5.  Write  out  Vt/,  V  •  V,  V2C/,  and  V  X  V  in  spherical  coordinates. 

Do  Problem  3  for  the  coordinate  systems  indicated  in  Problems  6  to  9.  Compare  the 
results  with  Problems  8.11  to  8.14. 


6. 

Parabolic  cylinder 

7. 

Elliptic  cylinder 

8. 

Parabolic 

9. 

Bipolar 

Do  Problem  5  for  the  coordinate  systems  indicated  in  Problems  10 

10. 

Parabolic  cylinder 

11. 

Elliptic  cylinder 

12. 

Parabolic 

13. 

Bipolar 

In  each  of  the  following  coordinate  systems,  find  the  scale  factors  hu  and  hv;  the  basis 
vectors  eu  and  e„;  the  u  and  v  Lagrange  equations,  and  from  them  the  acceleration 
components  (see  Problem  3). 

x  =  uv, 
y  =  u\J  1  —  v2. 

Use  equations  (9.2),  (9.8),  and  (9.11)  to  evaluate  the  following  expressions 

16.  In  cylindrical  coordinates,  V  •  er,  V  ■  eg,  V  X  er,  V  X  eg. 

17.  In  spherical  coordinates,  V  •  er,  V  •  e®,  V  X  eg,  V  X  e$. 

18.  In  cylindrical  coordinates,  V  X  klnr,  V  lnr,  V  •  ( rer  +  zez). 

19.  In  spherical  coordinates,  V  X  ( reg ),  V(rcos#),  V  •  r. 

20.  In  cylindrical  coordinates,  V2r,  V2(l/r),  V2lnr. 

In  spherical  coordinates,  V2r,  V2(r2),  V2(l/r2),  \/2elkrcos  9 


(  X  =  u  —  V, 

1  y  =  2  y/wi. 


21. 
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10.  NON-CARTESIAN  TENSORS 

So  far  we  have  considered  only  the  behavior  of  the  rectangular  components  of  tensors 
under  orthogonal  transformations.  Now  let’s  generalize  this  to  include  any  change 
of  variables. 


Example  1.  In  spherical  coordinates  r,  9 ,  <f>, 

x  =  r  sin  6  cos  <f>, 

(10.1)  y  =  r  sin  9  sin  (/>, 

z  =  rcos9. 


This  is  not  a  linear  transformation,  and  we  cannot  write  equations  like  (2.4)  to  (2.9) 
for  the  relations  between  the  variables.  However,  we  can  write  such  equations  for 
the  relations  between  the  differentials  of  the  variables.  From  (10.1),  we  find  the 
differentials  dx ,  dy,  dz:  in  terms  of  dr,  d9 ,  dip: 


(10.2) 


dx  \ 

[  1 

(  sin  9  cos  <p 

r  cos  9  cos  <j> 

—r  sin  d  sin 

dy 

= 

sin  9  sin  <p 

r  cos  9  sin  (f> 

r  sin  9  cos  <j> 

dz  ) 

1  \ 

\  COS0 

— rsind 

0 

dr  \ 
d9 

#  ) 


Example  2.  For  general  coordinates  x±,  x 2,  X3,  and  x[,  x'2,  x'3,  if  we  are  given  the  relations 
[like  (10.1)]  between  the  two  sets  of  variables,  we  can  write  the  relations  between 
the  two  sets  of  differentials  as  follows: 


(10.3) 


(dx[ 

dx  i 

dx[\ 

dx  1 

dx  2 

dx  3 

dx  2 

dx'2 

dx  2 

dx\ 

dx  2 

dx3 

dx ^ 

dx i 

dx 4 

dx2 

dx  3/ 

More  simply,  using  index  notation  and  summation  convention,  (10.3)  becomes 


(10.4) 


dx'  -^i-dx 
1  ~  dx ,  r 


Compare  this  with  the  transformation  for  the  partial  derivatives  of  a  function  u, 

.  .  du  du  dxj  dxj  du 

dx\  dxj  dx\  dx\  dxj  ’ 

and  compare  both  (10.4)  and  (10.5)  with  the  transformation  for  a  Cartesian  vector 


(10.6) 


V-  =  CLijVj,  (Cartesian). 


For  Cartesian  vectors  you  can  easily  verify  that 

(10.7)  =  aij  =  ^y,  (Cartesian), 

since  both  the  partial  derivatives  in  (10.7)  equal  the  cosine  of  the  angle  between  the 
x\  and  the  Xj  axes  (Problem  1).  This  is  not  true  for  general  coordinate  systems;  for 
example,  in  (10.1),  dx/d9  ^  d9/dx  (see  Problem  2).  Thus  in  general  we  have  two 
possible  definitions  of  a  vector,  which  become  identical  for  Cartesian  vectors. 
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Contravariant  and  Covariant  Vectors  By  definition,  V  is  a  contravariant 
vector  if  its  components  transform  like  this: 


(10.8)  V-  =  — — - Vj ,  (contravariant  vector), 

OXj 

and  V  is  a  covariant  vector  if  its  components  transform  like  this: 


,  dXn 

(10.9) 

V  =  —Vn 

1  dx[3) 

(covariant  vector). 

By  comparing  (10.4)  and  (10.8),  we  see  that  the  differentials  of  the  coordinates  are 
the  components  of  a  contravariant  vector.  Similarly,  by  comparing  (10.5)  and  (10.9), 
we  see  that  the  partial  derivatives  of  a  function  are  the  components  of  a  covariant 
vector. 


Notation  Before  we  define  tensors  in  general,  we  need  to  discuss  a  few  things 
about  notation.  It  is  customary  to  write  the  indices  of  contravariant  vectors  and 
tensors  as  superscripts  rather  than  subscripts.  Be  careful  not  to  confuse  them 
with  exponents!  (You  may  find  the  mnemonic  “low-co”  useful;  lower  indices  are 
covariant  indices,  so,  of  course,  upper  indices  are  contravariant  indices.)  In  this 
notation,  equation  (10.8)  for  a  contravariant  vector  becomes 


Bx ' 

(10.10) 

V'1  =  ^V3, 
dxj 

(contravariant  vector) . 

(In  fact,  to  be  strictly  consistent,  since  the  differentials  are  contravariant,  we  should 
write  dx'1  /  dx3 .  For  our  purposes  this  seems  unnecessary  so  we  will  leave  the  partial 
derivative  notation  as  it  is.)  Also  note  that  the  summation  convention  now  applies 
to  a  pair  of  indices,  one  upper  and  one  lower.  (An  index  in  the  denominator  counts 
as  a  lower  index  and  an  index  in  the  numerator  counts  as  an  upper  index.)  Note 
that  this  new  rule  about  summation  convention  applies  in  (10.9)  and  (10.10)  and 
watch  for  it  in  future  formulas. 

Components  and  basis  vectors  You  may  be  wondering  how  the  vectors  you 
studied  in  vector  analysis  (Section  9  and  Chapter  6)  are  related  to  covariant  and 
contravariant  vectors.  Actually  we  should  speak  of  covariant  and  contravariant 
components,  but  the  former  terminology  is  customary.  Any  vector  has  various 
sets  of  components  relative  to  various  sets  of  basis  vectors.  Let’s  discuss  this  for 
orthogonal  coordinate  systems  where  it  is  especially  simple.  Recall  that  in  vector 
analysis,  we  use  the  unit  basis  vectors  such  as  i,  j,  k  or  er,  eg,  e^;  for  example,  the 
vectors  in  Section  9  are  all  unit  vectors.  Then  the  components  of  a  vector  V  in 
vector  analysis  are  the  projections  e*  ■  V  of  the  vector  on  the  coordinate  directions. 
To  be  able  to  refer  to  these  components,  let’s  call  them  the  physical  components 
(they  have  the  right  physical  dimensions — see  Problem  6).  We  would  like  to  see 
the  relation  between  the  physical  components  and  the  covariant  and  contravariant 
components  of  a  vector,  and  the  relation  between  the  unit  basis  vectors  and  the 
contravariant  and  covariant  basis  vectors. 
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Example  3.  You  have  learned  that,  in  polar  coordinates,  the  (physical)  components  of  ds 
are  dr  and  rdd.  Now  (10.4)  and  (10.10)  tell  us  that  the  contravariant  components 
of  ds  are  just  dr  and  dd  (not  rdd).  Thus  we  may  guess  (correctly)  that  the  con¬ 
travariant  components  of  a  vector  are  the  physical  components  divided  by  the  scale 
factors.  By  considering  the  components  of  the  gradient  (Problem  4),  you  can  show 
that  the  covariant  components  of  a  vector  are  the  physical  components  multiplied 
by  the  scale  factors. 


Example  4.  In  polar  coordinates  we  can  write  [see  equation  (8.9)] 

(10.11)  ds  =  er  dr  +  eg  r  dd  =  &r  dr  +  a.g  dd. 

We  have  written  ds  in  terms  of  its  physical  components  and  the  unit  vectors,  and 
in  terms  of  its  contravariant  components  and  the  covariant  a;  basis  vectors.  From 
(10.11)  and  from  Section  8  we  can  see  that  the  a;  basis  vectors  are  the  e,  unit  vectors 
multiplied  by  the  scale  factors.  Note  that  the  components  and  the  basis  vectors  used 
with  them  vary  in  opposite  ways  so  that  the  scale  factors  cancel.  Similarly  we  can 
write  a  vector  in  terms  of  its  covariant  components  and  the  contravariant  basis 
vectors  a*  which  are  the  unit  vectors  divided  by  the  scale  factors  (Problem  5).  Note 
carefully  that  what  we  have  just  said  applies  only  to  orthogonal  coordinate  systems. 
If  a  coordinate  system  is  not  orthogonal,  then  a ;  and  a*  are  not  in  general  parallel; 
see  the  discussion  just  after  (10.19). 


Definition  of  Tensors  Tensors  may  be  covariant  of  any  rank,  contravariant  of 
any  rank,  or  mixed.  Here  are  some  sample  tensor  definitions;  you  should  be  able  to 
write  the  corresponding  definitions  for  tensors  of  any  rank  or  kind  in  a  similar  way 
(Problem  7). 


(10.12) 


T' 

13 


ji/ijk 

rp/ij 

1  k 


dxk  dxi 
dx)dx)  kl 

dx'j  dx'3  dx'kTl, 

dxi  dxfji  0xn 


dx\  dx'j  dxn  lm 
dx idxmdx'k  n 


(2nd-rank  covariant  tensor), 

(3ld-rank  contravariant  tensor), 

(3rd-rank  mixed  tensor,  one  covariant 
and  two  contravariant  indices). 


Kronecker  delta  We  showed  in  Section  5  that  5%j  is  a  2nd-rank  isotropic 
Cartesian  tensor.  In  a  general  coordinate  system,  the  2nd-rank  tensor  which  is 
equal  to  1  if  i  =  j  and  0  otherwise  in  all  coordinate  systems,  is  a  mixed  tensor  so 
we  write  it  as  <5®.  To  show  that  this  is  correct  we  write  the  tensor  transformation 
equation  for  5jf  to  see  that  we  get  5'). 


(10.13) 


dx)  dxi  k  _  dx)  dxk  _  dx)  _  ,i 
dx k  dx '  1  dxk  dx '  dx '  ■7 


Thus  we  see  that  6)  is  an  isotropic  2nd-rank  tensor  in  general  coordinate  systems. 
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Quotient  Rule  In  Section  3,  we  discussed  the  quotient  rule  for  Cartesian  tensors. 
A  similar  rule  applies  in  general.  To  give  proofs,  we  must  replace  the  aij  by  the 
appropriate  partial  derivatives,  noting  carefully  that  summation  convention  now 
applies  to  a  sum  over  one  lower  and  one  upper  index. 


Example  5.  If  we  are  given  TijV3  =  Ui  where  V  is  an  arbitrary  contravariant  vector  and 
U  is  a  non-zero  covariant  vector,  we  want  to  show  that  is  a  2nd-rank  covariant 
tensor.  We  write  [compare  equations  (3.6)  to  (3.9)] 


(10.14) 


rr'  y'P  —  tt' 
1a/3V  ~  Ua 


Aiy,  =  =  AiTiJ 

dx'  dx'  dx' 


dx3  V’0 
dx'p 


Set  the  first  and  last  steps  equal;  then  since  V'13  is  arbitrary,  its  coefficient  =  0  and 
we  have 


,  _ _2-'T 

a(3~  dx'a  ijdx'p 

which  is  the  transformation  equation  for  a  2nd-rank  covariant  tensor. 


Metric  Tensor;  Raising  and  Lowering  Indices 

Example  6.  From  (8.14)  we  have  (with  the  contravariant  dx  indices  now  written  as  su¬ 
perscripts) 

(10.15)  ds2  =  gij  dx 1  dx3 . 

Since  ds2  is  a  scalar,  and  each  dx  is  a  contravariant  vector,  it  follows  by  the  quotient 
rule  (Problem  8)  that  is  a  2nd-rank  covariant  tensor.  It  is  known  as  the  metric 
tensor.  If  the  elements  of  are  written  as  a  matrix  [see  (8.13)],  then  we  define 
g'3  as  the  elements  of  the  inverse  matrix.  We  can  interpret  gijg3k  as  either  the 
contracted  direct  product  of  two  tensors,  or  as  the  row  times  column  product  of 
two  matrices  which  are  inverses  of  each  other,  that  is,  a  unit  matrix.  Thus  we  can 
write 

(10.16)  9ijg3k  =  5k. 

Then  by  (10.13)  and  the  quotient  rule,  g13  is  a  2nd-rank  contravariant  tensor. 

Example 7.  If  V1  is  a  contravariant  vector  then  V)  =  gtjV3  is  a  covariant  vector  (Problem 
10).  We  can  also  show  that  gl3Vj  gives  back  the  V1  we  started  with: 

(10.17)  g^Vj  =  gijgjkVk  =  6'kVk  =  V*. 

This  process  of  finding  the  contracted  product  of  a  vector  (or  tensor)  with  g%3 
or  g^  is  called  raising  or  lowering  indices.  The  vectors  V1  and  V)  are  called  the 
contravariant  and  covariant  components  of  the  vector  V. 

In  equation  (8.12),  we  defined  the  covariant  basis  vectors  a,  which  we  use  with 
contravariant  components  to  write  a  vector  [see  (8.11)  for  example,  remembering 
that  the  differentials  are  the  contravariant  components  of  ds].  The  contravariant 
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basis  vectors  to  use  with  covariant  components  are  given  by  a*  =  gl°aj.  We  can 
then  write  a  vector  in  two  ways  (Problem  11): 


(10.18) 


V  =  a,;  V1  =  a1  V-i,  where 


V,  -  a, ,V'.  I"  -  g'-'V,. 
a'  =  gvari  a,  =  gtJaJ . 


It  is  interesting  to  consider  the  directions  of  the  vectors  a^  and  a*.  We  have 
defined  a*  =  g^aj  but  you  can  show  (Problem  12)  that  a*  =  Vxj.  Thus  we  have 


(10.19) 


d  .  dx  .  dy  dz 

j^  +  k- 


at  r-i  ^  ^  J  O  7 

OXi  OXi  OXi  OXi 

i  ij  -7  .dxi  ,dxi 

a  =  gJ^  =  Vxi  =  i— +J- 


dy 


dXj 
:  dz  ' 


We  see  from  the  displacement  vector  ds  =  a  j  dx1  that  the  basis  vectors  a^  are  tan¬ 
gent  to  the  coordinate  lines.  The  vectors  a*  =  V:r,;  are  orthogonal  to  the  coordinate 
surfaces  Xi  =  const.  (Recall  that  grad  it  is  orthogonal  tou=  const.)  For  orthogonal 
coordinates,  a;  and  a*  are  in  the  same  direction.  (For  example,  in  spherical  coor¬ 
dinates,  ar  points  in  the  radial  direction,  and  ar  is  orthogonal  to  the  sphere  r  = 
const.;  these  are  the  same  direction.)  Thus  for  orthogonal  coordinates,  if  we  nor¬ 
malize  each  a\  we  get  the  same  set  of  unit  basis  vectors  that  we  get  if  we  normalize 
each  a, .  However,  if  the  coordinate  system  is  not  orthogonal,  then  at  each  point  we 
have  two  different  sets  of  basis  vectors  a  j  and  a1  (see  Problems  16  and  17). 

Just  as  we  did  for  vectors,  any  tensor,  say  T'-k ,  can  be  written  in  various  different 
forms  by  raising  and  lowering  indices  to  get  T^fe,  Tvk,  Tk  .  These  tensors  are  called 
associated  tensors.  They  really  all  represent  the  same  tensor  T,  with  components 
relative  to  various  bases. 


Orthogonal  coordinate  systems  For  orthogonal  coordinate  systems,  formulas 
involving  can  be  written  in  terms  of  the  scale  factors  h\,  h 2,  /13  [compare  (8.13) 
and  (8.16)].  Remember  that  the  glJ  matrix  is  the  inverse  of  the  <7^  matrix  [see 
equation  (10.16)].  Also  let  g  represent  the  determinant  of  the  gtJ  matrix.  Then  you 
can  show  (Problem  13). 


(10.20) 


9ij  — 


0,  i  j, 
hi,  i  =  j, 


g  =  h\h\h\ , 


0,  i  ±  j , 

1  .  . 


Vg=  hih2h3 


Vector  Operators  in  Tensor  Notation  We  state  without  proof  the  following 
tensor  expressions  for  Vit,  V  •  V,  and  V2n.  They  are  correct  for  any  coordinate 
system,  orthogonal  or  not.  Using  (10.20),  you  can  specialize  them  to  orthogonal 
coordinate  systems  and  so  obtain  the  expressions  given  in  Section  9.  (Problems  14 
and  15). 


(10.21) 

(10.22)  V-V 

(10.23) 


du 

The  covariant  components  of  Vu  are  — — . 

OXi 

Id 

——  — — (y/gVz),  where  V1  are  contravariant  components  of  V. 
v  9  vXi 


V2m  = 


1  d 

x fgdxi 


V~9  91' 


du 

dxi 
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PROBLEMS,  SECTION  10 


i. 


2. 


3. 


4. 


5. 


6. 


7. 


8. 


9. 


10. 


11. 


12. 


13. 


Verify  equation  (10.7).  Hint:  Use  equations  (2.4)  to  (2.6)  and  (2.10).  For  example, 
dy'/dz  =  dz/dy'  =  ri2  =  d23- 

From  (10.1)  find  dd/dx  =  (1/r)  cos  9  cos  (f>  and  show  that  dx/dd  ^  dd/dx.  Note 
carefully  that  dx/dd  means  that  r  and  <j>  are  constant,  but  dd/dx  means  that  y  and 
2  are  constant.  (See  Chapter  4,  Example  7.6  for  further  discussion.) 

Divide  equation  (10.4)  by  dt  to  show  that  the  velocity  v  =  ds/dt  is  a  contravariant 
vector.  Note  that  the  contravariant  components  of  the  velocity  in  polar  coordinates 
are  r  and  6  ( not  r  and  rd  which  are  physical  components).  As  we  did  in  (10.11), 
write  the  velocity  v  in  polar  coordinates  in  terms  of  the  unit  e  vectors  and  in  terms 
of  the  covariant  a  vectors.  Repeat  the  problem  in  spherical  coordinates. 

What  are  the  physical  components  of  the  gradient  in  polar  coordinates?  [See  (9.1)]. 
The  partial  derivatives  in  (10.5)  are  the  covariant  components  of  Vu.  What  rela¬ 
tion  do  you  deduce  between  physical  and  covariant  components?  Answer  the  same 
questions  for  spherical  coordinates,  and  for  an  orthogonal  coordinate  system  with 
scale  factors  hi,  A2,  A3. 

Write  V11  in  polar  coordinates  in  terms  of  its  physical  components  and  the  unit 
basis  vectors  e^,  and  in  terms  of  its  covariant  components  and  the  contravariant 
basis  vectors  a1.  What  is  the  relation  between  the  contravariant  basis  vectors  and 
the  unit  basis  vectors?  Hint:  Compare  equation  (10.11)  and  our  discussion  of  it. 

Show  that,  in  polar  coordinates,  the  d  contravariant  component  of  ds  is  dd  which  is 
unitless,  the  d  physical  component  of  ds  is  r  dd  which  has  units  of  length,  and  the  d 
covariant  component  of  ds  is  r2  dd  which  has  units  (length)2. 

As  in  (10.12),  write  the  transformation  equations  for  the  following  tensors:  2nd-rank 
contravariant,  3rd-rank  covariant,  4th-rank  mixed  with  2  covariant  and  2  contravari¬ 
ant  indices. 

Using  (10.15)  show  that  gtj  is  a  2nd-rank  covariant  tensor.  Hint:  Write  the  transfor¬ 
mation  equation  for  each  dx,  and  set  the  scalar  ds12  =  ds2  to  find  the  transformation 
equation  for  gij. 

If  Uz  is  a  contravariant  vector  and  V)  is  a  covariant  vector,  show  that  UlVj  is  a 
2nd-rank  mixed  tensor.  Hint:  Write  the  transformation  equations  for  U  and  V  and 
multiply  them. 

Show  that  if  Vx  is  a  contravariant  vector  then  Vi  =  gijVJ  is  a  covariant  vector,  and 
that  if  14  is  a  covariant  vector,  then  V1  =  g Vj  is  a  contravariant  vector. 

In  (10.18),  show  by  raising  and  lowering  indices  that  aiV1  =  aIVi.  Also  write  (10.18) 
for  an  orthogonal  coordinate  system  with  gij  and  gl]  written  in  terms  of  the  scale 
factors. 


Show  that  in  a  general  coordinate  system  with  variables  xi,  12,  *3,  the  contravariant 
basis  vectors  are  given  by 


a1  =  Vxi 


.dxj  .dx^  dxj_ 

1  dx  dy  dz  ' 


Hint:  Write  the  gradient  in  terms  of  its  covariant  components  and  the  a*  basis 
vectors  to  get  Vu  =  a 3 du/dxj  and  let  u  =  Xi. 

Verify  (10.20). 
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14.  Using  equations  (10.20)  to  (10.23),  write  the  gradient,  divergence,  and  Laplacian  in 
cylindrical  coordinates  and  in  spherical  coordinates.  Change  covariant  or  contravari- 
ant  components  to  physical  components  and  compare  with  the  formulas  stated  in 
Chapter  6,  Sections  6  and  7. 

15.  Do  Problem  14  for  an  orthogonal  coordinate  system  with  scale  factors  hi,  /12,  /13, 
and  compare  with  the  Section  9  formulas. 

16.  Continue  Problem  8.15  to  find  the  g13  matrix  and  the  contravariant  basis  vectors. 
Check  your  result  by  solving  the  given  equations  for  u  and  v  in  terms  of  x  and  y, 
and  finding  the  contravariant  basis  vectors  using  Problem  12.  On  your  Problem 
8.15  sketches  of  the  lines  u  =  const,  and  v  =  const.,  also  sketch  the  covariant  and 
contravariant  basis  vectors.  Observe  that  the  covariant  basis  vectors  lie  along  the 
lines  u  =  const,  and  v  =  const,  and  the  contravariant  basis  vectors  lie  along  the 
normals  to  these  lines. 

17.  Repeat  Problems  8.15  and  10.16  above  for  the  {u,  v)  coordinate  system  if  x  =  2u  —  v, 
y  =  u  —  2v. 

18.  Using  (10.19),  show  that  a1  •  ai  =  5]. 

11.  MISCELLANEOUS  PROBLEMS 

1.  Show  that  the  transformation  equation  for  a  2nd-rank  Cartesian  tensor  is  equivalent 
to  a  similarity  transformation.  Warning  hint:  Note  that  the  matrix  C  in  Chapter 
3,  Section  11,  is  the  inverse  of  the  matrix  A  we  are  using  in  Chapter  10  (compare 
r'  =  Ar  and  r  =  Cr').  Thus  a  similarity  transformation  of  the  matrix  T  with  tensor 
components  Tij  is  T'  =  ATA  1 .  Also  see  “Tensors  and  Matrices”  in  Section  3  and 
remember  that  A  is  orthogonal. 

2.  Let  ei,  e2,  e3  be  a  set  of  orthogonal  unit  vectors  forming  a  right-handed  system  if 
taken  in  cyclic  order.  Show  that  the  triple  scalar  product  ei  ■  (e,  X  e^)  =  tijk- 

3.  In  Chapter  3,  Problem  6.6,  you  are  asked  to  prove  some  identities  among  the  Pauli 
spin  matrices  (called  A,  B,  C,  in  that  problem).  Call  the  Pauli  spin  matrices  a  1,  02, 
<73;  then  show  that  the  identities  can  be  written  in  the  following  summation  forms: 

&k&m  =  'i^-kmn  &  n  +  $k  m  ? 

&k&m£kmn  —  2i(Jn. 

4.  If  E  =  electric  field  and  B  =  magnetic  field,  is  E  X  B  a  vector  or  a  pseudovector? 
Comment :  E  X  B//xo  is  called  the  Poynting  vector;  it  points  in  the  direction  of 
transfer  of  energy.  Does  that  tell  you  from  the  physics  whether  it  is  a  vector  or  a 
pseudovector? 

Do  Problems  5  to  8  for  the  («,  v )  coordinate  system  if  x  =  u(l  —  v),  y  =  u\/2v  —  v2. 

5.  Find  ds2,  the  scale  factors,  the  area  element,  the  vector  ds,  the  unit  basis  vectors, 
and  the  covariant  and  contravariant  basis  vectors. 

6.  Use  Lagrange’s  equations  to  find  the  u  and  v  acceleration  components. 

7.  Write  VI/,  V  •  V,  and  V2U. 

8.  Evaluate  V  •  eu,  V  X  e„,  V2lnw. 

9.  If  u  is  a  vector  specifying  the  displacement  under  stress  of  each  point  of  a  deformable 
medium,  then  Vu  is  a  2  -rank  Cartesian  tensor  (see  Problem  11)  which  describes 
the  strain  at  each  point.  Display  the  components  of  Vu  as  a  matrix.  Write  Vu  as 
the  sum  of  a  symmetric  tensor  and  an  antisymmetric  tensor  [see  (3.5)].  Comment : 
The  symmetric  part  of  Vu  is  called  the  stress  tensor  and  the  antisymmetric  part 
the  rotation  tensor. 
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10.  Show  that  elements  Rij  of  a  rotation  matrix  are  the  elements  of  a  Cartesian  tensor. 
Hints:  Could  you  use  the  quotient  rule?  Could  you  use  Problem  1? 

11.  Show  that  the  nine  quantities  Tij  =  dVi /dxj  (which  are  the  Cartesian  components  of 
VV  where  V  is  a  vector)  satisfy  the  transformation  equations  (2.14)  for  a  Cartesian 
2nd-rank  tensor.  Show  that  they  do  not  satisfy  the  general  tensor  transformation 
equations  as  in  (10.12).  Hint:  Differentiate  (10.9)  or  (10.10)  partially  with  respect 
to,  say,  x'k .  You  should  get  the  expected  terms  [as  in  (10.12)]  plus  some  extra  terms; 
these  extraneous  terms  show  that  dVi/dxj  is  not  a  tensor  under  general  transfor¬ 
mations.  Comment:  It  is  possible  to  express  the  components  of  VV  correctly  in 
general  coordinate  systems  by  taking  into  account  the  variation  of  the  basis  vectors 
in  length  and  direction. 

12.  The  square  matrix  in  equation  (10.3)  is  called  the  Jacobian  matrix  J;  the  determi¬ 
nant  of  this  matrix  is  the  Jacobian  J  =  det  J  which  we  used  in  Chapter  5,  Section  4 
to  find  volume  elements  in  multiple  integrals.  (Note  that  as  in  Chapter  3,  J  repre¬ 
sents  a  matrix;  J  in  italics  is  its  determinant.)  For  the  transformation  to  spherical 
coordinates  in  (10.1)  and  (10.2)  show  that  J  =  det  J  =  r2  sin  61  Recall  that  the 
spherical  coordinate  volume  element  is  r2  sin#  dr  dd  d(f>.  Hint:  Find  JTJ  and  note 
that  det(JTJ)  =  (det  J)2. 

13.  In  equation  (10.13),  let  the  x'  variables  be  rectangular  coordinates  x,  y,  z,  and 
let  xi,  X2,  X3,  be  general  curvilinear  coordinates,  orthogonal  or  not  (see  end  of 
Section  8).  Show  that  JTJ  is  the  gij  matrix  in  (8.13)  [or  in  (8.16)  for  an  orthogonal 
system].  Thus  show  that  the  volume  element  in  a  general  coordinate  system  is 
dV  =  ^/gdxidx2dx3  where  g  =  det  {gij),  and  that  for  an  orthogonal  system,  this 
becomes  [by  (8.16)  or  (10.19)],  dV  =  h\h,2  A3  dx\  dx2  dx3.  Hint:  To  evaluate  the 
products  of  partial  derivatives  in  JTJ,  observe  that  the  same  expressions  arise  as  in 
finding  ds2.  In  fact,  from  (8.11)  and  (8.12),  you  can  show  that  row  i  times  column 
j  in  JTJ  is  just  a;  •  a  j  =  gtj  in  equations  (8.11)  to  (8.14). 
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Special  Functions 


1.  INTRODUCTION 

The  integrals  and  series  and  functions  of  this  chapter  arise  in  a  variety  of  physical 
problems.  Just  as  you  learn  about  trigonometric  functions,  logarithms,  etc.,  and 
use  them  in  applied  problems,  so  you  should  learn  something  about  these  special 
functions  so  that  you  can  use  them  and  understand  their  use  as  they  come  up  in  your 
more  advanced  work.  An  enormous  amount  of  detail  is  known  about  these  functions, 
and  numerous  formulas  involving  them  exist  and  can  be  looked  up  in  books  or  found 
in  your  computer  program.  Our  purpose  is  not  to  study  them  intensively,  but  to 
give  definitions  and  some  of  the  simpler  relations  and  show  their  use.  This  should 
develop  your  ability  and  confidence  to  cope  with  more  complicated  formulas  and 
many  other  similar  functions  and  relations  that  may  crop  up  occasionally  in  texts 
or  computer  results. 

Now  you  may  be  thinking  that  your  computer  will  give  you  the  answers  for  def¬ 
inite  integrals  and  functions  so  you  really  don’t  need  to  bother  with  this  chapter. 
If  all  you  want  is  a  numerical  approximation,  this  may  be  true.  However,  in  theo¬ 
retical  work,  you  often  need  an  exact  expression  (say  in  terms  of  7r  or  \/3  or  In  2) 
and  your  computer  may  not  give  you  the  form  you  need. 


Example  1.  Suppose  you  want  J^2  dO/yJ cos  9.  One  computer  program  gives  you  the 
result  y/2K{l/y/2)  and  another  gives  you  2y/7r  T(5/4)/T(3/4).  In  books  you  find 
answers  \B{  1/4, 1/2)  and  [T(  1  / 4)] 2/ a/87t.  What’s  going  on  here  and  which  is  right? 
They  all  are!  And  when  you  have  studied  the  formulas  in  this  chapter,  you  will  be 
able  to  show  this  (Problem  12.21)  just  as  you  now  recognize  that  sin2  0=1  —  cos2  9. 

Also  in  some  problems  you  may  want  an  algebraic  approximation  for  a  complicated 
expression  rather  than  a  numerical  answer. 


Example  2.  You  will  find  in  thermal  physics  the  approximation  In  TV!  =  TV  In  TV  —  TV;  we 
will  discuss  this  approximation  and  its  accuracy.  (See  Problem  11.3). 
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2.  THE  FACTORIAL  FUNCTION 

Let  us  calculate  the  values  of  some  integrals.  For  a  >  0, 


(2.1) 


e~axdx=  --e 


1 

a 


Next  differentiate  both  sides  of  this  equation  repeatedly  with  respect  to  a  (see 
Chapter  4,  Section  12): 


—xe  ax  dx  = - 7T  or 


xe  ax  dx  =  -rr , 


a * 


v2e~axdx  = 


x3e~ax  dx  = 


a° 

3! 


v4' 


or  in  general 

(2.2) 

Putting  a  =  1,  we  get 
(2.3) 


f 


xne~°iX  dx  = 


n\ 


a 


,n+ 1 


xne  x  dx  =  n\ ,  n  =  1,2,3,- 


Thus  we  have  a  definite  integral  whose  value  is  n!  for  positive  integral  n.  We  can 
use  (2.3)  to  give  a  meaning  to  0!.  Putting  n  =  0  in  (2.3),  we  get 


(2.4) 


0!  =  / 

Jo 


~-x  =  1. 
0 


(This  agrees  with  our  previous  definition  of  0!  in  Chapter  1.) 

PROBLEMS,  SECTION  2 


In  Chapter  4,  Section  12,  do  Problems  14  to  17. 


3.  DEFINITION  OFTHE  GAMMA  FUNCTION;  RECURSION  RELATION 

So  far  n  has  been  a  nonnegative  integer;  it  is  natural  to  define  the  factorial  function 
for  nonintegral  n  by  the  definite  integral  (2.3).  There  is  no  real  objection  to  the 
notation  n\  for  nonintegral  n  (and  we  shall  occasionally  use  it),  but  it  is  customary  to 
reserve  the  factorial  notation  for  integral  n  and  to  call  the  corresponding  function 
for  nonintegral  n  the  gamma  (T)  function.  It  is  also  rather  common  practice  to 
replace  n  by  the  letter  p  when  we  do  not  necessarily  mean  an  integer.  Following 
these  conventions,  we  define,  for  any  p  >  0 


(3.1) 


/•OO 

r(p)  =  /  x^e^  dx , 

Jo 


p  >  0. 
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For  0  <  p  <  1,  this  is  an  improper  integral  because  zp_1  becomes  infinite  at  the 
lower  limit.  However,  it  is  a  convergent  integral  for  p  >  0  (Problem  1).  For  p  <  0, 
the  integral  diverges  and  so  cannot  be  used  to  define  T  (p) ;  we  shall  see  in  Section  4 
how  to  define  T(p)  when  p  <  0.  Then  from  (3.1)  and  (2.3)  we  have 


(3.2) 

POO 

r(n)  =  /  xn-1e~xdx=(n-  1)!, 
Jo 

POO 

T(n  +  1)=  /  xne~xdx  =  n\. 

Jo 

Thus 

r(l)  =  0!  =  1,  r(2)  =  1!  =  1,  F(3)  =  2!  =  2,  r(4)  =  3!  =  6,  •••, 

with  the  usual  meaning  of  factorial  for  positive  integral  n.  The  fact  that  T(n)  = 
(n  —  1)!  and  not  n\  is  unfortunate  but  that’s  the  notation  which  is  used,  so  watch 
out  for  it.  Replacing  p  by  p  +  1  in  (3.1),  we  have 

POO 

(3.3)  r(p+  1)  =  /  xpe~xdx  =  p\,  p>—  1. 

Jo 

Some  authors  use  the  factorial  notation  p\  =  r(p  +  1)  even  though  p  is  not  an 
integer;  this  avoids  the  nuisance  of  the  p  +  1. 

Let  us  integrate  (3.3)  by  parts,  calling  xp  =  u,  e~xdx  =  dv,  then  we  get 

du  =  pxp~1dx,  v  =  —e~x, 

/*00 

T(p  +  1)  =  —  xpe~x\<*>  —  /  (—e~x)pxp-1dx 

Jo 

xp~1e~x  dx  =  pT(p). 


This  equation 


(3.4) 


r(p  + 1)  =pT(P) 


is  called  the  recursion  relation  for  the  T  function.  It  can  be  used  to  simplify  ex¬ 
pressions  involving  T  functions  or  to  write  them  in  a  different  form  (much  as  you 
use  trigonometric  identities). 

Example.  By  (3.4)  we  find  T(9/4)  =  (5/4)r(5/4)  =  (5/4)(l/4)F(l/4); 
then  T(l/4)  4-  T(9/4)  =  16/5. 

PROBLEMS,  SECTION  3 

1.  The  integral  in  (3.1)  is  improper  because  of  the  infinite  upper  limit  and  it  is  also 
improper  for  0  <  p  <  1  because  a:p_1  becomes  infinite  at  the  lower  limit.  However, 
the  integral  is  convergent  for  any  p  >  0.  Prove  this. 
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Use  the  recursion  relation  (3.4),  and  if  needed,  equation  (3.2)  to  simplify: 

2.  r(2/3)/r(5/3)  3.  r(2/3)/r(8/3)  4.  r(2/5)/r(12/5) 

5.  r(i/2)r(4)/r(9/2)  6.  r(io)/r(8)  7.  r(4)r(3/4)/r(7/4) 


Express  each  of  the  following  integrals  as  a  E  function.  By  computer,  evaluate  numerically 
both  the  r  function  and  the  original  integral. 


8. 

POO 

/  x2^3e~x  dx 

9. 

4 

/  e  x  dx  Hint:  Put  x4  =  u. 

Jo 

Jo 

f°°  k  2 

10. 

/  x~2^be~x  dx 

11. 

/  x  e~x  dx  Hint:  Put  x2  =  u. 

Jo 

Jo 

12. 

3 

/  xe  x  dx 

13. 

f  x2  (\n  dx  Hint:  Put  x  =  e~u. 

Jo 

Jo  \  x) 

r  1 

POO 

14. 

/  vTn x  dx 

15. 

/  x~1/3e~8xdx 

Jo 

Jo 

16. 

A  particle  starting  from  rest  at  x  = 

1  moves  along  the  x  axis  toward  the  origin. 

Its  potential  energy  is  V  =  \m  \nx.  Write  the  Lagrange  equation  and  integrate  it 
to  find  the  time  required  for  the  particle  to  reach  the  origin.  Caution:  dx/dt  <  0. 
Answer:  E(I). 

17.  Express  as  a  E  function 

p-i 


/ 

Jo 


In  (  - 


dx.  Hint :  See  Problem  13. 


4.  THE  GAMMA  FUNCTION  OF  NEGATIVE  NUMBERS 

For  p  <  0,  r(p)  has  not  so  far  been  defined.  We  shall  now  define  it  by  the  recursion 
relation  (3.4)  solved  for  T(p). 


(4.1) 

r(p)  =  £r(p  +  i) 

p 

defines  T(p)  for  p  <  0. 

Example. 

r(-°-3)  =  qhr<°-7»’  r<-L3>  =  (-1.3)(-0.3)r(°-7)' 

and  so  on.  Since  T(l)  =  1,  we  see  that 

r(p)  =  £^±11  _>  oo  as  p  -►  0. 

P 

From  this  and  successive  use  of  (4.1)  it  follows  that  T(p)  becomes  infinite  not  only 
at  zero  but  also  at  all  the  negative  integers.  In  the  intervals  between  the  negative 
integers,  it  is  alternately  positive  and  negative,  negative  from  0  to  —1,  positive  from 
—  1  to  —2,  and  so  on,  as  you  can  see  from  computations  like  those  for  T(— 0.3)  and 
r(— 1.3)  above.  See  Problems  5.1  and  5.2. 
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5.  SOME  IMPORTANT  FORMULAS  INVOLVING  GAMMA  FUNCTIONS 

First  we  evaluate  T  ( | ) .  By  definition 


(Note  that  it  does  not  matter  what  letter  we  use  for  the  dummy  variable  of  in¬ 
tegration  in  a  definite  integral.)  Put  t  =  y2  in  (5.1);  then  dt  =  2 ydy,  and  (5.1) 
becomes 

/»  OO  -i  /»00 

r(i)  =  /  -  e~v  2ydy  =  2  /  e~v  dy 

Jo  y  Jo 

or,  with  x  as  the  dummy  integration  variable, 


(5.2) 


dx. 


Multiply  these  two  integrals  for  T(|)  together  and  write  the  result  as  a  double 
integral: 

p  OO  (‘OO 

[r(i)]2  =  4/  /  e-^2+y2)dxdy. 

Jo  Jo 

This  is  an  integral  over  the  first  quadrant;  it  can  be  more  easily  evaluated  in  polar 
coordinates: 


/»7t/2  /* 

[r(^)]  =  4  / 

Jo  Jo 


-  dr  dO  =  4  •  —  • 


2  -2 


Therefore 


(5.3) 


r(^)  =  Vtt- 


We  state  here  another  important  formula  involving  T  functions  (for  proof,  see 
Chapter  14,  Section  7,  Example  5): 


(5.4) 

T(p)T(l  -p)  =  — ■ 

sin  7 vp 

Notice  that  (5.4)  also  gives  T  (|)  =  y/n  if  we  put  p  = 
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PROBLEMS,  SECTION  5 

1.  Using  (5.3)  with  (3.4)  and  (4.1),  find  F(3/2),  T(— 1/2),  and  T(— 3/2)  in  terms  of  -^/n. 

2.  Without  computer  or  tables,  but  just  using  facts  you  know,  sketch  a  quick  rough 
graph  of  the  V  function  from  —2  to  3.  Hint:  This  is  easy;  don’t  make  a  big  job  of 
it.  From  Section  3,  you  know  the  values  of  the  F  function  at  the  positive  integers  in 
terms  of  factorials.  From  Problem  1,  you  can  easily  find  and  plot  the  V  function  at 
±1/2,  ±3/2.  (Approximate  as  a  little  less  than  2.)  From  (4.1)  and  the  discussion 
following  it,  you  know  that  the  T  function  tends  to  plus  or  minus  infinity  at  0  and 
the  negative  integers,  and  you  know  the  intervals  where  it  is  positive  or  negative. 
After  sketching  your  graph,  make  a  computer  plot  of  the  T  function  from  —5  to  5 
and  compare  your  sketch. 

3.  In  Chapter  1,  equations  (13.5)  and  (13.6),  we  defined  the  binomial  coefficients  (p) 
where  n  is  a  non-negative  integer  but  p  may  be  negative  or  fractional.  Show  that 
(p)  can  be  written  in  terms  of  T  functions  as 

( A  =  r(p±  i) 

In)  n!T(p  —  n  ±  1) 


4.  Prove  that,  for  positive  integral  n: 

r  (n+i)  = 


!  _  1  ■  3  ■  5  ■  •  ■  (2n  —  1)  r  _  (2n)! 


4nn\ 


Vn- 


5.  Use  (5.4)  to  show  that 

(a)  r(|  —  n)F(i  ±  n)  =  (— l)nn  if  n  =  a  positive  integer; 

(b)  (z!)(— z)\  =  nz/  sinnz,  where  2  is  not  necessarily  an  integer;  see  comment  after 
equation  (3.3). 

6.  Prove  that 


f 

dn  r 


tp™  = 


xp  1e  Xlnxdx, 


p—  1  —X 


e  x(ln  x)n  dx. 


7.  In  the  Table  of  Laplace  Transforms  (end  of  Chapter  8,  page  469),  verify  the  T 
function  results  for  L5  and  L6.  Also  show  that  L(l/vT)  =  \Jix  jp. 


6.  BETA  FUNCTIONS 


The  beta  function  is  also  defined  by  a  definite  integral: 


(6.1) 


B{p,q)=  [  xp  1(1  —  x)q  1  dx,  p>  0,  q  >  0. 
Jo 


There  are  a  number  of  simple  transformations  of  (6.1)  which  are  useful  to  know 
[see  (6.3),  (6.4),  (6.5)].  It  is  easy  to  show  that  (Problem  1) 


(6.2) 


B(p,q)  =  B{q,p). 
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The  range  of  integration  in  (6.1)  can  be  changed  by  putting  x  =  y/a\  then  x  =  1 
corresponds  to  y  =  a,  and  (6.1)  becomes 


(6.3)  B(p,q) 


[a  ( yy _1  (\  _  ^y-1  (^i 

J0  \aJ  V  a /  a 


1 

ap+?-t 


y)q  1  dy. 


To  obtain  the  trigonometric  form  of  the  beta  function,  let  x  =  sin2  9 ;  then 

dx  =  2  sin  6  cos  6  dd,  (1  —  x)  =  1  —  sin2  9  =  cos2  9 , 
x  =  1  corresponds  to  9  =  7t/2. 

With  these  substitutions,  (6.1)  becomes 

,77/2 

B(p7q)  =  /  (sin2  0)p_1(cos2  0)<J^12sin0cos0(i6i  or 
Jo 


Finally,  let  x  =  y/(l  +  y)  in  (6.1);  then  we  get  (Problem  2): 


(6.5) 


B(p,q) 


yp  Xdy 

(1  +  y)P+<l  ' 


PROBLEMS,  SECTION  6 

1.  Prove  that  B(p,q)  =  B(q,p).  Hint:  Put  x  =  1  —  y  in  Equation  (6.1). 

2.  Prove  equation  (6.5). 

3.  Show  that  for  integral  n,  to, 

Tint  ^  (n  +  m-1 
1/B(n,  to)  =  to 

\  n  —  1 

Hint:  See  Chapter  1,  Section  13C,  Problem  13.3. 


7.  BETA  FUNCTIONS  IN  TERMS  OF  GAMMA  FUNCTIONS 


Beta  functions  are  easily  expressed  in  terms  of  T  functions.  We  shall  show  that 


(7.1) 


B(p,  q) 


r(p)r  (q) 
r  (p  +  q)  ' 


Thus  we  can  evaluate  a  B  function  in  terms  of  T  functions  (see  example  below) . 
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To  prove  (7.1),  we  start  with 

pOO 

T(p)  =  /  tp~1e~t  dt 

Jo 

and  put  t  =  y2 .  Then  we  have 

pOO 

(7.2)  T(p)  =  2  /  y^e-v2  dy. 

Jo 

Similarly  (remember  that  the  dummy  integration  variable  can  be  any  letter), 

poo 

T(q)  =  2  x2q-xe~x 2  dx. 

Jo 

Next  we  multiply  these  two  equations  together  and  change  to  polar  coordinates: 

r(p)T(g)  =  4 

poo  pit 

(7.3)  =  4  /  /  (r  cos  0)2q~ 1  (r  s’m  0)2p~ 1  e~ r  r  dr  d6 

Jo  Jo 

poo  r77/^1 

=  4  /  r2p+2q-1e~r2  dr-  /  (cos  dfq~1  (sin  Of^dO. 

Jo  Jo 


poo  poo 


x2q-ly2p-le-(^W)dxdy 


/0  Jo 

poo  /*7t/2 


The  r  integral  in  (7.3)  is  ^T(p  +  q)  by  (7.2).  The  8  integral  in  (7.3)  is  \B(p,q)  by 

(6.4).  Then  T(p)T(g)  =  4  •  ^T(p  +  q)  ■  \B(p,  q)  and  (7.1)  follows. 

Example.  Find 


poo  3 


i  = 


xA  dx 
(1  +  x)5 ' 


This  is  (6.5)  with  (p  +  q)  =  5,  p  —  1  =  3  or  p  =  4,  q  =  1.  Then  I  =  13(4, 1).  By 
(7.1),  this  is 

T(4)T(1)  3!  1 

T(5)  “  4!  ~  4' 


PROBLEMS,  SECTION  7 

Express  the  following  integrals  as  B  functions,  and  then,  by  (7.1),  in  terms  of  F  functions. 
When  possible,  use  T  function  formulas  to  write  an  exact  answer  in  terms  of  7r,  \/2,  etc. 
Compare  your  answers  with  computer  results  and  reconcile  any  discrepancies. 


r 


a;4  dx 

\/l  —  X‘ 


/■tt/2  -  r  1 

2.  /  v  sin3  x  cos  x  dx  3. 

Jo  Jo 


dx 


1.  f  x2 (1  —  x2)3/2  dx  5.  f 
Jo  Jo 


°°  y2  dy 

OTW6 


,  f 


vT  —  x3 
ydy 

(i  +  y3)2 


! 

Jo 


r/2 


dO 


/  sin  8 


*  L 


2  x2  dx 
V2^x 


7. 
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9.  Prove  B(n,n )  =  B(n,  ^)/22n_1.  Hint:  In  (6.4),  use  the  identity  2sin#cos0  =  sin 26 
and  put  29  =  <fi.  Use  this  result  and  (5.3)  to  derive  the  duplication  formula  for  T 
functions: 

P(2n)  =  -^22rl-1r(n)r(n+  I). 

Check  this  formula  for  the  case  n  =  |  by  using  (5.4). 

Computer  plot  the  graph  of  x3  +  y3  =  8.  Write  the  integrals  for  the  following  quantities 
(see  Chapter  5  if  needed)  and  evaluate  them  as  B  functions. 

10.  The  first  quadrant  area  bounded  by  the  curve. 

11.  The  centroid  of  this  area. 

12.  The  volume  generated  when  the  area  is  revolved  about  the  y  axis. 

13.  The  moment  of  inertia  of  this  volume  about  its  axis. 

8.  THE  SIMPLE  PENDULUM 


A  simple  pendulum  means  a  mass  m  suspended  by  a  string  (or 
weightless  rod)  of  length  l  so  that  it  can  swing  in  a  plane,  as  shown 
in  Figure  8.1.  The  kinetic  energy  of  m  is  then 

(8.1)  T  =  ^mv2  =  ^m(W)2 . 

If  the  potential  energy  is  zero  when  the  string  is  horizontal,  then  at 
angle  6  it  is 

V  =  —  mgl  cos  9. 

Then  the  Lagrangian  is  (see  Chapter  9,  Section  5) 


w/m/////A'///////////A 


Figure  8.1 


L  =  T  -V  =  - ml20 2 
2 


mgl  cos  9 , 


and  the  Lagrange  equation  of  motion  is 


or 


mgl  sin  0  =  0 


(8.2) 


0  =  -ysin0. 


Example  1.  Suppose  the  pendulum  executes  such  small  vibrations  that  sin0  can  be  ap¬ 
proximated  by  0.  Then  (8.2)  becomes  the  usual  equation  for  the  simple  harmonic 
motion  of  a  pendulum  executing  small  vibrations,  namely 

(8.3)  0=-|  9. 

The  solutions  of  (8.3)  are  sinwt  and  cos  a jt  where  w  =  2nu  =  \/g/l',  the  period  of 
the  motion  is  then  (see  Chapter  7,  Problem  2.13,  and  Chapter  8,  Problem  5.34) 

T  =  y  =  2t Ty/ljg. 


(8.4) 
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We  now  want  to  replace  this  approximate  solution  by  one  which 
is  exact  even  for  large  9. 


Example  2.  Going  back  to  the  differential  equation  of  motion  (8.2),  we  multiply  both 
sides  of  it  by  6  and  integrate,  thus  obtaining 

99  =  —  y  sin  99  or  9  d9  =  —  j  sin  9  dO ; 

(8.5)  i  91  =  j  cos  9  +  const. 

We  shall  come  back  to  the  general  solution  of  this  equation  when  we  discuss  elliptic 
integrals;  for  now  let  us  find  the  period  for  180°  swings  (back  and  forth  from  —90° 
to  +90°).  For  this  case,  9  =  0  when  9  =  90°,  so  the  constant  in  (8.5)  is  zero,  and 
we  have  (compare  Chapter  8,  Problem  7.13) 


-02  —  -1  cos  9, 
2  l 


d9 

dt 


■  V  cos  9, 


d9 


Vcos 9 


■  dt. 


From  9  =  0  to  9  =  90°  is  one-quarter  of  a  period;  hence  the  period  for  180°  swings 
is  given  by  T  in  the  equation 


t/2 


d0 


l2g 


rT/ 4 


Vcos  9 


=  \  —  /  dt  =  \ 


2  g  T 


Then  the  period  is 


(8.6) 


T  =  4 


d9 

Vcos 9 


We  can  see  by  comparing  (8.6)  with  (6.4)  that  this  is  a  B  function. 
By  computer  or  tables  we  find  that  T  =  7 A2^/l/ g  (see  Problem  1 
and  Problem  12.21).  We  can  find  the  period  for  only  this  one  special 
case  (180°  swings)  by  B  functions;  the  general  case  gives  an  elliptic 
integral  (Section  12). 


PROBLEMS,  SECTION  8 

1.  Complete  the  pendulum  problem  to  find  the  period  for  180°  swings  as  a  multiple  of 
\JTfg  [that  is,  evaluate  the  integral  in  (8.6)]. 

2.  Suppose  that  a  car  with  a  door  open  at  right  angles  (9  =  90°)  starts  up  and  ac¬ 
celerates  at  a  constant  rate  a  =  1  mph/sec.  The  differential  equation  for  6(t )  is 
9  =  —  ,4 sin#  where  A  =  3a/2w  for  a  uniform  door  of  width  w.  If  w  =  3.5  ft,  find 
how  long  it  takes  for  the  door  to  close. 

3.  The  figure  is  part  of  a  cycloid  with  parametric  equa¬ 
tions 

x  =  a(9  +  sin  9),  y  =  a(l  —  cos9). 

(The  graph  shown  is  like  Figure  4.4  of  Chapter  9 
with  the  origin  shifted  to  P2O  Show  that  the  time 
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for  a  particle  to  slide  without  friction  along  the  curve  from  (x\,y\)  to  the  origin  is 
given  by 


t  = 


dy 


Vy(yi  -  v ) 


Hint:  Show  that  the  arc  length  element  is  ds  =  sj 2a/y  dy.  Evaluate  the  integral  to 
show  that  the  time  is  independent  of  the  starting  height  y\ . 


9.  THE  ERROR  FUNCTION 

You  will  meet  this  function  in  probability  theory  (Chapter  15,  Section  8),  and 
consequently  in  statistical  mechanics  and  other  applications  of  probability  theory. 
You  have  probably  heard  of  “grading  on  a  curve.”  The  “curve”  means  the  bell¬ 
shaped  graph  of  y  =  e~x  (see  Problem  1);  the  error  function  is  the  area  under  part 
of  this  curve.  We  define  the  error  function  as 


(9.1) 


erf  (a;) 


dt. 


Although  this  is  the  usual  definition  of  erf(*),  there  are  other  closely  related  inte¬ 
grals  which  are  used  and  sometimes  referred  to  as  the  error  function.  Consequently, 
you  need  to  look  carefully  at  the  definition  in  the  reference  you  are  using  (text, 
tables,  computer).  Here  are  some  integrals  you  may  find  and  their  relation  to  (9.1) 
(see  Problem  2). 


The  standard  normal  or  Gaussian  cumulative  distribution  function  <3>(:r)  [see 
Chapter  15,  equation  (8.5)]: 

1  fX 

(9.2a)  <l>(x)  =  /  e_t  dt  =  \  +  \  erf(a;/v/2  ), 

V27T  J — oo 

(9.2b)  <J>(a;)  —  i  =  — - )L=  f  e~l  dt  =  \evi{x/V2). 

V  27t  Jo 


We  can  also  use  (9.2)  to  write  erf(x)  in  terms  of  the  standard  normal  cumulative 
distribution  function  [Chapter  15,  equation  (8.5)]. 


(9.4)  erf  (2)  =  24>(a:v/2)  —  1- 
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We  next  consider  several  useful  facts  about  the  error  function.  You  can  easily 
prove  that  the  error  function  is  odd;  that  is,  erf(— x)  =  —  erf  (a;)  (Problem  3).  We 
show  that  erf(oo)  =  1  as  follows: 


(9.5) 


erf(oo) 


•to 


by  (5.2)  and  (5.3).  For  very  small  values  of  x,  erf  (a;)  can  be  approximated  by  ex¬ 
panding  e^f  in  a  power  series  and  integrating  term  by  term.  We  get 


For  large  x,  say  x  >  3,  erf(x)  differs  from  erf(oo)  =  1  [see  (9.5)]  by  less  than 
1CT4  (and  of  course  even  less  for  larger  x).  We  are  then  usually  interested  in 
1  —  erf  (a;)  =  erfc(x).  This  is  best  approximated  by  an  asymptotic  series;  we  shall 
discuss  such  expansions  in  Section  10. 

The  function  erfi(x),  called  the  imaginary  error  function,  is  similar  to  the  error 
function  but  with  a  positive  exponential.  We  define 

2  fx  2 

(9.7)  erfi(x)  =  —=  /  et  dt. 

V71"  Jo 

You  can  show  (Problem  5)  that  erf(ix)  =  ierfi(x). 

The  Fresnel  integrals  (Chapter  1,  Section  15)  are  related  to  the  error  function 
(Problem  6).  Also  see  Section  10,  Problem  3  for  other  relations  involving  error 
functions. 


PROBLEMS,  SECTION  9 

_  2 

1.  Sketch  or  computer  plot  a  graph  of  the  function  y  =  e  x  . 

2.  Verify  equations  (9.2),  (9.3),  and  (9.4).  Hint:  In  (9.2a),  you  want  to  write  $(x) 
in  terms  of  an  error  function.  Make  the  change  of  variable  t  =  uy/2  in  the  4>(x) 
integral.  Warning:  Don’t  forget  to  adjust  the  limits;  when  t  =  x,  u  =  x/\/2. 

3.  Prove  that  erf(x)  is  an  odd  function  of  x.  Hint:  Put  t  =  —s  in  (9.1). 

/OO  2 

e~y  dy  =  V2n 

-OO 

(a)  by  using  (9.5)  and  (9.2a); 

(b)  by  reducing  it  to  a  V  function  and  using  (5.3). 

5.  Replace  x  by  ix  in  (9.1)  and  let  t  =  iu  to  show  that  erf (ix)  =  ierfi(x),  where  erfi(:c) 
is  defined  in  (9.7). 
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6.  Assuming  that  x  is  real,  show  the  following  relation  between  the  error  function  and 
the  Fresnel  integrals. 


erf 


Hint:  In  (9.1),  make  the  change  of  variables  t  =  — -=-u. 

v  2 


(cos  u2  +  i  sin  u 2 )  du. 


1  —  i 


10.  ASYMPTOTIC  SERIES 

Since  you  have  spent  some  time  learning  to  test  series  for  convergence,  it  may 
surprise  you  to  learn  that  there  are  divergent  series  which  can  be  of  practical  use. 
We  can  show  this  best  by  an  example. 


Example  1.  From  (9.3a) 

(10.1)  erfc(x)  =  1  —  erf(x) 


dtt. 


We  are  going  to  expand  the  integral  in  (10.1)  in  a  series  of  inverse  powers  of  x.  To 
do  this  we  write 


(10.2) 


1  d 
t  dt 


and  integrate  by  parts  as  follows: 


Now  in  the  last  integral  in  (10.3),  write  (1  /t2)e  *2  =  (1  /t3)(d/dt)(—^e  *2),  and 
again  integrate  by  parts: 


Continue  this  process,  and  substitute  (10.3)  and  the  following  steps  into  (10.1)  to 
get  (Problem  1) 


(10.4) 


erfc(x)  =  1  —  erf(x) 


e"*2  /  1  1-3  1-3-5 

~  2^2  +  (2^2)2  ~  +  '  " 


[Use  this  when  |x|  ^  1.  Compare  (9.6).] 
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(We  shall  explain  the  exact  meaning  of  the  symbol  ~  shortly.)  This  series  diverges 
for  every  x  because  of  the  factors  in  the  numerator.  However,  suppose  we  stop  after 
a  few  terms  and  keep  the  integral  at  the  end  so  that  we  have  an  exact  equation.  If 
we  stop  after  the  second  term,  we  have 


(10.5) 


erfc(x) 


dt. 


There  is  no  approximation  here.  This  is  not  an  infinite  series  so  there  is  no  question 
of  convergence.  However,  we  shall  show  that  the  integral  at  the  end  is  negligible 
for  large  enough  x\  this  will  then  make  it  possible  for  us  to  use  the  rest  of  (10.5) 
[that  is,  the  first  two  terms  of  (10.4)]  as  a  good  approximation  for  erfc(x)  for  large 
x.  This  is  the  meaning  of  an  asymptotic  series.  As  an  infinite  series  it  may  diverge, 
but  we  do  not  use  the  infinite  series.  Instead,  using  an  exact  equation  [like  (10.5) 
for  this  example],  we  show  that  the  first  few  terms  which  we  do  use  give  a  good 
approximation  if  x  is  large. 


Example  2.  Now  let’s  look  at  the  integral  in  (10.5);  we  want  to  estimate  its  size  for  large 
x.  The  t  in  the  integrand  takes  values  from  x  to  oo;  therefore  t>x  or  l/x>l/t 
for  all  values  of  t  from  £  to  oo.  Let  us  write  the  integral  as 


dt. 


We  increase  the  value  of  this  integral  if  we  replace  1/t5  by  l/x5  since  1/x  >  1/t. 
Thus 


dt  < 


1  f°°  2 

dt=  —  fe_t  dt 

x5  Jx 


When  we  stop  in  (10.5)  with  the  term  in  e  1  /x3,  the  error  is  of  the  order  of 
e~x  /a;5,  which  becomes  much  smaller  than  e~x  /x3  as  x  increases.  Thus  we  have 
shown  that  two  terms  of  (10.4)  give  a  good  approximation  for  erfc(x)  when  x  1. 
A  similar  result  can  be  shown  for  an  approximation  using  any  number  of  terms  of 
the  asymptotic  series  (10.4)  with  the  error  depending  on  the  “left-over”  integral  and 
the  value  of  x. 

We  can  make  the  above  discussion  more  precise.  For  (10.4),  we  have  seen  that 
if  we  stop  after  the  term  in  x~3e~x  ,  the  error  is  of  the  order  of  x~°e~x  .  Then  the 
ratio  of  the  error  to  the  last  term  kept  (namely  x~5e~x  4-  x~3e~x  —  x~2)  tends 
to  zero  as  x  tends  to  infinity,  that  is,  the  approximation  becomes  increasingly  good 
for  larger  x  as  we  have  said.  The  “error”  in  an  asymptotic  expansion  means  in 
general  the  difference  between  the  function  being  expanded  and  a  partial  sum  (first 
N  terms)  of  the  series.  A  series  is  called  an  asymptotic  expansion  (about  oo)  of  a 
function  f(x)  if,  for  each  fixed  N,  the  ratio  of  the  error  to  the  last  (nonzero)  term 
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kept,  tends  to  zero  as  x  — >  oo.  In  symbols 

OO 

f(x)  ~  ) 


n— 0 


(10.6) 


if  for  each  fixed  N 


n— 0 


N 


f{x)  -  ^2^>n(x) 


n— 0 


-T-  4>n{x)  —■ >  0  as  x  — >  oo. 


Frequently,  the  terms  of  an  asymptotic  series  (about  oo)  are  inverse  powers  of  x.  [We 
could  write  (10.4)  this  way  by  multiplying  through  by  e°°  .]  Then  (10.6)  becomes 


a „ 
x” 


(10.7) 


/(*)  -  E  5 


f(x)  ~  E  ■ 

71—0 

if  for  each  fixed  N 

JV 

xN  — >  0  as  x  — >  oo. 


71=0 


We  can  also  have  asymptotic  series  about  the  origin  (or  any  point — compare  Taylor 
series).  We  say  that 


(10.8) 


f(x)  ~  ^2  anxn 

71=0 

if  for  each  fixed  N 


f(x)  - 


N 

E 

71=0 


anx 


—  x 


N 


0  as  x  — >  0. 


Although  we  have  discussed  the  particularly  interesting  case  of  divergent  asymp¬ 
totic  series,  it  is  not  necessary  for  such  series  to  diverge.  Note  that  to  test  a  series 
for  convergence,  we  fix  x  and  let  n  tend  to  infinity;  to  see  if  a  series  is  asymptotic, 
we  fix  n  and  let  x  tend  to  a  limit.  A  given  series  may  meet  both  tests,  or  only  one 
or  the  other  (or  neither) . 


PROBLEMS,  SECTION  10 

1.  Carry  through  the  algebra  to  get  equation  (10.4). 

2.  The  integral  f°°  tp~1e~t  dt  =  F(p,x)  is  called  an  incomplete  T  function.  [Note  that 
if  x  =  0,  this  integral  is  T(p).[  By  repeated  integration  by  parts,  find  several  terms 
of  the  asymptotic  series  for  T(p,x). 

3.  Express  the  complementary  error  function  erfc(x)  as  an  incomplete  E  function  (see 
Problem  2)  and  use  your  result  in  Problem  2  to  obtain  (again)  the  asymptotic 
expansion  of  erfc(x)  as  in  (10.4). 
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4. 


5. 

6. 

7. 


e~xt  fx  eJ 

En(x)  =  /  - dt,  n  =  0,  1,  2,  •  •  •,  and  Eify)  =  /  —  dt,  and  other  similar 

J 1  tn  7-oo  * 

integrals  are  called  exponential  integrals.  By  making  appropriate  changes  of  variable, 
show  that 


(a)  Exix)  =  f 
J  X 


t 


■  dt 


(b)  Eify)  =  -  J  — 

r  e}^_ 

Jo  t 


dt 


(c)  Ei(x)  —  -Ei(-x)  (d)  /  - — dt  =  E1(-l/x) 

Jo  t 

( Caution :  Various  notations  are  used;  check  carefully  the  notation  in  references  you 
are  using.) 

(a)  Express  E\(x )  as  an  incomplete  T  function. 

(b)  Find  the  asymptotic  series  for  E\{x). 

/X 

- — -.  Express  as  exponential  integrals 

w  “(*)  (»>  “(»■>  w  £  wm 

Computer  plot  graphs  of 
(a)  En(x)  for  n  =  0  to  10  and  x  =  0  to  2; 

(b)  E\[x)  and  Ei(x)  for  x  —  0  to  2; 


(c)  the  sine  integral  Si(i)  =  /  - - -  dt  and  the  cosine  integral  Ci(a;)  =  — 

Jo  t 

for  x  =  0  to  4-7T. 


F00  cos  t 

Jx  ~ 


11.  STIRLING'S  FORMULA 

Formulas  involving  n !  or  T(p)  are  not  convenient  to  simplify  algebraically  or  to 
differentiate.  Here  is  an  approximate  formula  for  the  factorial  or  T  function  known 
as  Stirling’s  formula  which  can  be  used  to  simplify  formulas  involving  factorials: 


(11.1)  n!  ~  nne  nV2nn  or  T(p+l)~ppe  p\/2tt p.  Stirling’s  formula 


The  sign  ~  (read  “is  asymptotic  to”)  means  that  the  ratio  of  the  two  sides 

n\ 

nne~n\f72/nn 

tends  to  1  as  n  — >  oo.  Thus  we  get  better  approximations  to  n!  as  n  becomes  large. 
Actually  the  absolute  error  (difference  between  the  Stirling  approximation  and  the 
correct  value)  increases ,  but  the  relative  error  (ratio  of  the  error  to  the  value  of  n!) 
tends  to  zero  as  n  increases.  To  get  some  idea  of  how  this  formula  arises,  we  outline 
what  could,  with  a  little  more  detail,  be  a  derivation  of  it.  (For  more  detail,  consult 
advanced  calculus  books.)  Start  with 

/•OO  POO 

(11.2)  r(p+l)=p!=  /  xpe~xdx=  /  ePlnx~xdx. 

Jo  Jo 

Substitute  a  new  variable  y  such  that 


dt 


x  =  p+  yy/p. 
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Then 


dx  =  y/pdy, 

x  =  0  corresponds  to  y  =  —  y/p, 


and  (11.2)  becomes 

/OO 

e  Pln(p+vVp)-p-yVp^dy. 

-y/p 

For  large  p ,  the  logarithm  can  be  expanded  in  the  following  power  series: 


(11.4) 


In (p  +  yy/p)  =  In p  +  In  ^1  +  j  =  In  p  +  -^=  - 


Substituting  (11.4)  into  (11.3),  we  get 


The  first  integral  is  easily  shown  to  be  y/2n  (Problem  9.4).  The  second  integral 
tends  to  zero  as  p  — >  oo,  and  we  have 

p\  ~  j? e~p  yj 2np 

which  is  (11.1).  With  more  work,  it  is  possible  to  find  an  asymptotic  expansion  for 

F(p+1): 

(11.5)  r(p  +  l)  =p\=p»e-P^(l  +  ^  +  ^  +  •")■ 

This  is  another  example  of  an  asymptotic  series  which  is  divergent  as  an  infinite 
series;  however,  the  first  term  alone  (Stirling’s  formula)  is  a  good  approximation 
when  p  is  large,  and  the  second  term  can  be  used  to  estimate  the  relative  error 
(Problem  1). 


PROBLEMS,  SECTION  11 

1.  Use  the  term  l/(12p)  in  (11.5)  to  show  that  the  error  in  Stirling’s  formula  (11.1)  is 
<  10%  for  p  >  1;  <  1%  for  p  >  10;  <  0.1%  for  p  >  100;  <  0.01%  for  p  >  1000. 

2.  (a)  To  see  the  results  in  Problem  1  graphically,  computer  plot  the  percentage  error 

in  Stirling’s  formula  as  a  function  of  p  for  values  of  p  from  1  to  1000.  Make 
separate  plots,  say  for  p  =  1  to  10,  10  to  100,  100  to  1000,  to  make  it  easier  to 
read  values  from  your  plots. 

(b)  Repeat  part  (a)  for  the  percentage  error  in  (11.5)  using  two  terms  of  the  asymp¬ 
totic  series,  that  is,  Stirling’s  formula  times  [1  +  l/(12p)]. 
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3.  In  statistical  mechanics,  we  frequently  use  the  approximation  In  IV!  =  N  In  N  —  N , 
where  N  is  of  the  order  of  Avogadro’s  number.  Write  out  In  AH  using  Stirling’s 
formula,  compute  the  approximate  value  of  each  term  for  N  =  1023,  and  so  justify 
this  commonly  used  approximation. 


4. 


Use  Stirling’s  formula  to  evaluate 


(2n)\V* 

nWi  22"(n!)2 


5. 


Use  Stirling’s  formula  to  evaluate 


lim 

n — >oo 


r(n+f) 

V/nU(n  +  1) ' 


6.  Use  equations  (3.4)  and  (11.5)  to  show  that  T(p)  ~  ppe  p^/2n/p  ^1  +  +  ■  ■  ■  j  . 

7.  The  function  i/>(p)  =  ^  lnT(p)  is  called  the  digamma  function,  and  the  polygamma 

functions  are  defined  by  il>n(p)  =  [  Warning :  Some  authors  define  ijj(p)  as 

£lnp!  =  £lnr(p+l).] 

(a)  Show  that  ip(p  +  1)  =  ip(p)  +  j;.  Hint:  See  (3.4). 

(b)  Use  Problem  6  to  obtain  t/)(p)  ~  lnp  —  A.  —  .... 

8.  Sketch  or  computer  plot  a  graph  of  y  =  In  a;  for  x  >  0.  Show  that  Inn!  is  between  the 
values  of  the  integrals  J2”+1  lna;  dx  and  f™lnxdx.  (Hint:  Inn!  =  In  1  +  ln  2  +  ln  3+- •  • 
is  the  sum  of  the  areas  of  rectangles  of  width  1  and  height  up  to  the  lna;  curve  at 
x  —  1,  2,  3,  ■  •  •.)  By  considering  the  values  of  the  two  integrals  for  very  large  n  as 
in  Problem  3,  show  that  Inn!  =  nlnn  —  n  approximately  for  large  n. 


9.  The  following  expression  occurs  in  statistical  mechanics: 


P  = 


(np  +  u)\  ( nq  —  u)\ 


np-\-u  nq—u 

p  q  ■ 


Use  Stirling’s  formula  to  show  that 

1  npx  nqy  fc\ 

p  ~  x  p  y  ^/2impqxy, 

U  'LL 

where  x  =  1  H - ,  y  =  1 - ,  and  p  +  q  =  1.  Hint:  Show  that 

np  nq 


/  \np-\-u  /  \ nq  —  u  n  np-\-u  nq— 

(np)  (nq)  =np  q 


and  divide  numerator  and  denominator  of  P  by  this  expression. 

10.  Use  Stirling’s  formula  to  find  liinrl^00(n!)1^’1/n. 


12.  ELLIPTIC  INTEGRALS  AND  FUNCTIONS 

This  is  another  collection  of  integrals  and  related  functions  which  may  arise  in  ap¬ 
plied  problems  and  as  computer  answers  (see  problems) .  We  shall  merely  summarize 
the  basic  definitions  and  properties — there  are  whole  books  on  the  subject — and  you 
may  find  useful  formulas  and  information  in  your  computer  program  and  in  reference 
books  and  tables. 
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Legendre  Forms  The  Legendre  forms  of  the  elliptic  integrals  of  the  first  and 
second  kinds  are: 


(12.1) 


F(<t>,k)=  f 
Jo 


d6 


o  \/l  —  k2  sin2  6 


0  <  k  <  1, 


E(cf>,k)  =  f  \/ 1  —  k2  sin2  8  dd,  0  <  k  <  1. 
Jo 


There  is  also  an  elliptic  integral  of  the  third  kind  which  occurs  less  frequently.  In 
(12.1),  <p  is  called  the  amplitude  and  k  is  called  the  modulus  of  the  elliptic  integral. 


Jacobi  Forms  If  we  put  t  =  sin 6,  x  =  simp  in  the  Legendre  forms  (12.1),  we 
obtain  the  Jacobi  forms  of  the  elliptic  integrals  of  the  first  and  second  kind: 


t  =  sin  9 , 

,  dt  dt 

dt  =  cos  0  dO  or  dO  = - -  =  -^=^= . 

cos  9  ST~t2 

The  limits  9  =  0  to  <p  become  t  =  0  to  x. 


Then 


(12.2) 


F(cp,k) 

E(cp,k) 


y/l  —  t 2  /l  —  k2t2 
Vl  -  k2  sin2  9  d9  =  [  —  ~  IS  dt. 

Jo  Jo  VT 


Complete  Elliptic  Integrals  The  complete  elliptic  integrals  of  the  first  and 
second  kind  are  the  values  of  F  and  E  when  <p  =  7t/2  or  x  =  sin  <p  =  1: 


(12.3) 


K  or  K(k)  =  F 


E  or  E(k )  =  E 


d9 

y/l  —  k2  sin2  9 
\J\  —  k2  sin2  9  d9 


dt 


a/1  —  t2  \/l  —  k2t2 


a/1  —  k2t2 

~VT^W 


dt. 


Warning:  The  notation  used  for  elliptic  integrals  is  not  uniform.  Most  references 
use  F  and  E,  but  you  may  find  <p  replaced  by  x  =  sin  </>,  and  instead  of  k  you  may 
find  to  =  k2,  or  sin-1  k.  Also  (( f>,k )  may  be  written  as  ( k,(p ),  and  other  variations 
exist.  So  check  carefully  the  notation  of  any  book  or  computer  program  you  are 
using  and  reconcile  the  results  with  the  notation  used  here. 


/■’ r/3  j - 

Example  1.  /  yl  —  (1/2)  sin2  9d9  =  E(<p,  k)  =  E(jr/3,l/y/2)  in  our  notation.  Other 

Jo 

books  or  computer  programs  might  give:  E(cp,m)  =  E(ir/3,l/2),  or  E(x,k)  = 
E(\/ 3/2, 1/ a/2 )  or  E(^,sin_1fc)  =  E(tt/3,tt/4J),  etc.  Of  course,  all  of  them  will 
give  the  same  numerical  approximation  0.964951. 

Many  integrals  can  be  written  in  the  form  of  one  of  the  integrals  in  (12.2). 
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r  / - 

Example  2.  /  v  16  —  8  sin2  9  dd  becomes  4  times  the  integral  in  Example  1  if  we  divide 

Jo 

f*/ 3  j - 

out  a  factor  of  4  to  get  4  /  1  —  (1/2)  sin2  9  d6. 

Jo  v 


Example  3. 


(■2/5 


=  F(<f>,k)  =  F(sin  4,2)  in  the  notation  of  (12.2), 


i  Vl~^  Vl  -  4 f  2 

except  that  we  have  previously  required  k  <  1,  and  here  k  =  2.  However,  we  can 
put  this  integral  in  the  standard  form  with  k  <  1  by  making  the  change  of  variable 
4 t2  =  r2,  or  r  =  2t.  Substituting  this  into  the  given  integral  gives 


dr  12 

sjl  -  r2/ 4  Vl  -  r2 


which,  by  (12.2),  is  \F((j),k)  =  If  (sin  1  |).  (See  Problem  24.) 

It  is  sometimes  useful  to  note  that  the  integrands  in  elliptic  integrals  are  all 
functions  of  sin2  9  and  so  are  even  functions  of  9.  Thus  an  elliptic  integral  from 
~4>i  to  4>2  (</> i  and  (j> 2  both  positive)  is  equal  to  the  integral  from  0  to  </>i  plus  the 
integral  from  0  to  <f>2  and  we  have 


sin2  9  d9  =  E((f> i,  k )  +  E((f> 2,  k) 


and  a  similar  formula  for  F(cj),  k).  Also  we  may  note  that  a  function  of  sin2  9  has 
period  7 r  and  is  symmetric  about  9  =  mr  +  7r/2  (look  at  a  graph  of  sin2  9).  Thus, 
using  the  complete  elliptic  integrals  in  (12.3),  we  can  write  (Problem  2) 


(12.4) 


F(mr  ±  cj),  k)  =  2nK  ±  F(<f>,  k ), 
E(mr  ±  (f>,  k)  =  2nE  ±  E(<fi,  k ). 


Since  k2  sin2  9  <  1  (for  k 2  <  1),  we  get  convergent  infinite  series  for  elliptic 
integrals  by  expanding  their  integrands  using  the  binomial  theorem,  and  then  inte¬ 
grating  term  by  term  (Problem  1).  For  small  k  these  series  converge  rapidly  and 
provide  a  good  method  for  approximating  elliptic  integrals  when  k  <C  1. 

Here  are  some  examples  where  elliptic  integrals  occur. 


Example  4.  Find  the  arc  length  of  an  ellipse.  This  is  the  problem  that  gave  elliptic 
integrals  their  name.  We  write  the  equation  of  the  ellipse  in  the  parametric  form 


x  =  a  shod, 
y  =  bcos9, 

for  the  case  a  >  b.  (If  b  >  a,  use  the  form  x  =  a  cos  9,  y  =  bsin9;  see  Problem  15.) 
Then  for  a  >  b,  we  have 

ds2  =  dx2  +  dy2  =  ( a 2  cos2  9  +  b2  sin2  9)  d92. 

Since  a2  —  b2  >  0,  we  can  write 

J  ds  =  J  \j a2  —  ( a 2  —  b2)  sin2  9  d9  =  a  J  \jl  —  —  ^  ^  sin2^  d9. 
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This  is  an  elliptic  integral  of  the  second  kind  where  k2  =  (a2  —  b2)/ a2  =  e2  (e  is  the 
eccentricity  of  the  ellipse  in  analytic  geometry) .  If  we  want  the  complete  circumfer¬ 
ence,  9  goes  from  0  to  27 r,  and  the  answer  is  AaEjn/2,  k )  =  4 aE(k).  For  a  smaller 
arc,  we  use  the  appropriate  limits  (J>i  and  </> 2  and  obtain  E{<j> 2,  A;)  —  E{<j>\,k).  For 
any  given  ellipse  (that  is,  given  a  and  6),  we  can  find  the  numerical  value  of  the 
desired  arc  length  from  computer  or  tables. 


Example  5.  Let  a  pendulum  swing  through  large  angles.  We  had  in  Section  8 


(12.5) 


2  9 


T  —  -j-  cos  9  +  const., 


and  we  considered  180°  swings,  that  is  of  amplitude  90°.  Now  we  want  to  consider 
swings  of  any  amplitude,  say  a;  then  9  =  0  when  9  =  a,  and  (12.5)  becomes 


(12.6)  92  =  ^j-{cos9  —  cosa). 

Integrating  (12.6),  we  get 


(12.7) 


d9 


Jo  \/cos9^cosa 


where  Ta  is  the  period  for  swings  from  — a  to  +ct  and  back.  This  integral  can  be 
written  as  an  elliptic  integral;  its  value  (Problem  17)  is 

(12.8)  V2K  (sin|j  . 

Then  (12.7)  gives  for  the  period 


For  a  not  too  large  (say  a  <  90°,  \a  <  45°,  so  that  sin2 (a/2)  <  1),  we  can  get  a 
good  approximation  to  Ta  by  series  (Problem  1): 


(12.9) 


For  a  small  enough  so  that  sina/2  can  be  approximated  by  a/2,  we  can  write 


(12.10) 


For  very  small  a,  we  get  the  familiar  formula  for  simple  harmonic  motion,  T  = 
27 T\Jl/g  independent  of  a.  For  somewhat  larger  a,  say  a  =  \  radian  (about  30°), 
we  get 


(12.11) 


This  would  mean  that  a  pendulum  started  at  30°  would  get  exactly  out  of  phase 
with  one  of  very  small  amplitude  in  about  32  periods. 

For  another  physics  problem  giving  rise  to  an  elliptic  integral,  see  Am.  J.  Phys.  55, 
763  (1987). 
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Elliptic  Functions  Recall  that 


dt 


=  sin 


-l 


defines  u  as  a  function  of  x ,  or  x  as  a  function  of  u;  in  fact  x  =  sin  u.  In  a  similar 
way,  u  =  F(<j>,  k)  in  (12.2)  defines  u  as  a  function  of  <j>  (or  of  x  =  sin</>)  or  it  defines 
x  or  (f>  as  functions  of  u  (we  are  assuming  k  fixed).  We  write 

rx  dt 

(12.12)  u=  z  =  sn-1  x. 

Jo  Vl  —  t2  \/\  —  k2t2 

or  x  =  sn  it.  The  function  snzt  (read  ess-en  of  u)  is  an  elliptic  function.  Since 
x  =  sin  we  have 


(12.13) 


i  =  snu=  sin  <j>. 


There  are  other  elliptic  functions,  related  to  snu;  you  will  notice  [in  (12.14)]  that 
they  have  some  resemblance  to  the  trigonometric  functions.  We  define 


(12.14) 


cn  u  =  cos  (f>  —  y  1  —  sin2  <t>  =  \/l  —  sn2  u  =  \/\  —  x2, 

dn  u  =  — -  =  — — y—  =  \/l  —  k2  sin2  tfr  =  \/l  —  k2  sn2  u  =  \/l  —  k2x2. 
du  du/ dip  v 


[The  value  of  du/d<f>  is  found  from  u  =  F((f> ,  k)  in  (12.2).]  There  are  many  formulas 
relating  these  functions — for  example,  addition  formulas,  integrals,  derivatives,  etc. 
These  can  be  looked  up  or,  in  some  cases,  easily  worked  out.  For  example,  since 
snit=  sin  (j>,  we  have 


—  (snu)  =  —  (sin<^)  =  cos  =  cnudntt. 
du  du  du 


For  a  physical  problem  using  elliptic  functions,  see  Am.  J.  Phys.  68,  888-895  (2000). 


PROBLEMS,  SECTION  12 

1.  Expand  the  integrands  of  K  and  E  [see  (12.3)]  in  power  series  in  k2  sin2  9  (assuming 
small  fc),  and  integrate  term  by  term  to  find  power  series  approximations  for  the 
complete  elliptic  integrals  K  and  E. 

2.  Use  a  graph  of  sin2  6  and  the  text  discussion  just  before  (12.4)  to  verify  the  equations 
(12.4).  Note  that  the  area  under  the  sin2  9  graph  from  0  to  7r/2  and  the  area  from 
7r/2  to  7r  are  mirror  images  of  each  other,  and  this  will  be  true  also  for  any  function 
of  sin2  9. 

3.  Computer  plot  graphs  of  K(k)  and  E{k)  in  (12.3)  for  k  from  0  to  1.  Also  plot  3D 
graphs  of  F{(j>,k )  and  E(cj),k)  in  (12.1)  for  k  from  0  to  1  and  <j>  from  0  to  n/2  and 
also  from  0  to  27r.  Warning:  Be  sure  you  understand  the  notation  used  by  your 
computer  program;  see  text  discussion  just  after  (12.3)  and  Example  1. 
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In  Problems  4  to  13,  identify  each  of  the  integrals  as  an  elliptic  integral  (see  Examples  1 
and  2).  Learn  the  notation  of  your  computer  program  (see  Problem  3)  and  then  evaluate 
the  integral  by  computer. 


/’ 
’  L 
<  l 


dt 


VI  ~t2  y/1-  t2/4 

n/3  de 


\/9  -  sin2  6 
' 7 5/2  V49  -  4t2 


8.  /  - ;  dt 


de 

\/ 4  —  sin2  9 

1/2  V100  -  t2 


■  L 

lo  \/l  —  l2 


dt 


'■  L'  f-We 


de 


/•57r/4  _ 

7.  /  ^25  -  sin2 

Jo 


-1/2  vT^f2  V4-3t2 
Svr/S  d0 


'  /_ 

■  L 

I- 1/2  vi  —  t2 


-V2  ^l-^sin^ 

3/4  V9  -  4t2  , 
dt 


14.  Find  the  circumference  of  the  ellipse  4x2  +  9i/2  =  36. 

15.  Find  the  length  of  arc  of  the  ellipse  x2  +  (j/2/4)  =  1  between  (0,2)  and  (^,\/3)- 
(Note  that  here  b  >  a;  see  Example  4.) 

16.  Find  the  arc  length  of  one  arch  of  y  =  sin  x. 

17.  Write  the  integral  in  equation  (12.7)  as  an  elliptic  integral  and  show  that  (12.8) 
gives  its  value.  Hints:  Write  cos#  =  1  —  2  sin2  (9/2)  and  a  similar  equation  for  cos  a. 
Then  make  the  change  of  variable  x  =  sin(0/2)/sin(a/2). 

18.  Computer  plot  graphs  of  snu,  enu,  and  dnu,  for  several  values  of  k,  say,  for  example, 
k  =  1/4,  1/2,  3/4,  0.9,  0.99.  Also  plot  3D  graphs  of  sn,  cn,  and  dn  as  functions  of 
u  and  k. 


19.  If  u  =  ln(sec0  +  tan  cp),  then  <f>  is  a  function  of  u  called  the  Gudermannian  of  u, 
(j>  =  gdu.  Prove  that: 

u  =  In  tan  (  —  +  —  )  ,  tan  gd  u  =  sinh  u ,  sin  gd  u  =  tanh  u,  4—  gd  u  =  sech  u. 

V  4  2 /  du 


20.  Show  that  for  k  =  0: 


u  =  0)  =  <f),  snu  =  sinu,  enu  =  cosu,  dnu  =  1; 

and  for  k  =  1: 


u  =  1)  =  ln(sec  (f>  +  tan  </)  or  <fi  =  gdu  (Problem  19), 

sn  u  =  tanh  u,  enu  =  dn  u  =  sech  u. 


Show  that  the  four  answers  given  in  Section  1  for  J/T/2  d9/V cos  9  are  all  correct. 
Hints:  For  the  beta  function  result,  use  (6.4).  Then  get  the  gamma  function  results 
by  using  (7.1)  and  the  various  P  function  formulas.  For  the  elliptic  integral,  use  the 
hint  of  Problem  17  with  a  =  tt/2. 


21. 
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22.  In  the  pendulum  problem,  6  =  a  sin  \fg]lt  is  an  approximate  solution  when  the 
amplitude  a  is  small  enough  for  the  motion  to  be  considered  simple  harmonic.  Show 
that  the  corresponding  exact  solution  when  a  is  not  small  is 

•  0  ■  a  m 

Sm2=Sm2Sn\lt 

where  k  =  sin(a/2)  is  the  modulus  of  the  elliptic  function.  Show  that  this  reduces 
to  the  simple  harmonic  motion  solution  for  small  amplitude  a. 

23.  A  uniform  solid  sphere  of  density  |  is  floating  in  water.  (Compare  Chapter  8,  Prob¬ 
lem  5.37.)  It  is  pushed  down  just  under  water  and  released.  Write  the  differential 
equation  of  motion  (neglecting  friction)  and  solve  it  to  obtain  the  period  in  terms  of 
K( 5-1'2).  Show  that  this  period  is  approximately  1.16  times  the  period  for  small 
oscillations. 

24.  Sometimes  you  may  find  the  notation  F(cf),k)  in  (12.2)  used  when  k  >  1.  Allowing 
this  notation,  show  that  |i',(sin~1  |,  |)  =  i.F(sin_1  |,  |).  Hints:  Using  the  Jacobi 
form  of  F  in  (12.2),  write  the  integral  which  is  equal  to  |.F(sin_1  |,  |).  Follow 
Example  3  to  make  a  change  of  variable,  write  the  corresponding  integral,  and 
verify  that  it  is  equal  to  |F’(sin_1  |,  |). 

25.  As  in  Problem  24,  show  that  ^.F(sin-1  jjr,  |)  =  |.F(sin_1  |,  |). 


13.  MISCELLANEOUS  PROBLEMS 


1.  Show  that 

fx  Vmdy  1 

Jo  (1  +  y)n+1  ( n-m)C(n,m ) 

for  positive  integral  m  and  n,  n  >  m,  where  C(n,  m)  = 

2.  Show  that  B(m,  n)B(m  +  n,  k)  =  B(n,  k)B{n  +  k,  m ). 

3.  Use  Stirling’s  formula  to  show  that 


lim  nx B(x,n)  =  T(x). 


4.  Verify  the  asymptotic  series 


r°°  e-t  dt 

Jo  (1  +  xt) 


^(-l)nn!af 


[see  equation  (10.8)].  Hint:  Integrate  by  parts  repeatedly,  integrating  e  1  dt  and 
differentiating  the  powers  of  (1  +  xt )-1. 


5. 

6. 


Use  gamma  and  beta  function  formulas  to  show  that 

dx 


f 

Jo 


dx 


(1  +  X)y/X 


Generalize  Problem  5  to  show  that 


f 

Jo 


(1  +  x)xp  sin7rp 


,  0  <  p  <  1. 


Identify  each  of  the  following  integrals  or  expressions  as  one  of  the  functions  of  this  chapter. 
Check  your  work  by  evaluating  both  your  answer  and  the  original  problem  by  computer. 
Be  sure  you  understand  your  computer  program’s  notation. 


3  —x  j 

x  e  dx 


8. 


/’ 


e 


dx 


9. 


Jo  V  1  -x2 


dx 


7. 
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10. 

r 37r/4  d(f> 

U-  J 

r  3/5 

dt 

J  —  7r/4  a/I  +  COS 2  (j) 

0  Vl-t2  /16  -  25 12 

12. 

r/2  dx 

Jo  'J‘1  —  sin2  x 

13. 

l(cnu) 

14. 

fe-^dx 

15. 

POO 

/  x5^2e~x  dx 

Jo 

16. 

r°°  2 

/  e  x  dx 

■J  —  oo 

17. 

f r/2  , - 

/  v  sin3  Q  cos5  6  dQ 

Jo 

18. 

e~xdx 

Jo  *1/4 

19. 

fe-*2dx 

20. 

rn/2 

/  (cos  x)5^2  dx 

Jo 

p7ir  /  8 

21. 

/  x-1/3(5-*)10/3 
Jo 

dx 

22‘  J 

f 

0 

sin2  x  dx 

23. 

Find  an  expression  for  the  exact  value  of  r(55.5)  in  terms  of  double  factorials  (!!), 
powers  of  2  and  For  !!,  see  Chapter  1,  Section  13C,  Example  2. 

24. 

Using  your  result  in 
value  of  F(— 54.5). 

Problem  23  and  equation  (5.4),  find  an  expression  for  the  exact 

25. 

As  in  problems  23  and  24, 
r(-27.5). 

find  expressions 

for  the  exact  values  of  T(28.5)  and 
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Series  Solutions  of  Differential 
Equations;  Legendre,  Bessel, 
Hermite,  and  Laguerre  Functions 


1.  INTRODUCTION 

By  now  you  are  well  aware  that  physical  problems  in  many  fields  lead  to  differential 
equations  to  be  solved.  In  Chapter  13,  we  will  discuss  a  variety  of  physical  prob¬ 
lems  which  lead  to  partial  differential  equations.  To  solve  them,  we  will  need  the 
solutions  of  some  ordinary  differential  equations  which  cannot  be  solved  in  terms 
of  elementary  functions.  So  in  this  chapter  we  will  learn  about  these  equations  and 
their  solutions.  However,  if  you  would  prefer  to  see  some  of  the  physics  before  you 
study  the  math,  and  if  you’ve  studied  Chapters  7  and  8,  you  could  first  do  Sections  1 
to  4  of  Chapter  13,  and  then  come  back  to  Chapter  12  to  learn  the  material  needed 
for  the  rest  of  Chapter  13.  (See  the  Preface.) 

Now  you  may  be  thinking  that  your  computer  will  give  you  the  solutions  of  these 
differential  equations  so  you  don’t  need  to  study  this.  What  your  computer  may 
give  you  is  the  name  of  a  function.  What  you  need  to  know  is  something  about  the 
function:  graphs;  formulas  for  derivatives  and  integrals;  formulas  that  correspond  to 
trigonometric  identities  for  sine  and  cosine  functions;  and  other  useful  information 
so  that  you  can  work  with  these  named  functions  which  occur  often  in  applications. 
This  is  what  we  will  discuss  in  this  chapter. 

The  differential  equations  we  are  going  to  solve  are  linear,  like  the  equations 
of  Chapter  8,  Section  5,  but  with  coefficients  which  are  functions  of  x  instead  of 
constants,  that  is,  of  the  form  y"  +  f(x)y'  +  g(x)y  =  0.  A  method  of  solving  such 
equations  which  we  will  find  useful  is  to  assume  an  infinite  series  solution. 

Example  1.  We  illustrate  the  method  of  series  solution  by  solving  the  following  simple 
equation  (which  you  can  easily  solve  by  elementary  methods  also!): 

(1.1)  y'  =  2  xy. 
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We  assume  a  solution  of  this  differential  equation  in  the  form  of  a  power  series, 
namely 


y  =  ao  +  a\x  +  a2x2  +  03a;3  +  •  •  •  +  anxn  +  •  •  • 


OO 


(1.2) 


n— 0 


n=  1 


We  substitute  (1.2)  and  (1.3)  into  the  differential  equation  (1.1);  we  then  have  two 
power  series  equal  to  each  other.  Now  the  original  differential  equation  is  to  be 
satisfied  for  all  values  of  x,  that  is,  y'  and  2xy  are  to  be  the  same  function  of  x. 
Since  a  given  function  has  only  one  series  expansion  in  powers  of  x  (see  Chapter  1, 
Section  11),  the  two  series  must  be  identical,  that  is,  the  coefficients  of  corresponding 
powers  of  x  must  be  equal.  We  get  the  following  set  of  equations  for  the  a’s: 

(1.4)  cq  =0,  a2  =  ao,  a3  =  |ai  =  0,  04  = 

or  in  general: 


odd  n. 


(1.5) 


even  n. 


Putting  n  =  2m  (since  only  even  terms  appear  in  this  series),  we  get 


1 


1  1 


1 


(1.6)  a2m  —  x — d2m-2  —  — a2m-2  — 
2  m  m 


d2m—4  —  ‘  ‘  '  —  fa 0- 

m\ 


mm  —  1 


Substituting  these  values  of  the  coefficients  into  the  assumed  solution  (1.2)  gives 
the  solution 


(1.7) 


m— 0 


Example  2.  Compare  this  with  the  solution  by  an  elementary  method  (in  this  case, 
separation  of  variables) : 


—  =2x  dx,  In  y  =  x2  +  In  c,  y  =  ce 
V 

Expanding  this  in  a  series  of  powers  of  x 2,  we  get: 


)X' 


,2 


which,  with  c  =  do,  is  the  same  as  the  series  solution  (1.7). 
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You  cannot  always  expect  to  find  the  closed  form  of  a  power  series  solution 
(that  is,  an  elementary  function  for  which  your  series  solution  is  the  power  series 
expansion),  but  in  simple  cases  you  may  recognize  it.  Of  course,  in  that  case,  the 
problem  could  have  been  done  without  series;  the  real  need  for  series  is  in  problems 
for  which  there  is  no  closed  form  in  terms  of  elementary  functions.  Also  you  should 
realize  that  not  all  solutions  have  series  expansions  in  powers  of  x,  for  example,  In  a; 
or  1/x2.  All  we  can  say  is  that  if  there  is  a  solution  which  can  be  represented  by  a 
convergent  power  series  this  method  will  find  it.  We  shall  discuss  later  (Section  21) 
some  theorems  which  tell  us  when  we  can  expect  to  find  such  a  solution. 

In  the  following  sections  we  consider  some  differential  equations  which  occur 
frequently  in  applied  problems  and  which  are  usually  solved  by  series  methods. 

PROBLEMS,  SECTION  1 

Solve  the  following  differential  equations  by  series  and  also  by  an  elementary  method 
and  verify  that  your  solutions  agree.  Note  that  the  goal  of  these  problems  is  not  to 
get  the  answer  (that’s  easy  by  computer  or  by  hand)  but  to  become  familiar  with 
the  method  of  series  solutions  which  we  will  be  using  later.  Check  your  results  by 
computer. 


1. 

xy'  =xy  +  y 

2. 

/ 

y  = 

=  3x2  y 

3. 

II 

^3 

4. 

// 

y 

=  -% 

5. 

// 

y  =y 

6. 

// 

y 

-2y'  +  y  =  0 

7. 

x2y"  —  3xy'  +  3y  =  0 

8. 

(• x 2 

+  2 x)y"  -  2(x  +  1  )y  +  2 y 

9. 

(a;2  +  1  )y"  -  2xy'  +  2y  =  0 

10. 

y" 

-  4a :y'  +  (4a;2  —  2 )y  =  0 

2.  LEGENDRE'S  EQUATION 


The  Legendre  differential  equation  is 

(2.1)  (1  —  x2)y"  —  2xy'  +  1(1  +  l)y  =  0, 


where  l  is  a  constant.  This  equation  arises  in  the  solution  of  partial  differential  equa¬ 
tions  in  spherical  coordinates  (see  Problem  10.2  and  Chapter  13,  Section  7)  and  so 
in  problems  in  mechanics,  quantum  mechanics,  electromagnetic  theory,  heat,  etc., 
with  spherical  symmetry.  Also  see  an  application  in  Section  5. 

Although  the  most  useful  solutions  of  this  equation  are  polynomials  (called  the 
Legendre  polynomials) ,  one  way  to  find  them  is  to  assume  a  series  solution  of  the 
differential  equation,  and  show  that  the  series  terminates  after  a  finite  number  of 
terms.  [There  are  other  ways  of  finding  the  Legendre  polynomials;  see  Sections  4 
and  5,  and  Chapter  3,  Section  14,  Example  6.]  We  assume  the  series  solution  (1.2) 
for  y  and  differentiate  it  term  by  term  twice  to  get  y '  and  y": 

{y  =  a0  +  aix  +  a2a:2  +  a3x3  +  04a:4  +  •  •  •  +  anxn  +  •  •  •  , 
y'  =  a\  +  2  a2X  +  3a3x2  +  40,4a;3  +  •  •  •  +  na„a;n_1  +  •  •  •  , 
y"  =  2o2  +  603a;  +  1204a;2  +  2005a;3  +  •  •  •  +  n[n  —  l)anxn~ 2  +  •  •  •  . 
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We  substitute  (2.2)  into  (2.1)  and  collect  the  coefficients  of  the  various  powers  of  x; 
it  is  convenient  to  tabulate  them  as  follows: 


const. 

X 

x2 

s3 

...  xn  ... 

y" 

2a2 

6  a3 

12(24 

20a.5 

(n  +  2  )(n+  l)an+2 

-x2y" 

—  2(22 

-6a3 

—71(71  —  l)an 

-2  xy' 

— 2ai 

-4a2 

-6a3 

—2nan 

1(1  + l)y 

/(/  +  l)a0 

l{l  +  l)fli 

1(1  +  1)^2 

1(1  +  1)03 

1(1  +  1  )an 

Next  we  set  the  total  coefficient  of  each  power  of  x  equal  to  zero  [because,  as 
discussed  in  Section  1,  y  must  satisfy  (2.1)  identically].  For  the  first  few  powers  of 
x  we  get 

2a2  +  1(1  +  l)ao  =  0  or  02 

6a3  +  (l2  +  l  —  2)ai  =0  or  03 

(2.3) 

12a4  +  (l2  +  l  —  6)02  =  0  or  04 


and  from  the  xn  coefficient  we  get 

(2.4)  (n  +  2)(n  +  l)on_j_2  +  (l“  +  l  —  ti2  —  u)on  =  0. 

The  coefficient  of  an  in  (2.4)  can  be  factored  to  give 

(2.5)  l2  —  n2  +  l  —  n  =  (l  +  n)(l  —  n)  +  (/  —  n)  =  (l  —  n){l  +  n  +  1). 


1(1+1 ) 

- 2 - °o; 

(i  -m+ 2) 

6  ai’ 

(l-2)(l  +  S) 

12  °2 

1(1  +  1)(Z  —  2)(Z  +  3) 
- - a0i 


Then  we  can  write  a  general  formula  for  an_|_2  in  terms  of  an.  This  formula  (2.6) 
includes  the  formulas  (2.3)  for  02,  <23,  and  04,  and  makes  it  possible  for  us  to  find 
any  even  coefficient  as  a  multiple  of  ag,  and  any  odd  coefficient  as  a  multiple  of  ai. 
Solving  (2.4)  for  an+ 2  and  using  (2.5),  we  have 


(2.6) 


fln+2  —  — 


(l  —  n)(l  +  n+l) 
(71 +  2)  (n+1) 


The  general  solution  of  (2.1)  is  then  a  sum  of  two  series  containing  (as  the  solu¬ 
tion  of  a  second-order  differential  equation  should)  two  constants  ao  and  ai  to  be 
determined  by  the  given  initial  conditions: 


(2.7) 


y  = 


1(1+1)  2  ,  Z(Z  +  l)(Z-2)(Z  +  3)  4 
2!  x  '  4,  * 

(l-l)(l  +  2)  3  ,  (Z-l)(Z  +  2)(Z-3)(Z  +  4) 

- - x  4 - r, - 


From  equation  (2.6)  you  can  see  by  the  ratio  test  that  these  series  converge  for 
x1  <  1.  It  can  be  shown  that,  in  general,  they  do  not  converge  for  x2  =  1. 
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Example.  Consider  the  a\  series  for  1  =  0.  If  x2  =  1,  this  series  is  1  +  |  +  -|  +  •  •  • ,  which 
is  divergent  by  the  integral  test  (Chapter  1,  Section  6B).  Now  in  many  applications 
x  is  the  cosine  of  an  angle  9,  and  l  is  a  (nonnegative)  integer.  We  want  a  solution 
which  converges  for  all  9 ,  that  is,  a  solution  which  converges  at  x  =  ±1  as  well 
as  for  |  a;  |  <  1.  We  can  always  find  one  (but  not  two)  such  solutions  when  l  is  an 
integer;  let  us  see  how. 

Legendre  Polynomials  We  have  seen  that  for  l  =  0  the  a-|  series  in  (2.7)  di¬ 
verges.  But  look  at  the  ao  series;  it  gives  just  y  =  ao  for  l  =  0  since  all  the  rest  of 
the  terms  contain  the  factor  l.  If  l  =  1,  the  ao  series  is  divergent  at  x2  =  1,  but  the 
a\  series  stops  with  y  =  a\X  [since  all  the  rest  of  the  terms  in  the  a\  series  contain 
the  factor  (l  —  1)].  For  any  integral  l,  one  series  terminates  giving  a  polynomial 
solution;  the  other  series  is  divergent  at  x2  =  1.  (Negative  integral  values  of  l  would 
simply  give  solutions  already  obtained  for  positive  V s;  for  example,  l  =  —2  gives 
the  polynomial  solution  y  =  a\X  which  is  the  same  as  the  l  =  1  solution.  Conse¬ 
quently,  it  is  customary  to  restrict  l  to  nonnegative  values.)  Thus  we  obtain  a  set  of 
polynomial  solutions  of  the  Legendre  equation,  one  for  each  nonnegative  integral  l. 
Each  solution  contains  an  arbitrary  constant  factor  (ao  or  ai);  for  l  =  0,  y  =  ao ;  for 
l  =  1,  y  =  a\X,  and  so  on.  If  the  value  of  ao  or  ai  in  each  polynomial  is  selected  so 
that  y  =  1  when  x  =  1,  the  resulting  polynomials  are  called  Legendre  Polynomials , 
written  P;(x).  From  (2.6)  and  (2.7)  and  the  requirement  P;(  1)  =  1,  we  find  the 
following  expressions  for  the  first  few  Legendre  polynomials: 


(2.8)  Po(x)  =  1,  Pi{x)=x,  P2(x)  =  ±(3x2-l). 


Finding  a  few  more  Legendre  polynomials  by  this  method  and  other  methods  will 
be  left  to  the  problems.  Although  P;(x)  for  any  integral  l  may  be  found  by  this 
method,  simpler  ways  of  obtaining  the  Legendre  polynomials  for  larger  l  will  be 
outlined  in  Sections  4  and  5.  Of  course,  if  you  just  want  the  formula  for  a  particular 
Pi,  you  can  find  it  by  computer  or  in  reference  books. 

Eigenvalue  Problems  In  finding  the  Legendre  polynomials  as  solutions  of  Leg¬ 
endre’s  equation  (2.1),  we  have  solved  an  eigenvalue  problem.  (See  Chapter  3, 
Sections  11  and  12.)  Recall  that  in  an  eigenvalue  problem  we  are  given  an  equation 
or  a  set  of  equations  containing  a  parameter,  and  we  want  solutions  that  satisfy 
some  special  requirement;  in  order  to  obtain  such  solutions  we  must  choose  partic¬ 
ular  values  (called  eigenvalues)  for  the  parameter  in  the  problem.  In  finding  the 
Legendre  polynomials,  we  asked  for  series  solutions  of  Legendre’s  equation  (2.1) 
which  converged  at  x  =  ±1.  We  saw  that  we  could  obtain  such  solutions  if  the 
parameter  took  on  any  integral  value.  The  values  of  l,  namely  0,  1,  2,  •  •  ■ ,  are 
called  eigenvalues  (or  characteristic  values );  the  corresponding  solutions  P/(x)  are 
called  eigenfunctions  (or  characteristic  functions) . 

Note  the  parallel  between  the  eigenvalue-eigenvector  problems  of  Chapter  3  and 
the  eigenvalue-eigenfunction  problems  of  this  chapter.  Recall  that  in  Chapter  3,  we 
wrote  an  eigenvalue  equation  as  Mr  =  Ar  where  M  was  a  matrix  operator  which 
operated  on  the  eigenvector  r  to  produce  a  multiple  of  r.  The  Legendre  equation 
is  of  the  form  f(D)y(x)  =  1(1  +  l)y(x)  where  f(D)  is  a  differential  operator  which 
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operates  on  the  eigenfunction  y(x)  to  produce  a  multiple  of  y(x).  See  Section  22 
and  Chapter  13  for  further  examples  of  differential  equations  whose  solutions  are 
eigenfunctions. 

The  Legendre  polynomials  are  also  called  Legendre  functions  of  the  first  kind. 
The  second  solution  for  each  /,  which  is  an  infinite  series  (convergent  for  x2  <  1), 
is  called  a  Legendre  function  of  the  second  kind  and  is  denoted  by  Qi(x )  (See 
Problem  4.)  The  functions  Qi(x)  are  not  used  as  frequently  as  the  polynomials 
Pi(x).  For  fractional  l  both  solutions  are  infinite  series;  these  again  occur  less 
frequently  in  applications. 


PROBLEMS,  SECTION  2 

1.  Using  (2.6)  and  (2.7)  and  the  requirement  that  Pi(l)  =  1,  find  P2{x),  P3(x),  and 
Pi[x).  Check  your  results  by  computer. 

2.  Show  that  P;(— 1)  =  (— 1)L  Hint:  When  is  Pi(x)  an  even  function  and  when  is  it 
an  odd  function? 

3.  Computer  plot  graphs  of  Pi(x)  for  l  =  0,1,  2,  3,  4,  and  x  from  —1  to  1. 

4.  Use  the  method  of  reduction  of  order  [Chapter  8,  Section  7(e)]  and  the  known 
solution  Pi  ( x )  of  Legendre’s  equation  to  find  the  second  solution  Qi  (x)  (in  terms  of 
an  integral).  Evaluate  the  integral  for  the  cases  l  =  0  and  l  =  1  to  find  Q o  and  Q i. 
Note  the  divergence  of  the  logarithms  at  x  =  ±1.  Expand  the  logarithms  in  Q o  to 
get  the  divergent  series  mentioned  above  [ai  series  in  (2.7)  with  l  =  0,  x2  =  1]. 


3.  LEIBNIZ'  RULE  FOR  DIFFERENTIATING  PRODUCTS 

Let  us  digress  for  a  moment  to  discuss  a  very  useful  formula  called  Leibniz  rule 
for  finding  a  high  order  derivative  of  a  product.  We  shall  first  illustrate  this  by  a 
numerical  example.  We  could,  of  course,  do  a  numerical  problem  by  computer,  but 
our  purpose  is  to  understand  the  general  formula  which  we  will  need  in  derivations. 
Also,  when  you  know  Leibniz  rule,  you  may  find  in  simple  numerical  cases  that  you 
can  write  down  the  answer  for  a  high  order  derivative  of  a  product  faster  than  you 
can  type  the  problem  into  the  computer  (see  Problems  2  to  5). 


Example.  Find  (d9 /dx9)(xsinx). 

Leibniz  rule  says  that  the  answer  is 


(3.1) 


d 9  .  d  .  .  d8  .  9  •  8  d2  ,  d7  ,  .  . 

x) + 9s(x)  i^{am  x) + (l)  +  ■  ■  ■  ■ 


This  should  remind  you  of  a  binomial  expansion 


(a  +  b)9  =  a°b9  +  9  ab8  +  ^ a2b 7  +  ■  •  •  . 

The  coefficients  in  (3.1)  are,  in  fact,  binomial  coefficients,  and  the  sum  of  the  orders 
of  the  two  derivatives  in  each  term  is  9.  (You  may  find  the  second  hint  in  Problem  6 
useful  in  understanding  and  remembering  this.)  Now  if  it  happens,  as  here,  that 
the  derivatives  of  one  factor  become  zero  after  the  first  few,  the  rule  saves  much 
work.  In  (3.1),  (d2 /dx2)(x)  =  0  and  all  higher  derivatives  of  x  are  zero  so  we  get 
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PROBLEMS,  SECTION  3 

1.  By  Leibniz’  rule,  write  the  formula  for  (dn /dxn)(uv). 

Use  Problem  1  to  find  the  following  derivatives. 

2.  (d10 / dx10)(xex)  3.  (d6/dx6)(x2  sinx) 

4.  d25  /dx25)(x  cosx)  5.  d100 /dx100)(x2e~x) 

6.  Verify  Problem  1.  Hints:  One  method  is  to  use  mathematical  induction.  Another 
method  is  to  write 

—  (uv)  =  D(uv)  =  (Du  +  Dv)(uv), 
dx 

where  Du  acts  only  on  u  and  Dv  acts  only  on  v,  that  is,  Du(uv)  means  v(du/dx ), 
etc.  Then 

uv )  =  (Du  +  Dv)n(uv). 

Expand  (Du  +  Dv)n  by  the  binomial  theorem  and  interpret  the  terms  to  get  Leibniz’ 
rule. 


4.  RODRIGUES' FORMULA 

We  have  obtained  the  Legendre  polynomials  as  solutions  of  Legendre’s  equation 
when  l  is  an  integer;  there  are  other  ways  of  obtaining  them.  We  shall  prove  that 
Rodrigues’  formula 


1  dl  n  7 

(4.1) 

-1)' 

gives  correctly  the  Legendre  polynomials  Pi(x).  There  are  two  parts  to  the  proof. 
First  we  show  that  if 


(4.2)  v=(x2-l)\ 

then  dlv/dxl  is  a  solution  of  Legendre’s  equation;  then  we  show  that  Pi(  1)  =  1  in 
(4.1).  To  prove  the  first  part,  find  dv/dx  in  (4.2)  and  multiply  it  by  x2  —  1: 

(4.3)  (x2  —  l)-y-  =  (x2  —  1  )l(x2  —  1)/_1  •  2x  =  2  Ixv. 

dx 

Differentiate  (4.3)  l  +  1  times  by  Leibniz’  rule: 


(4.4) 


(x2  -  1) 


dl+2v 

dxl+2 


(l  +  l)(2a;) 


dl+1v  (1  +  1)1 


dxl+1 


2! 


=  2  lx 


dlv 
dx1 
dl+1v 


dxl+l 


2 1(1- 


1)  —  . 
dx1 


Simplifying  (4.4),  we  get  (Problem  1) 
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This  is  just  Legendre’s  equation  (2.1)  with  y  =  dlv/dxl;  thus  we  see  that  dlv/dxl  = 
(dl / dxl)(x2  —  l)z  is  a  solution  of  Legendre’s  equation  as  we  claimed.  It  is  a  poly¬ 
nomial  of  degree  Z,  and  since  we  have  previously  called  the  polynomial  solution  of 
degree  /  the  Legendre  polynomial  Pi  (x) ,  this  must  be  it  with  the  possible  exception 
of  the  numerical  factor  which  must  give  P;(l)  =  1.  A  simple  method  of  showing 
that  Pi(  1)  =  1  for  the  functions  Pi{x)  in  (4.1)  is  outlined  in  Problem  2. 

PROBLEMS,  SECTION  4 

1.  Verify  equations  (4.4)  and  (4.5). 

2.  Show  that  P;(l)  =  1,  with  Pi(x)  given  by  (4.1),  in  the  following  way.  Factor  (x2  —  l)1 
into  (x  +  l)1  (x  —  l)1  and  differentiate  the  product  l  times  by  Leibniz’  rule.  Without 
writing  out  very  many  terms  you  should  see  that  every  term  but  one  contains  the 
factor  x  —  1  and  so  becomes  zero  when  x  =  1.  Use  this  to  evaluate  Pi(x)  in  (4.1) 
when  x  =  1  to  get  Pi(l)  =  1. 

3.  Find  Po(i),  Pi(ie),  P2(x),  Ps(x),  and  Pa{x)  from  Rodrigues’  formula  (4.1).  Check 
your  results  by  computer. 

4.  Show  that  f*  xmPi(x)  dx  =  0  if  m  <  l.  Hint:  Use  Rodrigues’  formula  (4.1)  and 
integrate  repeatedly  by  parts,  differentiating  the  power  of  x  and  integrating  the 
derivative  each  time. 


5.  GENERATING  FUNCTION  FOR  LEGENDRE  POLYNOMIALS 


The  expression 


(5.1)  4>(a;,  h)  =  (1  —  2xh  +  h2)  1^2,  |h|  <  1, 


is  called  the  generating  function  for  Legendre  polynomials.  We  shall  show  that 


OO 

(5.2)  4>(a;,  h)  =  Pq(x)  +  hPi(x)  +  h2P2{x)  H - =  ^  hlPi(x), 

i= o 


where  the  functions  Pi(x)  are  the  Legendre  polynomials.  (For  discussion  of  conver¬ 
gence  of  the  series,  see  Chapter  14,  Problem  2.43.)  Let  us  first  verify  a  few  terms 
of  (5.2).  For  simplicity  put  2 xh  —  h2  =  y  into  (5.1),  expand  (1  —  y)-1'2  in  powers 
of  y,  then  substitute  back  y  =  2xh  —  h2  and  collect  powers  of  h  to  get 

$  =  (i  -  v)~1/2  =  1  +  h  +  fy 2  +  •  •  • 

=  1  +  ^(2  xh  —  h2)  +  |(2  xh  —  h2)2  +  ••• 

=  1  +  xh  —  \h2  +  |  (4  x2h2  —  Axh 3  +  h 4)  +  ••• 

=  1  +  xh  +  h2{ |s2  —  |)  H - 

=  Pq(x)  +  hPi(x)  +  h2P2(x)  -\ - . 


(5.3) 
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This  is  not  a  proof  that  the  functions  called  Pi(x)  in  (5.2)  are  really  Legendre 
polynomials,  but  merely  a  verification  of  the  first  few  terms.  To  prove  in  general 
that  the  polynomials  called  Pi{x)  in  (5.2)  are  Legendre  polynomials  we  must  show 
that  they  satisfy  Legendre’s  equation  and  that  they  have  the  property  P/(l)  =  1. 
The  latter  is  easy  to  prove;  putting  x  =  1  in  (5.1)  and  (5.2),  we  get 

(5  *(1,  h)  =  (l-2h  +  h2)-1'2  =  =  1  +  h  +  h2  +  --- 

=  P0(1)+Pi(l)h  +  P2(l)h2  +  ---  . 


Since  this  is  an  identity  in  h,  the  functions  Pi(x)  in  (5.2)  have  the  property  P;(l)  =  1. 
To  show  that  they  satisfy  Legendre’s  equation,  we  shall  use  the  following  identity 
which  can  be  verified  from  (5.1)  by  straightforward  differentiation  and  some  algebra 
(Problem  2): 

„  <92$  d2 

(5.5,  +  = 

Substituting  the  series  (5.2)  for  into  (5.5),  we  get 

oo  oo  oo 

(5.6)  (1  -  x2)  J2  hlP"{x)  ~2xJ2  hlP[(x)  +  ^  l{l  +  1  )hlPi{x)  =  0. 

1=0  1=0  1=0 

This  is  an  identity  in  h ,  so  the  coefficient  of  each  power  of  h  must  be  zero.  Setting 
the  coefficient  of  hl  equal  to  zero,  we  get 

(5.7)  (1  -  x2)P”(x)  -  2 xP[(x)  +  1(1  +  l)Pz(x)  =  0. 

This  is  Legendre’s  equation,  so  we  have  proved  that  the  functions  Pi(x)  in  (5.2) 
satisfy  it  as  claimed. 


Recursion  Relations  The  generating  function  is  useful  in  deriving  the  recursion 
relations  (also  called  recurrence  relations)  for  Legendre  polynomials.  These  recur¬ 
sion  relations  are  identities  in  x  and  are  used  (as  trigonometric  identities  are)  to 
simplify  work  and  to  help  in  proofs  and  derivations.  Some  examples  of  recursion 
relations  are: 


We  shall  now  derive  (5.8a);  the  problems  outline  derivations  of  the  other  equations. 
From  (5.1)  we  get 


=  —^(1  —  2 xh  +  h2)  3/2(—2x  +  2 h)\ 
(1  —  2xh  +  =  (x  —  hy b. 


(5.9) 


<9$ 

~dh. 
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Substituting  the  series  (5.2)  and  its  derivative  with  respect  to  h  into  (5.9),  we  get 


OO 


OO 


(1  —  2 xh  +  h2)  ^2  1Pl{x)  =  {x  —  h)  ^2  hlPi(x). 


This  is  an  identity  in  h  so  we  equate  coefficients  of  hl  1 .  Carefully  adjusting  indices 
so  that  we  select  the  term  in  hl_1  each  time,  we  find 

(5.10)  IP^x)  -  2 x(l  -  l)Pi-i(x)  +{l-  2 )P/_2(x)  =  xPi- 1(®)  -  Pi-2(x) 


which  simplifies  to  (5.8a).  The  recursion  relation  (5.8a)  gives  the  simplest  way 
of  finding  any  Legendre  polynomial  when  we  know  the  Legendre  polynomials  for 
smaller  l  (Problem  3). 

Expansion  of  a  Potential  The  generating  function  is  useful  in  problems  involv¬ 
ing  the  potential  associated  with  any  inverse  square  force.  Recall  that  the  gravi¬ 
tational  force  between  two  point  masses  separated  by  a  distance  d  is  proportional 
to  1/d2  and  the  associated  potential  energy  is  proportional  to  1/d.  Similarly,  the 
electrostatic  force  between  two  electric  charges  a  distance  d  apart  is  proportional 
to  1  / d2  and  the  associated  electrostatic  potential  energy  is  proportional  to  1  / d. 

Example  1.  In  either  case  we  can  write  the  potential  as 


(5.11) 


where  K  is  an  appropriate  constant.  In  Figure  5.1,  let  the  two  masses  (or  charges) 
be  at  the  heads  of  vectors  r  and  R.  Then,  by  the  law  of  cosines,  the  distance 
between  them  is 


d=  |R-r| 


=  \J  R2  —  2  Rr  cos  9  +  r2 


(5.12) 


Figure  5.1 


and  the  gravitational  or  electric  potential  is 


-1/2 


(5.13) 


For  |r|  <  |R|,  we  make  the  change  of  variables 


(5.14) 


x  =  cos  0. 


(Note:  x  is  not  a  coordinate  but  just  a  new  variable  standing  for  cos  9.)  Then  in 
terms  of  the  generating  function  4>  in  (5.1)  we  have 


d  =  R\/l  —  2  hx  +  h 2 


V=I2-(l-2  hx  +  h2)-1/2  =  ^. 
Jti  li 


(5.15) 
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Using  (5.2),  we  can  write  the  potential  V  as  an  infinite  series 

K  00 

(5.16)  V=-Y,hlpi{x) 

n  ;=o 

or  in  terms  of  r  and  9  [using  (5.14)] 

/C1^  t  t  K  rlPt(cos9)  rl  Pi  (cos  9) 

(5.17)  g,  g+i  - 

1=0  1=0 

In  many  applications  the  distance  |R|  is  much  larger  than  jrj.  Then  the  terms  of 
the  series  (5.17)  decrease  rapidly  in  magnitude  because  of  the  factor  ( r/R )l,  and 
the  potential  can  be  approximated  by  using  only  a  few  terms  in  the  series. 

We  can  make  (5.17)  more  general  and  useful  by  considering  the  following  prob¬ 
lem.  (We  shall  discuss  the  electrical  case  for  definiteness — the  gravitational  case 
could  be  discussed  in  parallel  fashion.) 


Example  2.  Suppose  there  are  a  large  number  of  charges  qi  at  points  ry.  The  electrostatic 
potential  Vi  at  the  point  R  due  to  the  charge  qi  at  ry  means  the  electrostatic 
potential  energy  of  a  pair  of  charges,  namely,  a  unit  charge  at  R  and  the  charge 
qi  at  r?:;  this  is  given  by  (5.11)  and  (5.12),  or  by  (5.17),  with  r  =  ry,  9  =  9i ,  and 
K  =  qi  •  1  •  K ',  where  K'  is  a  numerical  constant  depending  on  the  choice  of  units: 


(5.18) 


Vi  =  K' qi  Y, 
1=0 


r\Pi  (cos  9i) 
Rl+l 


The  total  potential  V  at  R  due  to  all  the  charges  qi  is  then  a  sum  over  i  of  all  the 
series  (5.18),  namely 


(5.19) 


i 


oo 


1=0 


r\ Pi  (cos  9i) 
R l+l 


Ei^PKcosflO 
K  /L - fli+i - 


Example  3.  If,  instead  of  a  set  of  discrete  charges,  we  have  a  continuous  charge  distribu¬ 
tion,  then  the  sum  over  i  becomes  an  integral,  namely 


(5.20) 


Pi  (cos  9)  dq 


or 


Pi(cos9)pdr, 


where  p  is  the  charge  density,  and  the  integral  is  over  the  space  occupied  by  the 
charge  distribution.  Then  (5.19)  becomes 

(5.21)  ^  =  K'Y^+ijJj [rlPi(cos9)pdr. 

The  terms  of  the  series  (5.21)  can  be  interpreted  physically.  The  l  =  0  term  is 

(5.22)  JJJ  pdr  =  ■  (total  charge). 

Thus  if  R  is  large  enough  compared  to  all  the  vy  or  all  the  values  of  r  at  points 
of  the  charge  distribution,  we  can  approximate  the  potential  of  the  distribution  as 
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that  of  a  single  charge  at  the  origin  of  magnitude  equal  to  the  total  charge  of  the 
distribution.  The  l  =  1  term  of  the  series  (5.21)  is 


(5.23) 


1 

B? 


r  cos  9  p  dr. 


To  interpret  this  recall  that  the  electric  dipole  moment  of  a  pair  +<? 

of  charges  +q  and  — q  a  distance  d  apart  (as  in  Figure  5.2)  is  Ia 

defined  as  the  vector  qd.  where  d  is  the  vector  from  —q  to  +q.  o  ~2 

Since  the  vector  qd  is  equal  to  q( rq  —  r2)  =  qri  —  qr2,  we  often 

call  qri  and  —qr2  the  dipole  moments  of  +q  and  —q  about  O;  Figure  5.2 

then  the  total  dipole  moment  due  to  the  two  charges  is  just  the 

sum  of  the  two  moments.  Suppose  we  calculate  the  dipole  moment  about  O  of  all 

the  charges  qp.  this  is  the  vector  sum  )T)?:  q^i  cos  9i,  since  Oi  is  the  angle  between  R 

and  r,.  In  the  case  of  a  continuous  charge  distribution  this  sum  becomes 


(5.24) 


r  cos  0  p  dr. 


Thus  we  see  from  (5.23)  and  (5.24)  that  the  second  term  of  the  series  (5.21)  is 
1  / R2  times  the  component  in  the  R  direction  of  the  dipole  moment  of  the  charge 
distribution.  If  you  consider  the  fact  that  the  first  term  of  (5.21)  involves  the  total 
charge  (a  scalar,  that  is,  a  tensor  of  rank  zero)  and  the  second  term  involves  the 
dipole  moment  (a  vector,  that  is,  a  tensor  of  rank  one),  it  may  not  surprise  you 
to  learn  that  the  third  term  involves  a  2nd-rank  tensor  known  as  the  quadrupole 
moment  of  the  charge  distribution,  the  fourth  term  involves  a  3rd-rank  tensor  known 
as  the  octopole  moment,  etc.  (See  Problem  15  for  more  detail.) 


Example  4.  Given  a  charge  or  mass  distribution,  the  moments  of  various  ranks  and  the 
terms  in  (5.21)  can  be  computed.  The  opposite  process  is  often  of  great  interest 
in  applied  problems.  Consider  a  satellite  circling  the  earth;  it  is  moving  in  the 
gravitational  field  of  the  earth’s  mass.  If  the  mass  distribution  of  the  earth  were 
spherically  symmetric,  then  only  the  first  term  would  appear  in  the  series  for  the 
gravitational  potential  [this  series  would  be  (5.21)  with  p  a  mass  density  instead 
of  a  charge  density].  But  since  the  earth  is  not  a  perfect  sphere  (equatorial  bulge, 
etc.),  other  terms  are  present  in  (5.21)  and  the  corresponding  forces  affect  the 
motion  of  satellites.  From  accurate  measurements  of  the  satellite  orbits,  it  is  now 
possible  to  calculate  many  terms  of  the  series  (5.21).  Similarly,  in  the  electrical  case, 
experimental  measurements  give  us  information  about  the  distribution  of  electric 
charge  inside  atoms  and  nuclei;  our  discussion  here  and  equation  (5.21)  provide 
the  basis  for  the  interpretation  of  such  measurements,  and  the  terminology  used  in 
discussing  them. 


PROBLEMS,  SECTION  5 

1.  Find  7*3(2:)  by  getting  one  more  term  in  the  generating  function  expansion  (5.3). 

2.  Verify  (5.5)  using  (5.1). 

Use  the  recursion  relation  (5.8a)  and  the  values  of  Pq(x)  and  Pi(x)  to  find  P2( x), 
7*3(2;),  7*4(2;),  7*5(2;),  and  7*6(2:).  [After  you  have  found  7*3(2;),  use  it  to  find  7*4(2:), 
and  so  on  for  the  higher  order  polynomials.] 
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4.  Show  from  (5.1)  that 


{x~h)d^  =  hdh- 


Substitute  the  series  (5.2)  for  <E>,  and  so  prove  the  recursion  relation  (5.8b). 

5.  Differentiate  the  recursion  relation  (5.8a)  and  use  the  recursion  relation  (5.8b)  with 
l  replaced  by  l  —  1  to  prove  the  recursion  relation  (5.8c). 


6.  From  (5.8b)  and  (5.8c),  obtain  (5.8d)  and  (5.8f).  Then  differentiate  (5.8d)  with 
respect  to  x  and  eliminate  P/_1(a:)  using  (5.8b).  Your  result  should  be  the  Legendre 
equation.  The  derivation  of  Problems  4  to  6  constitutes  an  alternative  proof  [to  that 
of  equations  (5.5)  to  (5.7)]  that  the  functions  Pi  ( x )  in  (5.2)  are  Legendre  polynomials. 


7.  Write  (5.8c)  with  l  replaced  by  l  +  1  and  use  it  to  eliminate  the  xPl(x)  term  in 
(5.8b).  You  should  get  (5.8e). 


Express  each  of  the  following  polynomials  as  linear  combinations  of  Legendre  polynomials. 
Hint:  Start  with  the  highest  power  of  x  and  work  down  in  finding  the  correct  combination. 

8.  5  —  2*  9.  3a:2  +  x  —  1  10.  x 4 

11.  x  —  x3  12.  7x4  —  3a:  +  1  13.  x5 

14.  Show  that  any  polynomial  of  degree  n  can  be  written  as  a  linear  combination  of 
Legendre  polynomials  with  l  <  n. 

15.  Expand  the  potential  V  =  K/d  in  (5.11)  in  the  following  way  in  order  to  see  how 
the  terms  depend  on  the  tensors  mentioned  above.  In  Figure  5.1  let  R  have  the 
coordinates  X,  Y,  Z  and  r  have  coordinates  x,  y,  z.  [Note:  The  coordinate  x  here 
is  not  the  x  in  (5.14).]  Then 

V  =  ^  =  K[(X  -  x)2  +  (Y  —  y)2  +  (Z-  zf\~1/2. 

Consider  X,  Y,  Z  as  constants  and  expand  V  (x,  y,  z )  in  a  three  variable  power  series 
about  the  origin.  (See  Chapter  4,  Section  2,  for  discussion  of  two-variable  power 
series  and  generalize  the  method.)  You  should  find 


T  _  K  K  (X 
-  R  +  R?  \RX  +  '" 


K 
+  JP 


3  X2 

2R2 


-^U2  +  - 


3  X  Y 

■  H - 2xy  +  ■ 

2  R  R  y 


+  ■ 


and  similar  terms  in  y,  z,  y2,  xz,  and  so  on.  Now  letting  r  =  ri  and  K  =  K'qi 
for  a  charge  distribution  as  in  (5.18),  and  summing  (or  integrating)  over  the  charge 
distribution,  show  that:  the  first  term  is  just  (K' /R)  ■  total  charge;  the  next  group 
of  terms  (in  x,  y,  z)  involve  the  three  components  of  the  electric  dipole  moment;  the 
sum  of  these  terms  is  (K' / R2)  ■  component  of  the  dipole  moment  in  the  R  direction; 
the  next  group  (quadratic  terms)  involve  six  quantities  of  the  form 


JJf^pdr 
Iff  2  xypdr 


and  similar  y,  z  integrals, 
and  similar  xz,  yz  integrals. 


If  we  split  the  2 xy  term  into  xy  and  yx  (and  similarly  for  the  2 xz  and  2 yz  terms),  we 
have  the  nine  components  of  a  2nd-rank  tensor  called  the  quadrupole  moment.  Use 
the  “direct  product”  method  of  Chapter  10,  Section  2  to  show  that  it  is  a  2nd-rank 
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tensor.  (Remember  from  Chapter  10  that,  by  definition,  x,  y,  z  are  the  components 
of  a  vector,  that  is,  a  1  -rank  tensor.)  Just  as  two  charges  +q  and  —q  form  an 
electric  dipole,  so  four  charges  like  this  ^22+  form  an  electric  quadrupole  and  the 
quadratic  terms  in  the  V  series  give  the  potential  of  such  a  charge  configuration. 
Again  using  Chapter  10,  Section  2,  show  that  the  third-order  terms  in  x,  y,  z  form 
a  3rd-rank  tensor;  this  is  known  as  the  octopole  moment.  It  can  be  represented 
physically  by  two  quadrupoles  side  by  side  just  as  the  quadrupole  above  was  formed 
by  two  dipoles  side  by  side. 

6.  COMPLETE  SETS  OF  ORTHOGONAL  FUNCTIONS 

Orthogonal  functions  Two  vectors  A  and  B  are  orthogonal  (perpendicular)  if 

their  scalar  product  is  zero,  that  is,  if 

(6.1)  J2AiBi  =  0- 

i 

[See  Chapter  3,  equations  (4.12)  and  (10.3).]  Recall  from  Chapter  3,  Section  14, 

that  we  can  think  of  functions  as  elements  of  a  vector  space.  Then  by  analogy  with 

(6.1)  we  say  that  two  functions  A(x)  and  B( x)  are  orthogonal  on  (a,  b )  if 

(6.2)  f  A(x)B(x)  dx  =  0. 

J  a 

If  the  functions  A(x)  and  B[x)  are  complex,  the  definition  of  orthogonality  is  [see 

Chapter  3,  equation  (14.3)] 


A(x)  and  B(x)  are  orthogonal  on  (a,  b)  if 


(6.3) 


[  A*(x)B(x)  dx  =  0, 

J  a 


where  A*{x)  is  the  complex  conjugate  of  A(x)  (see  Problem  1). 


Since  (6.3)  is  identical  with  (6.2)  if  A(x)  and  B(x)  are  real,  we  can  take  (6.3)  as 
the  general  definition  of  orthogonality  of  A(x)  and  B(x)  on  (a,  b). 

If  we  have  a  whole  set  of  functions  An[x)  where  n=l,2,3,---,  and 


(6.4) 


A*  (x)Am(x)  dx 


0  if  m  /  n, 

const.  ^  0  if  m  =  n, 


we  call  the  functions  An{x)  a  set  of  orthogonal  functions.  We  have  already  used 
such  sets  of  functions  in  Fourier  series.  Recall  that  [Chapter  7,  equation  (5.2)] 


(6.5) 


sin  nx  sin  mx  dx 


0  if  m  /  n, 

7 r  if  m  =  n  yf  0. 


Thus  sinna;  is  a  set  of  orthogonal  functions  on  (— 7r,7r),  or  in  fact  on  any  other 
interval  of  length  2-7T.  Similarly,  the  functions  cos  nx  are  orthogonal  on  (— 7r,  7r). 
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Also  the  whole  set  consisting  of  sin  nx  and  cos  nx  is  a  set  of  orthogonal  functions 
on  (— 7T,  7r)  since 


sin  nx  cos  mx  dx  =  0 


for  any  n  and  to. 


We  have  used  complex  functions  also,  namely  the  set  emx.  For  this  set  the  orthog¬ 
onality  property  is  given  by  (6.4),  namely 


(6.6) 


e-iuxeimx  dx  = 


if  to  yf  n, 
if  m  =  n. 


Recall  that  sin  tot  and  cos  nx  (or  elnx )  were  the  functions  used  in  a  Fourier  se¬ 
ries  expansion  on  (— 7r,  n).  You  should  now  realize  that  it  was  the  orthogonality 
property  that  we  used  in  getting  the  coefficients.  When  we  multiplied  the  equation 
f(x )  =  X!m=-oo  Cm.elmx  by  e~mx  and  integrated,  the  integrals  of  all  the  terms  in 
the  series  except  the  cn  term  were  zero  by  the  orthogonality  property  (6.6).  There 
are  many  other  sets  of  orthogonal  functions  besides  the  trigonometric  or  exponential 
ones.  Just  as  we  used  the  sine-cosine  or  exponential  set  to  expand  a  function  in  a 
Fourier  series,  so  we  can  expand  a  function  in  a  series  using  other  sets  of  orthogonal 
functions.  We  shall  show  this  for  the  functions  Pi(x)  after  we  prove  that  they  are 
orthogonal. 


Complete  sets  There  is  another  important  point  to  consider  when  we  want  to 
expand  a  function  in  terms  of  a  set  of  orthogonal  functions.  Again  let  us  consider 
the  vector  analogy.  We  write  vectors  in  terms  of  their  components  and  the  basis 
vectors  i,  j,  k.  In  two  dimensions  we  need  only  two  basis  vectors,  say  i  and  j. 
But  if  we  tried  to  write  three-dimensional  vectors  in  terms  of  just  i  and  j,  there 
would  be  some  vectors  we  could  not  represent;  we  say  that  (in  three  dimensions) 
i  and  j  are  not  a  complete  set  of  basis  vectors.  A  simple  way  of  expressing  this 
(which  generalizes  to  n  dimensions)  is  to  say  that  there  is  another  vector  (namely 
k)  which  is  orthogonal  to  both  i  and  j.  Thus  we  define  a  set  of  orthogonal  basis 
vectors  as  complete  if  there  is  no  other  vector  orthogonal  to  all  of  them  (in  the 
space  of  the  number  of  dimensions  we  are  considering).  By  analogy,  we  define  a  set 
of  orthogonal  functions  as  complete  on  a  given  interval  if  there  is  no  other  function 
orthogonal  to  all  of  them  on  that  interval.  Now  it  is  easy  to  see  that  there  are 
some  vectors  in  three  dimensions  which  cannot  be  represented  using  only  i  and 
j  .  Similarly,  there  are  functions  which  cannot  be  represented  by  a  series  using  an 
incomplete  set  of  orthogonal  functions.  We  have  discussed  one  example  of  this  in 
Fourier  series  (Chapter  7,  Section  11).  If  we  are  trying  to  represent  a  sound  wave 
by  a  Fourier  series,  we  must  not  leave  out  any  of  the  harmonics;  that  is,  the  set  of 
functions  sinner,  cosnx  on  (—7 r,  7r)  would  not  be  complete  if  we  left  out  some  of  the 
values  of  n.  As  another  example,  the  set  of  functions  sin  nx  is  an  orthogonal  set 
on  (— 7r,7r).  However,  it  is  not  complete;  to  have  a  complete  set  we  must  include 
also  the  functions  cos  net,  and  you  should  recall  that  this  is  what  we  did  in  Fourier 
series.  On  the  other  hand,  sin  nx  is  a  complete  set  on  (0, 7r);  we  used  this  fact  when 
we  started  with  a  function  given  on  (0,7r),  defined  it  on  (— 7r,  0)  to  make  it  odd, 
and  then  expanded  it  in  a  sine  series.  Similarly,  cos  nx  is  a  complete  set  on  (0,7r). 
In  this  chapter,  we  are  particularly  interested  in  the  fact  (which  we  state  without 
proof)  that  the  Legendre  polynomials  are  a  complete  set  on  (—1,1). 
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PROBLEMS,  SECTION  6 

1.  Show  that  if  A* (x)B(x)  dx  =  0  [see  (6.3)],  then  f^A(x)B*(x)dx  =  0,  and  vice 
versa. 

2.  Show  that  the  functions  e‘‘nnx/\  n  =  0,  ±1,  ±2,  •  •  •  ,  are  a  set  of  orthogonal  functions 
on  (—1,  l). 

3.  Show  that  the  functions  x2  and  sin  x  are  orthogonal  on  (—1, 1).  Hint:  See  Chapter  7, 
Section  9. 

4.  Show  that  the  functions  f(x)  and  g(x)  are  orthogonal  on  (—a,  a)  if  f(x)  is  even  and 
g(x)  is  odd.  (See  Problem  3.) 

5.  Evaluate  Po(x)P2(x)  dx  to  show  that  these  functions  are  orthogonal  on  (—1, 1). 

6.  Show  in  two  ways  that  Pi(x)  and  Pl(x)  are  orthogonal  on  (—1, 1).  Hint:  See  Prob¬ 
lem  4  and  Problem  4.4. 

7.  Show  that  the  set  of  functions  sinna;  is  not  a  complete  set  on  (— 7r,7r)  by  trying  to 
expand  the  function  f(x)  =  1  on  (— 7r,  7t)  in  terms  of  them. 

8.  Show  that  the  functions  cos  (n  +  ^) x,  n  =  0, 1,  2,  •  ■  ■  ,  are  orthogonal  on  (0,  n).  Ex¬ 
pand  the  function  f(x)  =  1  on  (0,  tt)  in  terms  of  them.  (Is  it  a  complete  set?  See 
Chapter  7,  end  of  Section  11.) 

9.  Show  in  two  ways  that  P2n+i(x)  dx  =  0. 

7.  ORTHOGONALITY  OF  THE  LEGENDRE  POLYNOMIALS 

We  are  going  to  show  that  the  Legendre  polynomials  are  a  set  of  orthogonal  functions 
on  (—1, 1),  that  is,  that 


(7.1) 


Pi(x)Pm(x)  dx  =  0 


unless  l  =  to. 


To  prove  this  we  rewrite  the  Legendre  differential  equation  (2.1)  in  the  form 
(7.2)  A[(i  _  x2)P!(x )]  +  1(1  +  l)^)  =  0. 

Write  (7.2)  for  Pi(x)  and  for  Pm(x);  multiply  the  Pi(x)  equation  by  Pm(x),  and  the 
Pm(x)  equation  by  Pi{x)  and  subtract  to  get 

(7  3)  Pm{x)^[{l  ~  *2)P/(X)]  “  Pl{-x)iL[{1  ~  x2)P^ 

+  [1(1  +  1)  -  m(m  +  1  )\Pm(x)Pi(x)  =  0. 

The  first  two  terms  of  (7.3)  can  be  written  as 

(7-4)  ^[(l-x2)(PmPl-PiP'm)\ 

where,  for  simplicity,  we  have  used  Pi  =  Pi(x),  and  so  forth.  Integrating  (7.3) 
between  —1  and  1  and  using  (7.4),  we  get 

/I 

Pm(x)Pl(x)dx  =  0. 
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The  integrated  term  is  zero  because  (1  —  x2)  =  0  at  s  =  ±1,  and  Pm(x)  and  Pi(x) 
are  finite.  The  bracket  in  front  of  the  integral  is  not  zero  unless  m  =  l.  Therefore 
the  integral  must  be  zero  for  l  ^  m  and  we  have  (7.1). 

The  method  we  have  used  here  is  a  standard  one  which  can  be  used  for  many 
other  sets  of  orthogonal  functions  to  prove  the  orthogonality  property  by  using  the 
differential  equation  satisfied  by  the  functions.  (See  Problems  1  and  2;  also  see 
Section  19  and  Problems  10.3,  22.7,  22.16,  22.24,  23.24b,  and  23.25.) 

Recall  (Section  5,  Problems  8  to  14)  that  we  can  write  any  polynomial  of  degree 
n  as  a  linear  combination  of  Legendre  polynomials  of  degree  <  n.  Thus,  by  (7.1), 
any  polynomial  of  degree  <  l  is  orthogonal  to  Pi  ( x ) : 


(7.6) 


Pi{ x)  ■  (any  polynomial  of  degree 


i-i 


<l)dx  =  0. 


PROBLEMS,  SECTION  7 

1.  By  a  method  similar  to  that  we  used  to  show  that  the  Pi  s  are  an  orthogonal  set 
of  functions  on  (—1, 1),  show  that  the  solutions  of  y "  =  —n2yn  are  an  orthogonal 
set  on  (— 7r,7r).  Hint:  You  should  know  what  functions  the  solutions  yn  are;  do  not 
use  the  functions  themselves,  but  you  may  use  their  values  and  the  values  of  their 
derivatives  at  —  n  and  n  to  evaluate  the  integrated  part  of  your  equation. 

2.  Following  the  method  in  (7.2)  to  (7.5),  show  that  the  solutions  of  the  differential 
equation 

(1  -  x2)y"  -  2 xy  +  [1(1  +  1)  -  (1  -  x2)_1]y  =  0 
are  a  set  of  orthogonal  functions  on  (—1,1). 

3.  Use  Problem  4.4  to  show  that  J21Prn(x)Pi(x)  dx  =  0  if  m  <  l.  Comment:  This 
amounts  to  a  different  proof  of  orthogonality — via  Rodrigues’  formula  instead  of  the 
differential  equation. 

4.  Use  equation  (7.6)  to  show  that  Pi(x)Pl'_1(x)  dx  =  0.  Hint:  What  is  the  degree 
of  P{_i(x)?  Also  show  that  P[(x)Pi+i(x)  dx  =  0. 

5.  Show  that  Pi(x)  dx  =  0,  l  >  0.  Hint:  Consider  Pi(x)Pq(x)  dx. 

6.  Show  that  Pi(x)  is  orthogonal  to  [Pi(x)]2  on  (—1, 1).  Hint:  See  Problem  6.4. 

8.  NORMALIZATION  OF  THE  LEGENDRE  POLYNOMIALS 

If  we  take  the  scalar  product  of  a  vector  with  itself,  A  •  A  =  A2 ,  we  get  the  square 
of  the  length  (or  norm)  of  the  vector.  If  we  divide  A  by  its  length,  we  get  a  unit 
vector.  In  Chapter  3,  Section  14  we  showed  that  we  can  think  of  functions  as  the 
vectors  of  a  vector  space  and  we  defined  the  norm  N  of  a  function  A(x)  on  (a,  b) 
by  [see  Chapter  3,  equation  (14.2)] 

rb  nb 

/  A*(x)A(x)  dx  =  /  \A(x)\2  dx  =  N2 . 

J  a  J  a 

We  also  say  that  the  function  N~1A(x)  is  normalized ;  like  a  unit  vector,  a  normal¬ 
ized  function  has  norm  =  1.  The  factor  A-1  is  called  the  normalization  factor.  For 
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example,  sin2  nx  dx  =  tt/2.  Then  the  norm  of  sinnx  on  (0,7r)  is  y/ir/2,  and  the 
functions  nx  have  norm  1  on  (0,7r),  that  is,  they  are  normalized.  A  set  of 

normalized  orthogonal  functions  is  called  orthonormal.  For  example,  y/2/7rsin  nx 
is  an  orthonormal  set  on  (0,7r). 

Such  a  set  of  orthonormal  functions  may  remind  us  of  i,j,k;  like  these  unit 
vectors,  the  functions  are  orthogonal  and  have  norm  =1.  If  the  elements  of  a 
vector  space  are  functions,  we  can  then  use  a  (complete)  orthonormal  subset  of  the 
functions  as  the  basis  vectors  of  the  space.  We  think  of  expanding  other  functions 
in  terms  of  them  (by  analogy  with  writing  a  three-dimensional  vector  in  terms  of 
i,j,k).  For  example,  suppose  we  have  expanded  a  given  function  f(x)  on  (0,7r)  in 
a  Fourier  sine  series: 


/0) 


sin  nx. 


We  call  f(x)  a  vector  with  components  Bn  in  terms  of  the  basis  vectors  y/2/ir  sin  nx. 
Thus,  in  quantum  mechanics,  we  often  refer  to  a  function  which  describes  the  state 
of  a  physical  system  as  either  a  state  function  or  a  state  vector.  Just  as  we  can 
write  a  three-dimensional  vector  in  terms  of  i,  j,  k,  or  in  terms  of  another  basis,  say 
er,eg,e0,  so  we  can  expand  a  given  /(x)  in  terms  of  another  orthonormal  set  of 
functions  and  find  its  components  relative  to  this  new  basis.  In  Section  9,  we  shall 
see  how  to  expand  functions  in  Legendre  series. 

Just  as  we  needed  the  norm  of  sinnx  in  Fourier  series,  so  we  shall  need  the  norm 
of  Pi(x )  in  expanding  functions  in  Legendre  series.  We  shall  prove  that 


Then  the  functions  yj (21  +  l)/2  P;  (x)  are  an  orthonormal  set  of  functions  on  (—1, 1). 
To  prove  (8.1),  we  use  the  recursion  relation  (5.8b),  namely, 

(8.2)  lPi{x)  =  xP/ (x)  -  P/_i(x). 

Multiply  (8.2)  by  Pi(x)  and  integrate  to  get 

(8.3)  l  J  [P/(x)]2  dx  =  J  xPi(x)P[{x)  dx  —  J  P;(x)P/_1(x)  dx. 

The  last  integral  is  zero  by  Problem  7.4.  To  evaluate  the  middle  integral  in  (8.3), 
we  integrate  by  parts: 


xPi{x)P[(x)  dx  =  -  [Pi{x))2 


1 


\Pi{x)]2  dx 


= 1  -  2  J  m*)]'* 


(see  Problem  2.2).  Then  (8.3)  gives 


[P/(x)]2  dx 
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which  simplifies  to  (8.1).  We  can  combine  (7.1)  and  (8.1)  to  write 


(8.4) 


I  Pl{x)Pm(x)  dx  =  +  l  Sim. 


PROBLEMS,  SECTION  8 

Find  the  norm  of  each  of  the  following  functions  on  the  given  interval  and  state  the 
normalized  function. 


1. 

cos  nx 

on  (0, 7r) 

2. 

P2(a; )  on  (-1,1) 

3. 

-x/2 

xe  ' 

on  (0, 00) 

4. 

e~x  on  (—00, 00 

5. 

-X2 /2 

xe  ' 

on  (0,  00) 

Hint:  See  Chapter  4,  Section  12. 

6.  Give  another  proof  of  (8.1)  as  follows.  Multiply  (5.8e)  by  Pi{x)  and  integrate  from 
—  1  to  1.  To  evaluate  the  middle  term,  integrate  by  parts.  Then  use  Problem  7.4. 

7.  Using  (8.1),  write  the  first  four  normalized  Legendre  polynomials  and  compare  with 
the  answers  we  found  by  a  different  method  in  Chapter  3,  Section  14,  Example  6. 


9.  LEGENDRE  SERIES 

Since  the  Legendre  polynomials  form  a  complete  orthogonal  set  on  (—1, 1),  we  can 
expand  functions  in  Legendre  series  just  as  we  expanded  functions  in  Fourier  series. 

Example  1.  Expand  in  a  Legendre  series  the  function  /( x)  given  by 

(9.1) 

(see  Figure  9.1).  We  put 

(9.2) 

Our  problem  is  to  find  the  coefficients  c; .  We  do  this  by  a  method  parallel  to  the  one 
we  used  in  finding  the  formulas  for  the  coefficients  in  a  Fourier  series.  We  multiply 
both  sides  of  (9.2)  by  Pm(x)  and  integrate  from  —1  to  1.  Because  the  Legendre 
polynomials  are  orthogonal,  all  the  integrals  on  the  right  are  zero  except  the  one 
containing  cm,  and  we  can  evaluate  it  by  (8.1).  Thus  we  get 

/I  oo 

f{x)Pm(x)  dx=YJCi 
1  1=0 


Pl^X^Pm^x'j  dx  —  C77 


2m  +  1 


f(x)  = 


0,  —  1  <  x  <  0, 

1,  0  <  x  <  1, 


f(x)  =Y2cipi(x)- 


f(x) 

1 


1 


I 


-1  1 

Figure  9.1 


1=0 
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Using  this  result  in  our  example  (9.1),  we  find 


J  f(x)P0{x)dx  =  c0J  [P0{x)}2  dx 
J  f(x)P\(x)  dx  =  Ci  J  [Pi(x)]'2dx 
J  f{x)P2(x)  dx  =  c2  J  [P2(x)}2  dx 


or 


or 


or 


J  dx  =  cq-2,  c0  = 


x  dx  =  Ci  ■  Ci  = 


f  (fa;2  -  i)  dx  =  c2  ■  §,  c2  =  0. 
Jo 


Continuing  in  this  way  we  find  for  the  function  given  in  (9.1) 

(9.4)  f(x)  =  \Po{x)  +  |Pi( x)  -  jqP3(x)  +  §P5(x)  +  ■■■  ■ 

It  is  unnecessary  for  fix)  to  be  continuous  as  it  must  be  for  expansion  in  a 
Maclaurin  series.  Just  as  for  Fourier  series,  the  Dirichlet  conditions  (see  Chapter  7, 
Section  6)  are  a  convenient  set  of  sufficient  conditions  for  a  function  /(x)  to  be 
expandable  in  a  Legendre  series.  If  /(x)  satisfies  the  Dirichlet  conditions  on  (—1,1), 
then  at  points  inside  (—1,1)  (not  necessarily  at  the  endpoints),  the  Legendre  series 
converges  to  /(x)  anywhere  /(x)  is  continuous  and  converges  to  the  midpoint  of 
the  jump  at  discontinuities. 


Example  2.  Here  is  an  interesting  fact  about  Legendre  series.  Sometimes  we  want  to  fit 
a  given  curve  as  closely  as  possible  by  a  polynomial  of  a  given  degree,  say  a  cubic. 
The  criterion  of  “Least  Squares”  is  often  used  to  determine  the  best  fit.  This  means 
that  if,  say,  we  want  to  fit  a  given  /(x)  on  (—1, 1)  by  a  cubic,  we  find  the  coefficients 
a,  b,  c,  d  so  that 


is  as  small  as  possible.  Then 

(9.6)  /(x)  =  ax3  +  bx2  +  cx  +  d 

is  called  the  best  approximation  (by  a  cubic)  in  the  least  squares  sense.  It  can 
be  proved  that  an  expansion  (as  far  as  the  desired  degree  of  the  polynomial  ap¬ 
proximation)  in  Legendre  polynomials  gives  this  best  least  squares  approximation 
(Problem  16). 


PROBLEMS,  SECTION  9 


Expand  the  following  functions  in  Legendre  series. 


1. 

fix) 

—  1  <  X 

0  <  x 

<  0 

<  1 

2. 

3. 

fix) 

—  P3  (x) 

4. 

5. 

f(x) 

fix) 

fix) 


I  0,  —  1  <  x  <  0 

lx,  0  <  x  <  1 

arc  sin  x 


1 
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6. 

/(*) 

=uv 

on  (—1, 0) 
on  (0, 1) 

Hint:  See 

7. 

/(*) 

-!  0 

on  (—1, 0) 

Hint:  See 

[Vi  -  x 

on  (0, 1) 

8. 

fix) 

Hint:  Solve  the  recursion 

— 

1  j - 

, 

!  , 

/  Pi  (x)  dx  = 

-l  a  l 


Chapter  11,  Section  3,  Problem  13. 

Chapter  11,  Sections  6  and  7. 
relation  (5.8e)  for  Pi  (a;)  and  show  that 

(a)-fl+1(o)]- 


9.  /(x)  =  P„(x).  Hint:  For  l  >  n,  P„(x)Pi(x)  dx  =  0  (Why?);  for  l  <  n,  integrate 

by  parts. 

Expand  each  of  the  following  polynomials  in  a  Legendre  series.  You  should  get  the  same 
results  that  you  got  by  a  different  method  in  the  corresponding  problems  in  Section  5. 

10.  3a:2  +  a:  —  1  11.  7a:4  -  3a:  +  1  12.  x  -  x3 


Find  the  best  (in  the  least  squares  sense)  second-degree  polynomial  approximation  to  each 
of  the  given  functions  over  the  interval  —1  <  x  <  1.  (See  Problem  16.) 

13.  a:4  14.  1*1  15.  cos7rx 


16.  Prove  the  least  squares  approximation  property  of  Legendre  polynomials  [see  (9.5) 
and  (9.6)]  as  follows.  Let  /( x)  be  the  given  function  to  be  approximated.  Let  the 
functions  pi  (x)  be  the  normalized  Legendre  polynomials,  that  is, 

pi(x)  =  sJ*±±Pl(x)  go  that  J  \pi(x)]2  dx  =  1. 

Show  that  the  Legendre  series  for  /(x)  as  far  as  the  P2{x)  term  is 

/(x)  =  copo(x)  +  cipi(x)  +  C2P2 (x)  with  ci  =  J  f(x)pi(x)  dx. 

Write  the  quadratic  polynomial  satisfying  the  least  squares  condition  as  bopo(x)  + 
bipi(x)  +  b2p2(x')  (by  Problem  5.14  any  quadratic  polynomial  can  be  written  in  this 
form).  The  problem  is  to  find  bo,bi,b2  so  that 

/  =  J  [/(x)  -  (bopo(x)  +  bipi(x)  +  b2p2(x))f  dx 


is  a  minimum.  Square  the  bracket  and  write  I  as  a  sum  of  integrals  of  the  individual 
terms.  Show  that  some  of  the  integrals  are  zero  by  orthogonality,  some  are  1  because 
the  pi ’s  are  normalized,  and  others  are  equal  to  the  coefficients  c; .  Add  and  subtract 
Co  +  Ci  +  c|  and  show  that 

/  =  J  [/2(x)  +  (bo  -  co)2  +  (bi  -  ci)2  +  (62  -  c2)2  -  Cq  -  c2  -  c2]  dx. 


Now  determine  the  values  of  the  b’s  to  make  I  as  small  as  possible.  (Hint:  The 
smallest  value  the  square  of  a  real  number  can  have  is  zero.)  Generalize  the  proof 
to  polynomials  of  degree  n. 
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10.  THE  ASSOCIATED  LEGENDRE  FUNCTIONS 


A  differential  equation  closely  related  to  the  Legendre  equation  is 


(10.1) 


(1  ~x2)y" 


2  xy'  + 


1(1  +  1) 


y  =  o 


with  m2  <  l2.  We  could  solve  this  equation  by  series;  however,  it  is  more  useful 
to  know  how  the  solutions  are  related  to  Legendre  polynomials,  so  we  shall  simply 
verify  the  known  solution.  First  we  substitute 

(10.2)  y  =  (1  —  x2)m/2u 
into  (10.1)  and  obtain  (Problem  1) 

(10.3)  (1  —  x2)u"  —  2  (to  +  1  )xu'  +  [1(1  +  1)  —  to(to  +  l)]it  =  0. 

For  to  =  0,  this  is  Legendre’s  equation  with  solutions  Pi(x).  Differentiate  (10.3), 
obtaining  (Problem  1) 

(10.4)  (1  -  x2)(u')''  -  2 [(to  +  1)  +  1  ]x(u')'  +  [1(1  +  1)  -  (to  +  l)(m  +  2 )}u'  =  0. 

But  this  is  just  (10.3)  with  u'  in  place  of  u,  and  (to  +  1)  in  place  of  to.  In  other 
words,  if  Pi(x)  is  a  solution  of  (10.3)  with  m  =  0,  P((x)  is  a  solution  of  (10.3)  with 
to  =  1,  P" (x)  is  a  solution  with  m  =  2,  and  in  general  for  integral  to,  0  <  m  <  /, 
(dm / dxm)Pi(x)  is  a  solution  of  (10.3).  Then 

dm 

(10.5)  y  =  {1-X2r/2_  pl{x) 

is  a  solution  of  (10.1).  The  functions  in  (10.5)  are  called  associated  Legendre  func¬ 
tions  are  are  denoted  by 


jm 

(10.6)  PT(x)  =  (  l-x2r/2—Pl(x) 


Associated  Legendre  functions 


[Some  authors  include  a  factor  (— l)m  in  the  definition  of  P(n(x)] 

A  negative  value  for  to  in  (10.1)  does  not  change  to2,  so  a  solution  of  (10.1) 
for  positive  m  is  also  a  solution  for  the  corresponding  negative  m.  Thus  many 
references  define  Pfn(x)  for  —l<m<l  as  equal  to  P\m\x).  Alternatively,  we  may 
use  Rodrigues’  formula  (4.1)  for  Pi(x)  in  (10.6)  to  get 


(10.7) 


pr(x)  =  ±(i-x2r'2-P^(x2-iy. 


It  can  be  shown  that  (10.7)  is  a  solution  of  (10.1)  for  either  positive  or  nega¬ 
tive  to;  however  Pl~m(x)  and  P(n(x)  are  then  proportional  rather  than  equal  (see 
Problem  8). 
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For  each  m,  the  functions  P{n(x)  are  a  set  of  orthogonal  functions  on  (—1,1) 
(Problem  3).  The  normalization  constants  can  be  evaluated;  for  the  definition  (10.7) 
we  find  (Problem  10) 


(10.8) 


2  (l  +  m)\ 
21  +  1  (l  —  m)l 


The  associated  Legendre  functions  arise  in  many  of  the  same  problems  in  which 
Legendre  polynomials  appear  (see  the  first  paragraph  of  Section  2);  in  fact,  the 
Legendre  polynomials  are  just  the  special  case  of  the  functions  P^ix)  when  m  =  0. 


PROBLEMS,  SECTION  10 

1.  Verify  equations  (10.3)  and  (10.4). 

2.  The  equation  for  the  associated  Legendre  functions  (and  for  Legendre  functions 
when  m  =  0)  usually  arises  in  the  form  (see,  for  example,  Chapter  13,  Section  7) 


1  d 
sin  8  dd 


1(1  +  1)- 


sin2  8 


V  =  0. 


Make  the  change  of  variable  x  =  cos 8,  and  obtain  (10.1). 


3. 


Show  that  the  functions  P[rl(x)  for  each  m  are  a  set  of  orthogonal  functions  on 
(-1,1),  that  is,  show  that 


P™(x)P™(x)  dx  =  0,  l  ^  n. 


Hint:  Use  the  differential  equations  (10.1)  and  follow  the  method  of  Section  7. 

Substitute  the  Pi(x)  you  found  in  Problems  4.3  or  5.3  into  equation  (10.6)  to  find  P{n(x)\ 
then  let  x  =  cos  8  to  evaluate: 


4.  Pi  (cos  6<) 


5.  P41  (cos  6») 


6.  P32  (cos  8) 


7.  Show  that 


dl 


:(*  -i  )  = 


i  _  (l  -  m)\ ,  2 


/  2  i  \r 

(x  -1) 


d 


Z+n 


:(z2-l  y. 


dx l~rn  (l  +  m) !  dxl+n 

Hint:  Write  (x2  —  l)1  =  (x  —  l)l(x  +  1)*  and  find  the  derivatives  by  Leibniz’  rule. 

8.  Write  (10.7)  with  m  replaced  by  — m;  then  use  Problem  7  to  show  that 

ppm( x)  =  (-i)mj^pr(x). 

Comment:  This  shows  that  (10.7)  is  a  solution  of  (10.1)  when  m  is  negative. 

9.  Use  Problem  7  to  show  that 


(l-m) !  2T! 


10.  Derive  (10.8)  as  follows:  Multiply  together  the  two  formulas  for  Pjrn(x)  given  in 
(10.7)  and  Problem  9.  Then  integrate  by  parts  repeatedly  lowering  the  l  +  m  deriva¬ 
tive  and  raising  the  l  —  m  derivative  until  both  are  l  derivatives.  Then  use  (8.1). 
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11.  GENERALIZED  POWER  SERIES  OR  THE  METHOD  OF  FROBENIUS 

It  may  happen  that  the  solution  of  a  differential  equation  is  not  a  power  series 
S^Lo  anXn  but  may  either 

(a)  contain  some  negative  powers  of  x,  for  example, 

cos  x  1  1  x2 

^  x2  x 2  2!  4! 

or 

(b)  have  a  fractional  power  of  £  as  a  factor,  for  example, 


Both  these  cases  (and  others — see  Section  21)  are  covered  by  a  series  of  the  form 

oo  oo 

(11.1)  y  =  Xs  anxn  =  Y  gnxn+s, 

n— 0  n—0 

where  s  is  a  number  to  be  found  to  fit  the  problem;  it  may  be  either  positive  or 
negative  and  it  may  be  a  fraction.  (In  fact,  it  may  even  be  complex,  but  we  shall 
not  consider  this  case.)  Since  clqx8  is  to  be  the  first  term  of  the  series,  we  assume 
that  oo  is  not  zero.  The  series  (11.1)  is  called  a  generalized  power  series.  We  shall 
consider  some  differential  equations  which  can  be  solved  by  assuming  a  series  of 
the  form  (11.1);  this  way  of  solving  differential  equations  is  called  the  method  of 
Frobenius. 

Example  1.  As  an  illustration  of  this  method  we  solve  the  equation 

(11.2)  x2y"  +  4xy'  +  (x2  +  2  )y  =  0. 

From  (11.1)  we  have 

OO 

y  =  aoXs  +  aixs+1  +  a2Xs+2  H - =  ^  anxn+s , 

n—0 

y'  =  sa  oXs_1  +  (s  +  l)aixs  +  (s  +  2)a2Xs+1  +  •  •  • 

OO 

(11.3)  =  (n  +  s)anxn+s-\ 

n—0 

y"  =  s(s  —  l)aoXs~2  +  (s  +  l)saixs^1  +  (s  +  2)(s  +  l)a2Xs  +  •  •  • 

OO 

=  ^2(n  +  s)(n  +  s  -  1  )anxn+s~2. 

n—0 

We  substitute  (11.3)  into  (11.2)  and  set  up  a  table  of  powers  of  x  as  we  did  for  the 
Legendre  equation: 


Xs 

xs+1 

xs+2 

rj-n+s 

x2y" 

s(s  -  1  )a0 

( s  +  l)sai 

( S  +  2)(s  +  I)(l2 

(n  +  s)(n  +  s  —  l)an 

4  xy' 

4sao 

4(s  +  l)ai 

2(s  +  2)a2 

4(n  +  s)an 

x2y 

a0 

O-n-2 

2  y 

2ag 

2ai 

2a2 

‘Zdn 
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The  total  coefficient  of  each  power  of  x  must  be  zero.  From  the  coefficient  of  Xs  we 
get  (s2  +  3s  +  2)ao  =  0,  or  since  ao  yf  0  by  hypothesis, 

(11.4)  s2  +  3s  +  2  =  0. 

This  equation  for  s  is  called  the  indicial  equation.  We  solve  it  and  find 

s  =  -2,  s  =  — 1. 

From  here  on  we  solve  two  separate  problems,  one  when  s  =  —2,  and  another  when 
s  =  —  1;  a  linear  combination  of  the  two  solutions  so  obtained  is  then  the  general 
solution  just  as  A  sin  x  +  Bcosx  is  the  general  solution  of  y"  +  y  =  0. 


Example  2.  For  s  =  —1,  the  coefficient  of  xs+1  in  the  table  gives  oq  =  0.  From  the 
xs+2  column  on,  we  can  use  the  general  formula  given  by  the  last  column.  Notice, 
however,  that  the  first  two  columns  in  the  table  do  not  contain  the  an_ 2  term,  so 
you  must  be  careful  about  using  the  general  term  at  first  (Problems  13  and  14). 
From  the  general  column  with  s  =  —  1,  we  have 

an[(n  -  \){n  +  2)  +  2]  =  -a„_2 


or 


"On— 2 


for  n  >  2. 


n(n  +  1) 

Since  ai  =  0,  this  gives  all  odd  a’s  equal  to  zero.  For  even  a’s: 


(11.5) 


a2  = 


a  0 


04  = 


ao 

5l"’ 


06  — 


ao 

7!  ’ 


Then  one  solution  of  (11-2)  is 

(11.6) 


_i  ao  ,  °o  3 
y  =  a0x  -  —a;  +  —x  - 


=  CLqX 


-2 


X  — 


X3  X5 

5f 


ao  sin  x 
x2 


The  other  solution,  when  s  =  —2,  will  be  left  to  Problem  1. 


PROBLEMS,  SECTION  11 

1.  Finish  the  solution  of  equation  (11.2)  when  s  =  —2.  Write  your  solution  in  closed 
form  as  in  (11.6).  To  avoid  confusion  with  the  an  values  we  found  when  s  =  —1,  you 
may  want  to  call  the  coefficients  in  your  series  a'n  or  bn;  however,  this  is  not  essential 
as  long  as  you  realize  that  there  are  two  separate  problems,  one  when  s  =  —  1  and 
one  when  s  =  —2,  and  each  series  has  its  own  coefficients. 

Solve  the  following  differential  equations  by  the  method  of  Frobenius  (generalized  power 
series).  Remember  that  the  point  of  doing  these  problems  is  to  learn  about  the  method 
(which  we  will  use  later),  not  just  to  find  a  soiution.  You  may  recognize  some  series  [as 
we  did  in  (11.6)]  or  you  can  check  your  series  by  expanding  a  computer  answer. 


2. 

2  // 
x  y 

+ 

xy'  —  9y  =  0 

3. 

2  // 
x  y 

+  2xy'  —  6y  =  0 

4. 

2  n 
x  y 

- 

O 

II 

CD 

5. 

2  xy" 

+  y'  +2y  =  0 

6. 

3  xy" 

+ 

(3z  +  I)?/  +  y  =  0 

7. 

2  // 
*  y 

-  (x2  +  2)y  =  0 

8. 

2  // 
x  y 

+ 

to 

B 

to 

1 

to 

0 

9. 

// 

xy  - 

-  y1  +  9 x5y  =  0 

10. 

2  xy" 

- 

0 

II 

CM 

+ 

11. 

36a;2; 

y"  +  (5  -  9a 2)y  =  0 

Section  12 


Bessel’s  Equation  587 


12.  3 xy"  -  2(3*  -  1  )y  +  (3*  —  2)y  =  0 

Consider  each  of  the  following  problems  as  illustrations  showing  that,  in  a  power  series 
solution,  we  must  be  cautious  about  using  the  general  recursion  relation  between  the 
coefficients  for  the  first  few  terms  of  the  series. 

13.  Solve  y"  +  y' /x2  =  0  by  power  series  to  find  the  relation 

n(n  —  1) 

O'Ti+l  =  "  CL-n- 

71+  1 

If,  without  thinking  carefully,  we  test  the  series  anXn  for  convergence  by  the 

ratio  test,  we  find 

\an+1xn+1\ 

lim  — : - ; — =00.  (Show  this.) 

n=oo  \anxn\ 

Thus  we  might  conclude  that  the  series  diverges  and  that  there  is  no  power  series 
solution  of  this  equation.  Show  why  this  is  wrong,  and  that  the  power  series  solution 
is  y  =  const. 

14.  Solve  y"  =  —y  by  the  Frobenius  method.  You  should  find  that  the  roots  of  the 

indicial  equation  are  s  =  0  and  s  =  1.  The  value  s  =  0  leads  to  the  solutions  cos* 
and  sin*  as  you  would  expect.  For  s  =  1,  call  the  series  y  =  bnxrl+1,  and  find 

the  relation 

^  _  bn 

+2  =  _(n  +  3)(n  +  2)’ 

Show  that  the  bo  series  obtained  from  this  relation  is  just  sin  *,  but  that  the  b\  series 
is  not  a  solution  of  the  differential  equation.  What  is  wrong? 

12.  BESSEL'S  EQUATION 

Like  Legendre’s  equation,  Bessel’s  equation  is  another  of  the  “named”  equations 
which  have  been  studied  extensively.  There  are  whole  books  on  Bessel  functions, 
and  you  will  find  numerous  formulas,  graphs,  and  numerical  values  available  in 
your  computer  program  and  in  reference  tables.  You  can  think  of  Bessel  functions 
as  being  something  like  damped  sines  and  cosines.  In  fact,  if  you  had  first  learned 
about  sinnx  and  cos  nx  as  power  series  solutions  of  y"  =  — n2y  instead  of  in  ele¬ 
mentary  trigonometry,  you  would  not  feel  that  Bessel  functions  were  appreciably 
more  difficult  or  strange  than  trigonometric  functions.  Like  sines  and  cosines,  Bessel 
functions  are  solutions  of  a  differential  equation;  they  can  be  represented  by  power 
series,  their  graphs  can  be  drawn,  and  many  formulas  involving  them  (compare 
trigonometric  identities)  are  known.  Of  special  interest  to  science  students  is  the 
fact  that  they  occur  in  many  applications.  The  following  list  of  some  of  the  prob¬ 
lems  in  which  they  arise  will  give  you  an  idea  of  the  great  range  of  topics  which 
may  involve  Bessel  functions:  problems  in  electricity,  heat,  hydrodynamics,  elas¬ 
ticity,  wave  motion,  quantum  mechanics,  etc.,  involving  cylindrical  symmetry  (for 
this  reason  Bessel  functions  are  sometimes  called  cylinder  functions);  the  motion 
of  a  pendulum  whose  length  increases  steadily;  the  small  oscillations  of  a  flexible 
chain;  railway  transition  curves;  the  stability  of  a  vertical  wire  or  beam;  Fresnel 
integrals  in  optics;  the  current  distribution  in  a  conductor;  Fourier  series  for  the  arc 
of  a  circle.  We  shall  discuss  some  of  these  applications  later  (see  Section  18,  and 
Chapter  13,  Sections  5  and  6). 
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Bessel’s  equation  in  the  usual  standard  form  is 


(12.1) 


x2y"  +  xy'  +  (x2  -  p2)y  =  0, 


where  p  is  a  constant  (not  necessarily  an  integer)  called  the  order  of  the  Bessel  func¬ 
tion  y  which  is  the  solution  of  (12.1).  You  can  easily  verify  that  a '{xy')'  =  x2y”+xy ', 
so  we  can  write  (12.1)  in  the  simpler  form 

(12.2)  x(xy')'  +  (x2  —  p2)y  =  0. 

We  find  a  generalized  power  series  for  (12.2)  in  the  same  way  that  we  solved  (11.2). 
[In  fact,  (11.2)  is  a  form  of  Bessel’s  equation!  See  Problems  16.1  and  17.1.]  Writing 
only  the  general  terms  in  the  series  for  y  and  the  derivatives  we  need  in  (12.2),  we 
have 

OO 

y='^2  anXn+S 
n— 0 
oo 

v'  =  an(n  +  s)xn+s~1 

71=0 

OO 

(12.3)  xy'  =  an(n  +  s)xn+s 

n— 0 
oo 

C xy'Y  =  ^2  an(n  +  s)2a:”'+s_1 

n— 0 
oo 

x(xy')'  =  ^2  an{n  +  s)2xn+s 

n= 0 

We  substitute  (12.3)  into  (12.2)  and  tabulate  the  coefficients  of  powers  of  x: 


Xs 

xs+1 

xs+2 

xs+n 

x{xy')' 

s2a0 

(1  +  s)2ai 

(2  +  s)2a  2 

( n  +  s)2an 

x2y 

ao 

an-2 

- p2y 

-p2a0 

—p2  0,1 

- P2a2 

~P2an 

The  coefficient  of  xs  gives  the  indicial  equation  and  the  values  of  s: 

s2  —  p2  =  0,  s  =  ±p. 

The  coefficient  of  rrs+1  gives  ai  =  0.  The  coefficient  of  xs+2  gives  02  in  terms  of  ao, 
etc.,  but  we  may  as  well  write  the  general  formula  from  the  last  column  at  this 
point.  We  get 

[(n  +  s)2  -  p2)an  +  an-2  =  0 


or 


&n— 2 

(n  +  s)2  —  p2 


(12.4) 
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(12.5) 


First  we  shall  find  the  coefficients  for  the  case  s  =  p.  From  (12.4)  we  have 
On-2  On_  2  an- 2 


an  =  - 


(n  +  p)2  —  p2  n2  +  2np  n(n  +  2p) 


Since  ai  =  0,  all  odd  a’s  are  zero.  For  even  a’s  it  is  convenient  to  replace  n  by  2 n; 
then  from  (12.5)  we  have 


(12.6) 


02 n  ~  ~ 


02n-2 


02n-2 


2n(2n  +  2  p)  2  2n(n  +  p) 


The  formulas  for  the  coefficients  can  be  simplified  by  the  use  of  the  T  function 
notation  (Chapter  11,  Sections  2  to  5)  as  you  can  see  by  examining  (12.7)  below. 
Recall  that  T(p  +  1)  =  pT(p)  for  any  p ,  so, 

r(p  +  2)  =  (p  +  i)r(p  +  i), 

F(p  +  3)  =  (p  +  2)r(p  +  2)  =  (p  +  2)(p  +  i)r(p  + 1), 

and  so  on.  Then  from  (12.6)  we  find 

oo  a0T(l+p) 


a2  = 


(12.7) 


a4  =  — 


clq  —  — 


22r(2 +vy 

a0 


a0r(l  +p) 


22(1  +  p) 

02 

23(2  +  p)  ~  2!24(1  +  p)(2  +p)  ~  2!24r(3  +  p)  ’ 
CL/±  a  o 

3!2(3  +  p)  =  3!26(1  +  p)(2  +p)(3  +  p) 

oqT(1  +p) 

3!26T(4  +  p)  ’ 

and  so  on.  Then  the  series  solution  (for  the  s  =  p  case)  is 

1  1 


y  =  a0xpT(l+p) 
1 

+ 


(12.8) 


(!) 

a02^(|)Pr(l+p) 


r(i  +  p)  r(2  +  p) 
4  1 


2!r(3+p) 

x\P 


3!r(4  +  p) 
1 


(f)' 

(i)e 


r(i)r(i+P)  r(2)r(2  +  p) 


(f): 


r(3)r(3  +  p) 


fx\A  1  ( 

r\6 

v 2/  r(4)r(4  +  p)  v 

2)  +'"J 

We  have  inserted  T(l)  and  T(2)  (which  are  both  equal  to  1)  in  the  first  two  terms 
and  written  xp  =  2p(x/2)p  to  make  the  series  appear  more  systematic.  If  we  take 


o0  = 


2PY{l+p) 


or 


2  PpV 


then  y  is  called  the  Bessel  function  of  the  first  kind  of  order  p ,  and  written  Jp(x). 
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Jp{%)  — 


(tr 


(12.9) 


r(i)r(i  +  P) 

l 


x\2+p 


=  £ 


r(3)r(3  +  p) 

(-i)” 


(1) 


r(2)r(2  +  p) 

\  4+p  1 


(!) 


X\  6+P 


r(4)r(4  +  p) 

X \ 2 n+p 


(!) 


—'T  (n  +  l)r(n  +  1  +  p) 

n—0 


(!) 


PROBLEMS,  SECTION  12 


1. 

Show  by  the  ratio  test  that  the  infinite 

series  (12.9)  for  JP{x)  converges  for  all  x. 

Use  (12.9)  to  show  that: 

2. 

J*(x)  =  (2/x)Ji(x)  -  Jo(x) 

3.  Ji{x)  +  Js(x)  =  (4/x)J2(x) 

4. 

(d/dx)Jo(x)  =  —Ji(x) 

5.  (d/dx)[xJi(x)\  —  xJo(x) 

6. 

Jo(x)  —  J2(x)  =  2{d/dx)J\[x) 

7.  lim  Ji(x)/x  =  1 

x — >0  z 

8. 

lim  x  3,72  J3/2(x)  =  3-1-y/2/7r  Hint: 

x — >0 

See  Chapter  11,  equations  (3.4)  and  (5.3) 

9. 

\J -xx  j2  J\/2  (x)  =  sin  a: 

13.  THE  SECOND  SOLUTION  OF  BESSEL'S  EQUATION 

We  have  found  just  one  of  the  two  solutions  of  Bessel’s  equation,  that  is,  the  one 
when  s  =  p;  we  must  next  find  the  solution  when  s  =  —p.  It  is  unnecessary  to 
go  through  the  details  again;  we  can  just  replace  p  by  —  p  in  (12.9).  In  fact,  the 
solution  when  s  =  —p  is  usually  written  J_p.  From  (12.9)  we  have 


(13.1) 


71=0 


(-1)" 

r(n+l)r(n-p+l) 


X\2n~P 
2  ) 


If  p  is  not  an  integer,  Jp(x)  is  a  series  starting  with  xp  and  J_p  is  a  series  starting 
with  x~p.  Then  Jp(x)  and  J_p( x)  are  two  independent  solutions  and  a  linear 
combination  of  them  is  a  general  solution.  But  if  p  is  an  integer,  then  the  first  few 
terms  in  J_p  are  zero  because  T(n  ~  p  +  1)  in  the  denominator  is  T  of  a  negative 
integer,  which  is  infinite.  You  can  show  (Problem  2)  that  J-P(x)  starts  with  the 
term  xp  (for  integral  p)  just  as  Jp(x)  does,  and  that 

(13.2)  J-P(x)  =  (— l)pJp(x)  for  integral  p\ 

thus  J-P(x)  is  not  an  independent  solution  when  p  is  an  integer.  The  second  solution 
in  this  case  is  not  a  Frobenius  series  (11.1)  but  contains  a  logarithm.  Jp(x)  is  finite 
at  the  origin,  but  the  second  solution  is  infinite  and  so  is  useful  only  in  applications 
involving  regions  not  containing  the  origin. 

Although  J-p(x)  is  a  satisfactory  second  solution  when  p  is  not  an  integer,  it  is 
customary  to  use  a  linear  combination  of  Jp(x)  and  J_p(x)  as  the  second  solution. 
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This  is  much  as  if  sin  x  and  (2  sin  x  —  3  cos  x)  were  used  as  the  two  solutions  of 
y"  +  y  =  0  instead  of  sinx  and  cosx.  Remember  that  the  general  solution  of 
this  differential  equation  is  a  linear  combination  of  sinx  and  cosx  with  arbitrary 
coefficients.  But  A  sin  x  +  B{ 2  sin  x  —  3  cos  x)  is  just  as  good  a  linear  combination  as 
Ci  sinx  +  C2  cosx.  Similarly,  any  combination  of  Jp(x)  and  J_p(x)  is  a  satisfactory 
second  solution  of  Bessel’s  equation.  The  combination  which  is  used  is  called  either 
the  Neumann  or  the  Weber  function  and  is  denoted  by  either  Np  or  Yp: 


(13.3) 


Np(x)  =  Yp(x) 


cos(np)  Jp(x)  —  J-P(x) 
sin  np 


For  integral  p  this  expression  is  an  indeterminate  form  0/0.  However,  for  any  x/0 
it  has  a  limit  (as  p  tends  to  an  integral  value)  which  gives  a  second  solution.  This  is 
why  the  special  form  (13.3)  is  used;  it  is  valid  for  any  p.  Np  or  Yp  are  called  Bessel 
functions  of  the  second  kind.  The  general  solution  of  Bessel’s  equation  (12.1)  or 
(12.2)  may  then  be  written  as 

(13.4)  y  =  AJp{x)  +  BNp{x), 

where  A  and  B  are  arbitrary  constants. 

PROBLEMS,  SECTION  13 

1.  Using  equations  (12.9)  and  (13.1),  write  out  the  first  few  terms  of  Jo(x),  Ji(x), 

J- i(x),  J2 (x) ,  2 (x) .  Show  that  J_i(x)  =  —  Ji(x)  and  J_2(x)  =  J2(x). 

2.  Show  that,  in  general  for  integral  n,  J_n(x)  =  (— 1  )nJn(x),  and  Jn(— x)  =  (— l)nJn(x) 
Use  equations  (12.9)  and  (13.1)  to  show  that: 

3.  ybrx/2  J_i/2(x)  =  cos  x 

4.  ^3/2(x)  =  x_1Ji/2(x)  -  J— i/2(x). 

5.  Using  equation  (13.3),  show  that  N1/2(x)  =  — J_i/2(x);  that  N3/2{x)  =  J_ 3/2(x). 

6.  Show  from  (13.3)  that  lV(2n+i)/2(x)  =  (-l)n+1  J_(2n+i)/2(x). 

14.  GRAPHS  AND  ZEROS  OF  BESSEL  FUNCTIONS 

You  can  find  the  values  of  Bessel  functions  both  from  your  computer  program  and 
in  reference  books,  and  you  can  use  your  computer  to  plot  graphs  of  Bessel  functions 
(see  problems).  Except  for  J0(x),  all  the  Jp  s  start  at  the  origin  behaving  like  xp 
and  then  oscillate  something  like  sin  x  but  with  decreasing  amplitude.  Jo(x)  is  equal 
to  1  at  x  =  0  and  so  looks  something  like  a  damped  cosine.  All  the  TV’s  are  ±00  at 
the  origin,  but  away  from  it  they  also  oscillate  with  decreasing  amplitude. 

The  values  of  x  for  which  sin  x  =  0  (called  the  zeros  for  sin  x)  do  not  need  to  be 
computed  because  they  are  just  x  =  nn  for  n  =  0, 1, 2,  •  ■  ■ .  The  zeros  of  the  Bessel 
functions,  however,  do  not  occur  at  regular  intervals;  they  have  to  be  computed 
numerically.  You  can  find  their  values  by  computer  or  in  tables.  It  is  worth  noticing 
that  the  difference  between  two  successive  zeros  becomes  approximately  n  (as  it  is 
for  sin  x  and  cos  x)  when  x  is  large.  You  can  see  this  from  graphs  of  the  functions  or 
from  tables  of  the  zeros,  or  from  the  approximate  formulas  for  the  Bessel  functions 
when  x  is  large.  (See  Section  20). 
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PROBLEMS,  SECTION  14 


1.  By  computer,  plot  graphs  of  JP(x)  for  p  =  0,1,  2,  3,  and  x  from  0  to  15. 

2.  From  the  graphs  in  Problem  1,  read  approximate  values  of  the  first  three  zeros  of 
each  of  the  functions.  Then,  by  computer,  find  more  accurate  values  of  the  zeros. 

3.  By  computer,  plot  Nq{x)  for  x  from  0  to  15,  and  Np(x)  for  p  =  1,2,3,  and  x  from 
1  to  15. 

4.  From  the  graphs  in  Problem  3,  read  approximate  values  of  the  first  three  zeros  of 
each  of  the  functions,  and  then  find  more  accurate  values  by  computer. 

5.  By  computer,  plot  \fx  J\/2{x)  for  x  from  0  to  4-7T.  Do  you  recognize  the  curve?  See 
Problem  12.9. 

6.  By  computer,  find  30  zeros  of  Jo  and  note  that  the  spacing  between  consecutive 
zeros  is  tending  to  7r. 


15.  RECURSION  RELATIONS 


The  following  useful  relations  hold  among  Bessel  functions  and  their  derivatives. 
Although  we  state  them  and  outline  proofs  for  Jp(x),  they  also  hold  for  Np(x). 


(15.1) 


d 


(15.2) 


d 


(15.3) 

(15.4) 

(15.5) 


To  prove  (15.1),  first  multiply  (12.9)  by  xv  and  differentiate  to  get 


Use  the  fact  that  T(n+  1+p)  =  ( n+p)T(n+p ),  and  cancel  the  factors  2  and  ( n+p ) 
to  get 


Divide  by  xv  and  compare  with  (12.9);  this  gives 


since  this  series  is  just  (12.9)  with  p  replaced  by  p  —  1.  Proofs  of  the  other  relations 
are  outlined  in  Problems  1  to  3. 
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PROBLEMS,  SECTION  15 


1.  Prove  equation  (15.2)  by  a  method  similar  to  the  one  used  above  to  prove  (15.1). 

2.  Solve  equations  (15.1)  and  (15.2)  for  Jp+i(x)  and  Jp-i(x).  Add  and  subtract  these 
two  equations  to  get  (15.3)  and  (15.4). 

3.  Carry  out  the  differentiation  in  equations  (15.1)  and  (15.2)  to  get  (15.5). 

4.  Use  equations  (15.1)  to  (15.5)  to  do  Problems  12.2  to  12.6. 

5.  Using  equations  (15.4)  and  (15.5),  show  that  Jo(x)  =  J2(x)  at  every  maximum  or 
minimum  of  Ji(x),  and  Jq(x)  =  —  J^x)  =  J[(x)  at  every  positive  zero  of  Ji(x). 
Computer  plot  Jo(x),  Ji(x),  and  Jz(x)  on  the  same  axes,  and  verify  that  these 
results  are  true. 


6.  As  in  Problem  5,  show  that  Jp_i(x)  =  Jp+i(x)  at  every  maximum  or  minimum  of 
JP(x),  and  Jp-i(x)  =  —JP+ i(x)  =  Jp(x)  at  every  positive  zero  of  Jp(x).  Computer 
plot,  say,  J2,  J3,  and  J4  on  the  same  axes,  (or  any  other  set  of  three  consecutive  J’s 
or  three  consecutive  N’s)  and  check  to  see  that  the  results  are  true. 

7.  (a)  Using  (15.2),  show  that 

x)  dx  =  —  Jo(*)|^°  =  I- 


(b)  Use  L23  of  the  Laplace  Transform  Table  (page  469)  to 
show  that  J0°°  J0(f)  dt  =  1.  (Also  see  Problem  23.29.) 


8.  From  equation  (15.4),  show  that 

poo  poo 

/  J\{x)dx=  /  Jz(x)dx- 

J  0  Jo 


poo 

/  J2n+l(x) 
JO 


dx, 


and 


poo  po 

/  Jo  Or)  dx  = 

Jo  Jo 


/oo 

J2n(x)  dx. 

Then,  by  Problem  7,  show  that 

O 

Jn(x)  dx  —  1  for  all  integral  n. 


f 

Jo 


9.  Use  L23  and  L32  of  the  Laplace  Transform  Table  (page  469)  to 
evaluate  J0°°  tJo(2t)e~t  dt. 


16.  DIFFERENTIAL  EQUATIONS  WITH  BESSEL  FUNCTION  SOLUTIONS 

Many  differential  equations  occur  in  practice  that  are  not  of  the  standard  form 
(12.1)  but  whose  solutions  can  be  written  in  terms  of  Bessel  functions.  It  can  be 
shown  (Problem  13)  that  the  differential  equation 


(16.1) 


„  1  —  2a  , 

y  + - y 


{bcxc~lY  + 


2  2  21 

or  —  pzcz 


y  =  0 


has  the  solution 


(16.2) 


y  =  xaZp(bxc), 
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where  Z  stands  for  J  or  N  or  any  linear  combination  of  them,  and  a,  b,  c,  p  are 
constants.  To  see  how  to  use  this,  let  us  “solve”  the  differential  equation 

(16.3)  y"  +  9  xy  =  0. 

If  (16.3)  is  of  the  type  (16.1),  then  we  must  have 

1  —  2a  =  0,  (be)2  =9,  2(c  —  1)  =  1,  a2-p2c2  =  0. 

From  these  equations  we  find 

1  O  1  fl  1 

a  =  2 ,  c  =  |,  b  =  2,  pm  —  =  — 

c  o 

Then  the  solution  of  (16.3)  is 

(16.4)  y  =  x1'2Z1/3(2x3'2). 

This  means  that  the  general  solution  of  (16.3)  is 

y  =  x1/2[A/1/3(  2x3/2)  +  BN1/3(2x3/2)], 
where  A  and  B  are  arbitrary  constants. 

It  is  useful  to  write  the  differential  equation  whose  solutions  are  Jp(Kx)  and 
Np(Kx),  where  K  is  a  constant.  We  substitute  Kx  for  x  in  (12.2).  Then  x(dy/dx ) 
becomes  Kx[dy/d(Kx)]  =  x(dy/dx)  and  similarly,  x(xy')'  is  unchanged.  Thus  the 
only  change  in  (12.2)  is  to  replace  x2  —  p2  by  K2 x2  —  p2  and  we  have: 

(16.5)  x(xy')'  +  (K2x2  —  p2)y  =  0  has  solutions  Jp(Kx)  and  Np(Kx). 


PROBLEMS,  SECTION  16 

Find  the  solutions  of  the  following  differential  equations  in  terms  of  Bessel  functions  by 
using  equations  (16.1)  and  (16.2). 


1. 

Equation  (11.2) 

2. 

y"  +  Ax2y  =  0 

3. 

xy"  +  2  y  +  Ay  =  0 

4. 

3xi /"  +  2  y'  +  12  y  =  0 

5. 

y"  -  -y  +  (4+  -4)  y  =  o 

X  \  xz  J 

6. 

Axy"  +  y  =  0 

7. 

xy"  +  3  y  +  x3y  =  0 

8. 

y  +xy  =  0 

9. 

3xy"  +  y  +  12  y  =  0 

10. 

xv"  —  y'  +  9  x2  y  =  0 

11. 

xy"  +  5  y  +  xy  =  0 

12. 

4  xy"  +  2  y'  +  y  =  0 

13.  Verify  by  direct  substitution  that  the  text  solution  of  equation  (16.3)  and  your 
solutions  in  the  problems  above  are  correct.  Also  prove  in  general  that  the  solution 
(16.2)  given  for  (16.1)  is  correct.  Hint:  These  are  exercises  in  partial  differentiation. 
To  verify  the  solution  (16.4)  of  (16.3),  we  would  change  variables  from  x,y  to  say 
z,  u  where 

y  =  x1/2u,  u  =  J1/3(z),  z  =  2x3/2, 
and  show  that  if  x,y  satisfy  (16.3),  then  u,  z  satisfy  (12.1),  that  is, 
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Use  (16.5)  to  write  the  solutions  of  the  following  problems.  Remember  that 
x(xy'Y  =  x2y"  +  xy' . 

14.  x2  y "  +  xy'  +  (4a:2  —  9  )y  =  0 

16.  x2  y"  +  xy'  +  (16a;2  —  l)y  =  0 

17.  OTHER  KINDS  OF  BESSEL  FUNCTIONS 

We  have  discussed  Jp(x)  and  Np(x)  which  are  called  Bessel  functions  of  the  first 
and  second  kinds ,  respectively.  Since  Bessel’s  equation  is  of  second  order,  there 
are,  of  course,  only  two  independent  solutions.  However,  there  are  a  number  of 
related  functions  which  are  also  called  Bessel  functions.  Here  again  there  is  a  close 
analogy  to  sines  and  cosines.  We  may  think  of  cos  a;  and  sinx  as  the  solutions  of 
y"  +  y  =  0.  But  cosx  ±  i  sinx  are  also  solutions  which  we  usually  write  as  e±lx .  If 
we  replace  x  by  ix,  we  get  the  functions  ex,  e~x ,  coshx,  sinhx,  which  are  solutions 
of  y"  —  y  =  0.  We  list  a  number  of  Bessel  functions  which  are  frequently  used  and 
their  trigonometric  analogues: 


15.  x{xy')'  +  (25x2  —  4)j/  =  0 

17.  xy"  +  y  +  9xy  =  0 


Hankel  Functions  or  Bessel  Functions  of  the  Third  Kind 


H^\x)  =  Jp{x)  +  iNp(x), 
Hp2\x)  =  JP(x)  -  iNp{x). 


(Compare  e±“  =  cosx  ±  i  sinx.) 


Modified  or  Hyperbolic  Bessel  Functions  The  solutions  of 
(17.2)  x2y"  +  xy'  —  (x2  +  p2)y  =  0 

are,  by  (16.1)  Zp(ix).  (Compare  this  with  the  standard  Bessel  equation  and  by 
analogy  consider  the  relation  between  y"  +  y  =  0  and  y"  —  y  =  0.)  The  two 
independent  solutions  of  (17.2)  which  are  ordinarily  used  are 


(17.3) 


kp(x)  =  ^ip+1H^\ix). 


These  should  be  compared  with  sinhx  =  — isin(ix)  and  coshx  =  cos(ix);  because 
of  the  analogy,  I  and  K  are  called  hyperbolic  Bessel  functions.  The  i  factors  are 
adjusted  to  make  I  and  K  real  for  real  x. 
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Spherical  Bessel  Functions  If  p  =  (2 n  +  l)/2  =  n  +  n  an  integer,  then 
Jp(x)  and  Np(x)  are  called  Bessel  functions  of  half-odd  integral  order;  they  can  be 
expressed  in  terms  of  sin  x,  cos  x,  and  powers  of  x.  The  spherical  Bessel  functions 
are  closely  related  to  them  as  you  can  see  from  the  formulas  (17.4)  below.  Spherical 
Bessel  functions  arise  in  a  variety  of  vibration  problems  especially  when  spheri¬ 
cal  coordinates  are  used.  We  define  the  spherical  Bessel  functions  j„(x),  yn(x), 
hn\x),  hn\x),  for  n  =  0,1,2,  and  state  their  values  in  terms  of  elementary 
functions  (see  Problems  2  and  3).  For  the  use  of  these  functions,  see  Chapter  13, 
Problems  7.15,  7.16,  7.19,  and  10.20. 


The  Kelvin  Functions  A  standard  method  of  solving  vibration  problems  is  to  as¬ 
sume  a  solution  involving  eZLUt ;  the  resulting  equation  may  contain  imaginary  terms. 
As  an  example,  the  following  equation  arises  in  the  problem  of  the  distribution  of 
alternating  current  in  wires  (skin  effect)  (Relton,  p.  177): 

(17.5)  y"  +  -y'  -  iy  =  0. 

X 

The  solution  of  this  equation  is  (Problem  8a) 

(17.6)  y=Z0(i3/2x). 

This  is  complex,  and  it  is  customary  to  separate  it  into  its  real  and  imaginary  parts, 
called  (for  Z  =  J)  ber  and  bei;  these  stand  for  Bessel-real  and  Bessel-imaginary. 
We  define  the  ber,  bei,  ker,  kei  functions  by 


(17.7) 


Jo  (i 3^2x)  =  ber  x  +  i  bei  x, 
Koi^^x)  =  ker  x  +  i  keia;. 


There  are  also  similar  functions  for  n  /  0.  These  functions  occur  in  problems  in 
heat  flow  and  in  the  theory  of  viscous  fluids,  as  well  in  electrical  engineering. 


The  Airy  Functions  The  Airy  differential  equation  is 

(17.8)  y"  -xy  =  0. 

By  Section  16,  the  solutions  are  (Problem  8b) 

(17.9)  v^i/3(I^3/2), 
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so  by  (17.3)  they  can  be  written  in  terms  of  /1/3  and  K 1/3.  The  Airy  functions  are 
defined  as 

Ai(a;)  =  ^\/firi/3(^3/2)! 

(17.10)  *  V  A 

Bi(®)  = 

For  negative  x,  Ai  and  Bi  can  be  expressed  in  terms  of  J\i 3  and  N1/3,  or  the  Hankel 
functions  (17.1)  of  order  1/3.  Airy  functions  are  of  use  in  electrodynamics  and 
quantum  mechanics. 


PROBLEMS,  SECTION  17 


1.  Write  the  solutions  of  Problem  16.1  as  spherical  Bessel  functions  using  the  definitions 

(17.4)  of  jn{ x)  and  yn{x)  in  terms  of  J(2n+ i)/2(®)  and  Y^n+1-,/2(x).  Then,  using 

(17.4) ,  obtain  the  solutions  in  terms  of  sin®  and  cos®.  Compare  with  the  answers 
in  equation  (11.6)  and  Problem  11.1. 

2.  From  Problem  12.9,  Ji/2(®)  =  1/2/7 r®  sin®.  Use  (15.2)  to  obtain  J3/2(®)  and 
Js/2(x)-  Substitute  your  results  for  the  J’s  into  (17.4)  to  verify  the  formulas  stated 
f°r  jo,  j  1,  and  j2  in  terms  of  sin®  and  cos®. 

3.  From  Problems  13.3  and  13.5,  hi/2(®)  =  —  \/2/7r®  cos®.  As  in  Problem  2,  obtain 
y3/2  and  V5 /2  and  verify  the  formulas  (17.4)  for  7/0,  j/i,  and  y2  in  terms  of  sin®  and 
cos®. 

4.  Using  (17.3)  and  the  results  stated  in  Problems  2  and  3  for  J1/ 2  and  Y\/2  (=  Ad//, 
show  that 


-fya  (®) 


and  K1/2{x) 


e 


— X 


5.  Show  from  (17.4)  that  h^\x)  =  —ixn  ( - — 

\  x  ax 

6.  Using  (16.1)  and  (17.4)  show  that  the  spherical  Bessel  functions  satisfy  the  differ¬ 
ential  equation 

x2  y "  +  2  xy  +  [®2  —  n(n  +  1  )\y  =  0. 

7.  (a)  Solve  the  differential  equation  xy "  =  y  using  (16.1),  and  then  express  the 

answer  in  terms  of  a  function  Iv  by  (17.3). 

(b)  As  in  (a),  find  a  solution  of  of  y"  —  xAy  =  0. 

8.  Using  (16.1)  and  (16.2),  verify  that 

(a)  the  solution  of  (17.5)  is  (17.6); 

(b)  the  solution  of  (17.8)  is  (17.9). 

9.  Using  (17.3)  and  (15.1)  to  (15.5),  find  the  recursion  relations  for  IP(x).  In  particular, 
show  that  Iq  =  Ii. 

10.  Computer  plot 

(a)  Io(x),  Ii(x),  /2(®),  from  x  =  0  to  2. 

(b)  Ao(®),  Jvl(®),  A2(®),  from  ®  =  0.1  to  2. 

(c)  Ai(®)  from  ®  =  —10  to  10. 
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(d)  Bi(a;)  from  x  =  —10  to  1. 

11.  From  (17.4),  show  that  h^\ix)  =  —e~x/x. 

Use  the  Section  15  recursion  relations  and  (17.4)  to  obtain  the  following  recursion  relations 
for  spherical  Bessel  functions.  We  have  written  them  for  jn,  but  they  are  valid  for  yn  and 
for  the  /in’s. 

12.  jn-i(x)  +  jn+i(x)  =  (2n  +  l)jn(x)/x  13.  (d/dx)jn(x)  =  nj„(x)/x  -  jn+i(x) 

14.  (. d/dx)jn(x )  =  jn- i(x)  -  (n  +  1  )jn(x)/x 

15.  (d/dx)[xn  +  1jn{x)\=xn  +  1jn- 1  (®)  16.  (d/dx)[x~njn(x)}  =  -X~"jn+l(x) 


18.  THE  LENGTHENING  PENDULUM 

As  an  example  of  the  use  of  Bessel  functions  we  consider  the  following  problem. 
Suppose  that  a  simple  pendulum  (see  Chapter  11,  Section  8)  has  the  length  l  of 
its  string  increased  at  a  steady  rate  (for  example,  a  weight  swaying  as  it  is  lowered 
by  a  crane).  (This  problem  was  considered  as  early  as  1707;  see  L.  LeCornu,  Acta 
Mathematica  19  (1895),  201-249.  Also  see  Relton,  and  Problem  8.)  Find  the 
equation  of  motion  and  the  solution  for  small  oscillations. 

From  Chapter  11,  Section  8,  we  have  the  equation  of  motion 

(18.1)  — (ml29)  +  mgl  sin  9  =  0. 

at 

Let  the  length  of  the  string  at  time  t  be 

(18.2)  l  =  Iq  +  v  t, 


and  change  from  t  to  l  as  the  independent  variable.  For  small  oscillations,  we  may 
replace  sinf?  by  0.  Then  (18.1)  becomes  (Problem  1): 


(18.3) 


,#0 

'  dl2 


}d.e 

)j~di 


9 


9  =  0. 


(This  equation  could  also  describe  the  damped  vibration  of  a  variable  mass,  or  an 
RLC  circuit  with  variable  L.) 

We  solve  (18.3)  by  comparing  it  with  the  standard  equation  (16.1)  to  get  (Prob¬ 
lem  2) 

(18.4)  6  =  Z-V2Zl(Mi/2)  where  b  =  2 g1/2/v. 


To  simplify  the  notation,  let 

(18.5)  u  =  bl1'2  =  {2g1l2/v)l1/2. 
The  general  solution  of  (18.3)  is  then 

(18.6)  9  =  Au~1J1(u)  + Bu^N^u). 

We  can  find  d9/du  from  (18.6)  using  (15.2): 

d9 
du 


(18.7) 


[. Au  1J2{u)  +  Bu  17V2(m)]. 
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The  constants  A  and  B  must  be  found  from  the  starting  conditions  just  as  they 
are  for  the  ordinary  simple  pendulum  with  constant  l.  For  example,  in  the  ordinary 
case,  if  0  =  0o  and  0  =  0  at  t  =  0,  then  the  general  solution  9  =  Acosut  +  B  sin uit 
becomes  just  9  =  9q  cos  cot.  For  the  lengthening  pendulum,  let’s  take  the  same 
simple  initial  conditions,  namely  9  =  9q  and  9  =  0  at  t  =  0.  For  these  initial 
conditions,  we  find  (after  some  calculations — see  Problems  3  to  6) 

2  2 

(18.8)  A  = - ^-90N2(uq)1  B  =  — ^  90J2(uq). 

The  solution  has  a  particularly  simple  form  if  we  adjust  the  constants  v  and  Iq  so 
that 


(18.9)  u0  =  2  (gl0)1/2/v  is  a  zero  of  J2  (u)  ■ 

Then  B  =  0  and  the  second  term  of  (18.6)  is  zero,  so  we  have 

(18.10)  9  =  Au-1  Ji(it)  =  Cr1/2Ji(&Z1'/2), 


where  (Problem  7) 


(18.11) 


2<71/2  mo 


C  = 


9 oh 


o‘o 


1/2 


Jl(wo) 


For  this  simple  case,  9  is  a  multiple  of  J2  (u)  (Problem  8);  thus  0  =  0  corresponds 
to  zeros  of  Ji(it)  and  0=0  corresponds  to  zeros  of  J2{u).  A  “quarter”  period 
corresponds  to  the  time  from  0  =  0  to  0  =  0,  or  0  =  0  to  0  =  0.  These  quarter 
periods  can  be  found  from  the  zeros  of  J\(u)  and  J2(u)  (Problem  8). 


PROBLEMS,  SECTION  18 

1.  Verify  equation  (18.3).  Hint:  From  equation  (18.2),  dl  =  vdt,  so 

d  d 

dt  dl 

2.  Solve  equation  (18.3)  to  get  equation  (18.4). 

3.  Prove 

2 

Jp(x)JLv(x)  —  J-p(x)J'(x)  = - sinp-7r 

TVX 

as  follows:  Write  Bessel’s  equation  (12.1)  with  y  =  Jv  and  with  y  —  J~p\  multiply 
the  Jp  equation  by  J-v  and  the  J_p  equation  by  Jv  and  subtract  to  get 

■^[x(Jpf_p  -  J-pJp)}  =  0. 

Then  JPJLP  —  J~pJp  =  c/x.  To  find  c,  use  equation  (12.9)  for  each  of  the  four 
functions  and  pick  out  the  1/x  terms  in  the  products.  Then  use  equation  (5.4)  of 
Chapter  11. 

4.  Using  equation  (13.3)  and  Problem  3,  show  that 


Jp(x)Np(x)  -  Jp(x)Np(x) 


Jp(x^ J — p(x)  Jp(x)J — p(x) 


J2_ 

nx 


sin  pn 
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5.  Use  the  recursion  relations  of  Section  15  (for  N’s  as  well  as  for  J’s)  and  Problem  4 
to  show  that 

2 

Jn(x)Nn+1(x)  -  Jn+i(x)Nn(x)  m—  —  . 

7VX 

Hint:  Do  it  first  for  n  =  0;  then  use  the  result  in  proving  the  n  =  1  case,  and  so  on. 

6.  For  the  initial  conditions  9  =  8q,  9  =  0,  show  that  the  constants  A  and  B  in 
equations  (18.6)  and  (18.7)  are  as  given  in  (18.8).  Hints:  Show  that  d9/du  =  0  if 
9  =  0.  In  equations  (18.6)  and  (18.7),  set  9  =  9q  and  d9/du  =  0  when  u  =  uo  and 
solve  for  A  and  B.  Then  use  the  formula  in  Problem  5  to  simplify  your  results  to 
get  equation  (18.8). 

7.  Verify  the  values  of  b  and  C  given  in  equation  (18.11).  Note  that  C  can  be  found  in 
two  ways:  (1)  in  equation  (18.10),  u  =  bl 1^2,  so  Au -1  =  (A/&)1-1/2,  C  =  A/b.  Use 
Problem  5  to  simplify  this.  (2)  Set  6  =  8o,  u  =  uo,  l  =  l o  in  equation  (18.10)  and 
solve  for  C. 

8.  Find 

^  d8  d9  du  dl 

dt  du  dl  dt 

either  from  equations  (18.10)  and  (15.2)  or  from  equation  (18.7)  with  B  =  0.  Thus 
show  that  9  =  0  when  Ji(u)  =  0  and  9  =  0  when  J2(u)  =  0.  Show  that  the 
successive  (variable)  quarter  periods  of  the  lengthening  pendulum  are  (v/4g)(r2—rf) 
or  (v/Ag)(r\  —  r^),  where  r i  and  T2  are  successive  zeros  of  J\  and  J2.  Use  a  computer 
or  tables  to  find  the  needed  zeros  and  calculate  several  quarter  periods  (as  multiples 
of  v/(4 g)).  Observe  that  an  inward  swing  takes  longer  than  either  the  preceding  or 
the  following  outward  swing.  [This  result  is  proved  by  LI.  G.  Chambers,  Proceedings 
of  the  Edinburgh  Mathematical  Society  (2)  12,  17-18  (I960).] 

9.  Consider  the  “shortening  pendulum”  problem.  Follow  the  method  in  the  text  but 
with  l  =  lo  —  vt.  Does  the  9  amplitude  of  the  vibration  increase  or  decrease  as  the 
pendulum  shortens?  Restate  the  result  of  Problem  8  about  quarter  periods  for  this 
case. 

10.  The  differential  equation  for  transverse  vibrations  of  a  string  whose  density  increases 
linearly  from  one  end  to  the  other  is  y"  +  (Ax  +  B)y  =  0,  where  A  and  B  are 
constants.  Find  the  general  solution  of  this  equation  in  terms  of  Bessel  functions. 
Hint:  Make  the  change  of  variable  Ax  +  B  =  Au. 

11.  A  straight  wire  clamped  vertically  at  its  lower  end  stands  vertically  if  it  is  short,  but 
bends  under  its  own  weight  if  it  is  long.  It  can  be  shown  that  the  greatest  length 
for  vertical  equilibrium  is  l,  where  kl3^2  is  the  first  zero  of  J_  1/3  and 


r  =  radius  of  the  wire,  p  =  linear  density,  g  =  acceleration  of  gravity,  Y  =  Young’s 
modulus.  Find  l  for  a  steel  wire  of  radius  1  mm;  for  a  lead  wire  of  the  same  radius. 
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19.  ORTHOGONALITY  OF  BESSEL  FUNCTIONS 

You  may  expect  here  that  we  are  going  to  prove  that  two  Jp’s  for  different  p  values 
are  orthogonal.  However,  this  is  not  what  we  are  going  to  do — as  a  matter  of  fact 
it  isn’t  true!  To  see  what  we  are  going  to  prove,  look  at  the  following  comparison 
between  Bessel  functions  and  sines  and  cosines. 


Two  functions:  sin  a;  and  cos  ax 

Consider  just  sin  ax 

At  the  zeros  of  sin  x,  namely, 
(19.1)  x  =  rnr,  sina:  =  0. 

At  x  =  1,  sin  nnx  =  0. 

The  differential  equation  satisfied 
by  y  =  sin  nirx  is 
y"  +  ( mr)2y  =  0. 


Two  functions  for  each  p: 

Jp(x)  and  Np(x). 

Consider  just  Jp(x)  for  one 
value  of  p. 

At  the  zeros  of  Jp(x),  say 
x  =  a,  P,  ■  ■  ■ ,  Jp(x)  =  0. 

At  x  =  1,  Jp(ax)  =  0, 

Jp{f3x)  =  0,  •  •  •. 

The  differential  equation  satisfied 
by  y  =  Jp(ax)  is  [see  (16.5)] 
x(x :y')'  +  ( a2x 2  —  p2)y  =  0. 


(In  comparing  the  differential  equations  remember  that  p  is  a  fixed 
constant.  The  correspondence  is  between  the  zeros  of  sin  a?,  namely 
nir,  and  the  zeros  of  Jp{x)1  namely,  a,/?,  etc.) 

We  have  proved  (Chapter  7):  We  shall  prove: 

rl  r  1 


/  sin  mrx  sin  m7ra;  dx  =  0  /  xJp(ax)Jp(/3x)dx  =  0 

Jo  Jo 


for  n  ^  m. 


for  a//l. 


By  (16.5),  the  differential  equation  satisfied  by  Jp(ax)  is 


(19.2) 


x(xy')'  +  ( a2x 2  —  p2)y  =  0 


and  the  differential  equation  satisfied  by  Jp((3x)  is 
(19.3)  x(xy')'  +  {(32x2  —  p2)y  =  0. 


Let  us  for  simplicity  call  Jp(a x)  =  u  and  Jp{f3x)  =  v:  then  (19.2)  and  (19.3)  become 

x{xu’Y  +  (a2x2  -  p2)u  =  0, 
x(x :v’Y  +  (/32  x2  —  p2)v  =  0. 


We  are  going  to  use  equations  (19.4)  to  prove  the  last  equation  in  (19.1)  by  a 
method  parallel  to  that  used  in  proving  the  orthogonality  of  Legendre  polynomials 
(Section  7).  Multiply  the  first  equation  of  (19.4)  by  v,  the  second  by  u,  subtract 
the  two  equations  and  cancel  an  x  to  get 

(19.5)  v(xu'Y  —  u(xv'Y  +  (a2  —  p2)xuv  =  0. 

The  first  two  terms  of  (19.5)  are  equal  to 


(19.6) 


—  (vxu'  —  uxv'). 
dx 
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Using  (19.6)  and  integrating  (19.5),  we  get 


(19.7) 


(■ vxu ' 


uxv') 


1 

xuv  dx  =  0. 


At  the  lower  limit  the  integrated  term  is  zero  because  x  =  0  and  u,  v ,  u' ,  v'  are 
finite.  To  evaluate  the  integrated  term  at  the  upper  limit,  recall  that  u  =  Jp(ax), 
v  =  Jp(/3x)'  then  at  x  =  1,  u  =  Jp(a)  =  0,  v  =  Jp(/3)  =  0  since  a  and  (3  are  zeros 
of  Jp.  The  integrated  term  is  therefore  zero  at  the  upper  limit  also.  Thus  (19.7) 
becomes 

(19.8)  (a2  —  (32)  f  xuvdx  =  0 

Jo 

or 

(19.9)  (a2—/?2)  f  xJp(ax)  Jp(/3x)  dx  =  0. 

Jo 

If  a  ^  (3,  that  is,  if  a  and  (3  are  different  zeros  of  Jp ,  the  integral  must  be  zero. 
If  a  =  (3,  the  integral  is  not  zero;  it  can  be  evaluated,  but  we  shall  just  state  the 
answer  (see  Problem  1): 


(19.10)  /  xJp(ax)Jp(/3x)  dx 


0  if  a  ^  (3, 

=  \Jp- i(a)  =  yp2{a)  if  a  =  /3, 


where  a  and  (3  are  zeros  of  Jp(x). 


[You  can  see  that  the  three  answers  for  the  case  a=  (3  are  equal  by  equations  (15.3) 
to  (15.5),  remembering  that  a  is  a  zero  of  Jp.] 

We  can  state  (19.10)  in  words  in  two  different  ways;  if  an,  n  =  1,  2, 3,  •  •  • ,  are 
the  zeros  of  Jp(x),  then  we  say  either  that 

(a)  the  functions  y/xJp(anx)  are  orthogonal  on  (0,1); 
or  that 

(b)  the  functions  Jp(anx)  are  orthogonal  on  (0,1)  with  respect  to  the  weight 
function  x. 

You  may  meet  other  sets  of  functions  which  are  orthogonal  with  respect  to  a 
weight  function.  (See,  for  example,  Section  22.)  In  general,  we  say  that  yn{x)  is  a 
set  of  orthogonal  functions  on  (xi,X2)  with  respect  to  the  weight  function  w(x)  if 


yn(x)ym(x)w(x)  dx  =  0 


for  n  ^  m. 


The  fact  that  the  Bessel  functions  Jp(anx)  obey  (19.10)  makes  it  possible  to 
expand  a  given  function  in  a  series  of  Bessel  functions  much  as  we  expand  functions 
in  Fourier  series  and  Legendre  series.  We  shall  do  this  later  (Chapter  13)  when  we 
need  it  in  a  physical  example. 

Just  as  we  generalized  Fourier  series  to  an  interval  (0,  l),  here  we  can  generalize 
(19.10)  to  an  interval  (0,a).  In  (19.10),  let  x  =  r/a.  Then  the  limits  are  x  =  r/a  = 
0  to  1,  that  is,  r  =  0  to  a.  The  integral  in  (19.10)  becomes 


(■ r/a)Jp(ar/a)Jp((3r/a )  d(r/a)  = 


rJp(ar /a)Jp(/3r / a)  dr. 
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Thus  we  have 


(19.11)  f  rJp(ar/a)Jp(/3r/a)  dr 
Jo 


—  S  „2 


2  Jp+  l(a)  — 


-4-  M)  = 


a  if  * 

2  JP 


if  a  ^  P, 
(a)  if  a  =  (3. 


PROBLEMS,  SECTION  19 

1.  Prove  equation  (19.10)  in  the  following  way.  First  note  that  (19.2)  and  (19.3)  and 
therefore  (19.7)  hold  whether  a  and  P  are  zeros  of  JP(x)  or  not.  Let  cr  be  a  zero, 
but  let  /?  be  just  any  number.  From  (19.7)  show  that  then 


l 


xuv  dx  = 


Jp(P)aJ'p(a) 
P2  —  a 2 


Now  let  p  — >  a  and  evaluate  the  indeterminate  form  by  L’Hopital’s  rule  (that  is, 
differentiate  numerator  and  denominator  with  respect  to  P  and  let  p  — >  a).  Hence 
find 

ri 

1  t'2. 


l 


xuv  dx  =  1  Jp  (a) 


for  a  =  P,  that  is,  for  u  =  v  =  Jp(ax)  as  in  (19.10).  Use  equations  (15.3)  to  (15.5) 
to  show  that  the  other  two  expressions  given  in  (19.10)  are  equivalent. 


2.  Given  that 


use  (19.10)  to  evaluate 


J3/2  ^  =  \!-X 


L 


1  /  sir 


dx 


where  a  is  a  root  of  the  equation  tana;  =  x. 


3.  Use  (17.4)  and  (19.10)  to  write  the  orthogonality  condition  and  the  normalization 
integral  for  the  spherical  Bessel  functions  j„( x). 

4.  Define  JP(z)  for  complex  z  by  the  power  series  (12.9)  with  x  replaced  by  z.  (By 
Problem  12.1,  the  series  converges  for  all  z.)  Show  by  (19.10)  that  all  the  zeros  of 
Jp(z )  are  real.  Hint:  Suppose  a  and  P  in  (19.10)  were  a  complex  conjugate  pair; 
show  that  then  the  integrand  would  be  positive  so  the  integral  could  not  be  zero. 

5.  We  obtained  (19.10)  for  Jp(x),  p  >  0.  It  is,  however,  valid  for  p  >  —1,  that  is  for 
Np(x),  0  <  p  <  1.  The  difficulty  in  the  proof  occurs  just  after  (19.7);  we  said  that 
u,v,u',v'  are  finite  at  x  =  0  which  is  not  true  for  Np(x).  However,  the  negative 
powers  of  x  cancel  if  p  <  1.  Show  this  for  p  =  -  by  using  two  terms  of  the  power 
series  (12.9)  or  (13.1)  for  the  function  N1/2(x)  =  — J_ i/2(x)  [see  (13.3)]. 

6.  By  Problem  5,  fg  xNi/2{ax)N1/2(Px)  dx  —  0  if  a  and  P  are  different  zeros  of 
Ni/2(x).  Using  (17.4),  find  IV1/2  ( x )  in  terms  of  cos  x  and  so  find  the  zeros  of  Ni/2(x). 
Show  that  the  functions  cos (n  +  ^)nx  are  an  orthogonal  set  on  (0, 1).  Use  (19.10) 
to  find  the  normalization  constant.  (Compare  Problem  6.8.) 
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20.  APPROXIMATE  FORMULAS  FOR  BESSEL  FUNCTIONS 

There  are  often  cases  in  which  it  is  useful  to  have  an  approximate  formula  giving 
the  behavior  of  a  Bessel  function  when  x  is  near  zero  or  when  x  is  very  large.  We 
list  some  of  these  formulas  here  for  reference.  The  symbol  0(xn )  is  read  “terms  of 
the  order  of  xn  or  less,”  and  means  that  the  error  in  the  given  approximation  is  less 
than  a  constant  times  xn;  thus  0(1)  means  bounded  terms.  Note  that  p  >  0. 

Function  Small  x  Large  x  ( asymptotic  formulas) 


'Jp(x) 


Np(x)  < 


p  =  0 


p  >  0 


H ^  or  (2)(z) 


i>TT)  (I)" +  °(x,+2)  'fE  c“  (*  -  2jLr-*)  +  °(ar3/2) 


—  In  a:  +  0(1) 

7 r 

.  .  „  f  0( xp),  p  <  1 

rw^y+  oUi),  l-i 

I  0(x2  p ),  p  >  1 


7T  V  X  ) 


sm  x  — 


2p+l 


+  0(x~3/2) 


Like  ±iNp(x)  e±*[»-(2p+iW4]  +  o(x~3/2) 

V  TTX 


Ip{x) 

Like  Jp( x) 

Kp(x) 

Like  Np(x ) 

jn(x ) 

rv.fl 

(2n+m+°^ 

Vn{x) 

(2  n  +  IV 

Note:  (2n+l)!!  means  1-3-5-7  •  •  •  (2n+l)  =  - ; — -.  See  Chapter  1,  Section  13C. 

2  "n! 

PROBLEMS,  SECTION  20 


Use  the  table  above  to  evaluate  the  following  limits: 


1. 

lim  Ji{x) /{J2(x)\2 

x — >0 

2.  lim  I3(x)/Is(x)  3. 

x — >oo 

lim  No(x2) /  ln(x) 

x—>0 

4. 

lim iJp(x)/Np(x) 

x — >0 

5.  lim  xIp(x)Kp(x)  6. 

x — >oo 

lim  xj„  (x)y„  (x) 

x — >0 

Use  the  table  above  and  the  definitions  in  Section  17  to  find  approximate  formulas  for 
large  x  for: 

7. 

h£ 1  (x) 

8.  h(2)(x) 

9. 

hi1)  (fa:) 

10.  hlf\ix) 
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To  study  the  approximations  in  the  table,  computer  plot  on  the  same  axes  the  given 
function  together  with  its  small  x  approximation  and  its  asymptotic  approximation.  Use 
an  interval  large  enough  to  show  the  asymptotic  approximation  agreeing  with  the  function 
for  large  x.  If  the  small  x  approximation  is  not  clear,  plot  it  alone  with  the  function  over 
a  small  interval. 

11.  Ji(x)  12.  J2(x)  13.  Jsix)  14.  N2  (2) 

15.  N3(x)  16.  j\(x)  17.  j2(x)  18.  t/2(®) 

19.  Computer  plot  on  the  same  axes  several  Ip(x)  functions  together  with  their  com¬ 
mon  asymptotic  approximation.  Then  computer  plot  each  function  with  its  small  x 
approximation. 

20.  As  in  Problem  19,  study  the  Kp(x)  functions.  It  is  interesting  to  note  (see  Prob¬ 
lem  17.4)  that  H’i/2(x)  is  equal  to  the  asymptotic  approximation. 

21.  SERIES  SOLUTIONS;  FUCHS'S  THEOREM 

We  have  discussed  two  examples  of  differential  equations  solvable  by  the  Frobenius 
method  (Legendre  and  Bessel  equations) .  There  are  many  other  “named”  equations 
and  the  corresponding  “named”  functions  which  are  their  solutions.  (See  a  few 
more  examples  in  Section  22.)  All  of  them  have  much  in  common  with  our  two 
examples  and  you  should  not  hesitate  to  look  them  up  and  use  them  without  a 
formal  introduction  when  you  run  into  them  on  your  computer  or  in  a  text  or 
reference  book.  You  may  discover  any  or  all  of  the  following  things  about  such  a 
new  (to  you)  set  of  functions:  that  they  are  the  set  of  solutions  of  a  differential 
equation  with  one  or  more  parameters  (like  the  p  in  Bessel’s  equation);  that  the 
values  of  the  functions,  their  derivatives,  their  zeros,  and  many  formulas  involving 
them  are  available  in  references  (tables  and  computer);  that  they  have  orthogonality 
properties,  perhaps  with  respect  to  a  weight  function,  and  consequently  (suitably 
restricted)  functions  can  be  expanded  in  series  of  them;  that  there  is  a  generating 
function  for  the  set  of  functions;  that  there  are  physical  problems  whose  solutions 
involve  the  functions,  often  in  the  solution  of  a  partial  differential  equation;  etc. 

Now  you  may  wonder  whether  all  differential  equations  can  be  solved  by  the 
Frobenius  method.  A  general  theorem  due  to  Fuchs  tells  when  this  method  will 
work;  we  shall  state  it  for  second-order  differential  equations  which  are  the  most 
important  ones  in  applications.  Write  the  differential  equation  as 


(21.1) 


y"  +  f{x)y'  +  g(x)y  =  0. 


If  xf(x)  and  x1 2g(x)  are  expandable  in  convergent  power  series  anXn,  we  say 

that  the  differential  equation  (21.1)  is  regular  (or  has  a  nonessential  singularity)  at 
the  origin.  Let  us  call  these  the  Fuchsian  conditions.  Fuchs’s  theorem  says  that 
these  conditions  are  necessary  and  sufficient  for  the  general  solution  of  (21.1)  to 
consist  of  either 

(1)  two  Frobenius  series,  or 

(2)  one  solution  Si  (2)  which  is  a  Frobenius  series,  and  a  second  solution  which 
is  Si  (2)  In  2  +  S2  (2)  where  S 2(2)  is  another  Frobenius  series. 
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Case  (2)  occurs  only  when  the  roots  of  the  indicial  equation  are  equal  or  differ 
by  an  integer,  and  not  always  then.  [See,  for  example,  equation  (11.2)  and  Problems 
11.1  to  11.4,  and  11.7  to  11.9.]  Note  the  necessary  condition:  If  the  Fuchsian  condi¬ 
tions  are  not  met,  we  cannot  find  the  general  solution  by  the  method  of  generalized 
power  series  (see  Problems  11  to  13).  However,  the  equations  most  commonly  found 
in  applications  do  meet  these  conditions. 

If  the  first  Frobenius  series  S\{x)  happens  to  break  off,  or  you  can  easily  write 
its  sum  in  closed  form,  then  the  method  of  “reduction  of  order”  [Chapter  8,  Sec¬ 
tion  7(e)]  gives  a  way  of  finding  the  second  solution  without  using  infinite  series 
(see  Problems  1  to  4).  However,  note  that  our  main  interest  in  series  solutions  is 
not  to  solve  differential  equations  this  way  in  general,  but  to  study  sets  of  functions 
(like  Legendre  polynomials  and  Bessel  functions)  which  are  solutions  of  differential 
equations  that  occur  in  applications.  So  the  purpose  in  using  series  to  solve  a  few 
simple  differential  equations  (for  which  there  are  easier  methods)  is  to  learn  how  and 
when  the  series  method  works — to  watch  Fuchs’s  theorem  in  action  (see  problems). 


PROBLEMS,  SECTION  21 

For  Problems  1  to  4,  find  one  (simple)  solution  of  each  differential  equation  by  series,  and 
then  find  the  second  solution  by  the  “reduction  of  order”  method,  Chapter  8,  Section  7(e). 

1.  (x2  +  l)y"  -  xy'  +  y  —  0  2.  x2 y"  +  (x  +  l)y  -  y  =  0 

3.  x2  y"  +  x2  y'  —  2y  =  0  4.  (x  —  l)y"  —  xy'  +  y  =  0 

Solve  the  differential  equations  in  Problems  5  to  10  by  the  Frobenius  method;  observe 
that  you  get  only  one  solution.  (Note,  also,  that  the  two  values  of  s  are  equal  or  differ 
by  an  integer,  and  in  the  latter  case  the  larger  s  gives  the  one  solution.)  Show  that  the 
conditions  of  Fuchs’s  theorem  are  satisfied.  Knowing  that  the  second  solution  is  In*  times 
the  solution  you  have,  plus  another  Frobenius  series,  find  the  second  solution. 


5. 

*(*  +  l)y"  —  (*  —  1  )y'  +  y  =  0 

6. 

4x2(x  +  1  )y"  —  4  x2  y'  +  (3*  +  1  )y  =  0 

7. 

x(x  —  1  )2  y"  —  2y  —  0 

8. 

xy"  +  xy'  —  2y  —  0 

9. 

x2  y"  +  ( x 2  —  3  x)y'  +  (4  —  2  x)y  =  0 

10. 

x2(x—  l)y"  —  x(5x— 4)y  +(9x  —  6)y  =  0 

11.  For  the  differential  equation  in  Problem  2,  verify  that  it  does  not  satisfy  the  Fuchsian 
conditions,  and  that  your  second  solution  cannot  be  expanded  in  a  Frobenius  series. 

12.  Verify  that  the  differential  equation  xAy"  +  y  =  0  is  not  Fuchsian;  that  it  has  the 
two  independent  solutions  xsin(l/*)  and  *cos(l/*);  and  that  these  solutions  are 
not  expandable  in  Frobenius  series. 

13.  Verify  that  the  the  differential  equation  in  Problem  11.13  is  not  Fuchsian.  Solve 
it  by  separation  of  variables  to  find  the  obvious  solution  y  =  const,  and  a  second 
solution  in  the  form  of  an  integral.  Show  that  the  second  solution  is  not  expandable 
in  a  Frobenius  series. 
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22.  HERMITE  FUNCTIONS;  LAGUERRE  FUNCTIONS;  LADDER  OPERATORS 

In  this  section,  we  shall  outline  some  of  the  important  formulas  for  two  more  sets  of 
named  functions.  Both  Hermite  and  Laguerre  functions  are  of  interest  in  quantum 
mechanics  where  they  arise  as  solutions  of  eigenvalue  problems  (see  Problem  27, 
and  Chapter  13,  Problems  7.20  to  7.22).  We  shall  also  consider  an  operator  method 
which  is  a  useful  alternative  to  series  solution  for  some  differential  equations. 

Hermite  Functions  The  differential  equation  for  Hermite  functions  is 


(22.1)  y"  -  x2y„  =  -(2n  +  l)yn,  n  =  0,1,2,---  . 


This  equation  can  be  solved  by  power  series  (Problem  5),  but  here  we  shall  consider 
an  operator  method  which  is  particularly  efficient  for  this  equation.  Let’s  use  the 
operator  D  to  mean  d/dx\  then  (see  Problem  5.31  of  Chapter  8) 


(22.2) 


(D  —  x)(D  +  x)y 


W  +  xy )  =  y" 


{D  +  x)(D  -  x)y  =  y"  -  x2y  -  y. 


x2y  +  y, 


and  similarly 


Using  (22.2),  we  can  write  (22.1)  in  two  ways: 


(22.3)  (D  -  x)(D  +  x)yn  =  -2nyn  or 

(22.4)  (D  +  x)(D  -  x)yn  =  -2(n+  l)yn 


Now  let  us  operate  on  (22.3)  with  (D  +  x)  and  on  (22.4)  with  (D  —  x),  and  change 
n  to  m  for  later  convenience: 


(22.5)  (D  +  x)(D  -  x)[(D  +  x)ym)  =  -2 m[(D  +  x)ym], 

(22.6)  (D  -  x)(D  +  x)[(D  -  x)ym\  =  -2(m+l)[(D  -  x)ym}. 

(The  brackets  have  been  inserted  to  clarify  our  next  step.) 

Now  compare  (22.3)  and  (22.6);  if  yn  =  [(D  —  x)ym]  and  n  =  m+1,  the  equations 
are  identical.  We  write 


(22.7)  ym+ 1  =  {D-  x)ym 

and  we  see  that,  given  a  solution  ym  of  (22.1)  for  one  value  of  n,  namely  n  =  to,  we 
can  find  a  solution  when  n  =  m  +  1  by  applying  the  “raising  operator”  (D  —  x)  to 
ym.  Similarly,  from  (22.4)  and  (22.5),  we  find  that  (Problem  1) 

(22.8)  ym- 1  =  (D  +  x)ym. 

We  may  call  (D  +  x)  a  “lowering  operator” ;  these  operators  are  called  creation  and 
annihilation  operators  in  quantum  theory.  Operators  of  this  kind  (see  Problems  29, 
30,  and  23.27  for  other  examples)  are  called  ladder  operators  since,  like  the  rungs 
of  a  ladder,  they  enable  us  to  go  up  or  down  in  a  set  of  functions. 

Now  if  n  =  0,  we  find  a  solution  of  (22.3)  [and  therefore  of  (22.1)]  by  requiring 


(22.9) 


(D  +  x)y0  =  0. 
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We  solve  this  equation  (Problem  2)  to  get 
(22.10)  y0  =  e~x2/2. 

Then,  by  (22.7),  yn  =  (D  —  x)ne~x  /2.  These  are  the  Hermite  functions;  they  can 
be  written  in  the  simpler  form  yn  =  ex  /2  (dn / dxn)e~x  (Problem  3): 


yn  =  (D  —  x)ne~x2/2  or 

(22.11)  Hermite  functions 

yn  =  ex  / 2{dnldxn)e~x  . 


If  we  multiply  (22.11)  by  (— l)nex  /2,  we  obtain  the  Hermite  polynomials ;  the  fol¬ 
lowing  equation  may  be  called  a  Rodrigues  formula  for  them: 


,  2  dn  2 

(22.12) 

Hn(x)  =  (— 1)  e  —e~x  . 

Hermite  polynomials 

We  find  (Problems  4  and  5): 


(22.13)  H0(x)  =  l,  H1(x)  =  2x,  H2{x)  =  4x2  -  2 . 


The  Hermite  polynomials  satisfy  the  differential  equation  (Problem  6): 


(22.14)  y"  —  2 xy'  +  2 ny  =  0.  Hermite  equation 


Using  the  differential  equation,  we  can  prove  (Problem  7)  that  the  Hermite  poly¬ 
nomials  are  orthogonal  on  (—00,00)  with  respect  to  the  weight  function  e~x  .  The 
normalization  integral  can  be  evaluated  (Problem  10).  Thus  we  have: 


(22.15) 


e  *  Hn(x)Hm(x)  dx 


0,  n/m, 

V/7t2  nn\  n  =  m. 


Section  22 


Hermite  Functions;  Laguerre  Functions;  Ladder  Operators  609 


The  generating  function  for  the  Hermite  polynomials  is  (Problem  8): 


OVJ  7  Yl 

(22.16)  $(x,  h)  =  e2xh~h2  =  Y^Hn{x)—. 

Z '  n! 

n= 0 


The  generating  function  can  be  used  to  derive  recursion  relations  for  the  Hermite 
polynomials.  Two  useful  relations  are  (Problem  9): 


(22.17) 


(a)  H'n(x)  =  2nHn_1(x), 

(b)  Hn+1(x)  =  2 xHn(x)  -  2nHn_i(x). 


Laguerre  functions  The  Laguerre  polynomials  may  be  defined  by  a  Rodrigues 
formula: 


1  dn 

(22.18)  Ln(x)  =  -ex—{xne-x). 

nl  axn 

Carrying  out  the  differentiation  (Problem  12),  we  find: 


(22.19)  Ln(x)  =  1  —  nx  + 


i(n  —  1)  x2  n(n  —  l)(n  —  2)  a;3 


2!  2! 

=  E(->rC)S 


m= 0 


3!  3! 

Laguerre  polynomials 


{-l)nxn 


The  symbol  (^)  is  a  binomial  coefficient  (see  Chapter  1,  Section  13C).  Some  au¬ 
thors  omit  the  1/n!  in  (22.18);  then  the  series  in  (22.19)  is  multiplied  by  n!.  It 
is  convenient  to  note  that  the  series  in  (22.19)  is  like  the  binomial  expansion  of 
(1  —  x)n  except  that  each  power  of  x,  say  xm,  is  divided  by  an  extra  to!.  We  find 
(Problem  13): 


(22.20) 

L  0(x)  =  1,  L1(x)  =  l-x, 

£2(2)  =  1  —  2x  +  x2 12. 

The  Laguerre  polynomials  are  solutions  of  the  differential  equation  (Problems  14 
and  15): 

(22.21) 

xy"  +  (1  —  x)y'  +  ny  =  0, 

y  =  Ln{  x). 
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Using  the  differential  equation,  we  can  prove  (Problem  16)  that  the  Laguerre 
polynomials  are  orthogonal  on  (0,oo)  with  respect  to  the  weight  function  e~x.  In 
fact,  we  find  (Problem  19)  that,  with  the  definition  (22.18),  the  functions  e~x^2Ln(x) 
are  an  orthonormal  set  on  (0,oo). 


(22.22) 


Ln(x)Lk(x)  dx 


Snk 


0,  n  ^  k, 
1,  n  =  k. 


The  generating  function  for  the  Laguerre  polynomials  is  (Problem  17): 

(22.23)  *(*,  h)  =  €~X^hh)  =  £  Ln{x)hn . 

n—0 

Using  it,  we  can  derive  recursion  relations;  some  examples  are  (Problem  18): 


(a)  L'n+X(x)  -  L'n(x)  +  Ln(x)  =  0, 

(22.24)  (b)  (n  +  l)Ln+i(x)  -  (2n  +  1  —  x)Ln(x)  +  nLn_i(x)  =  0, 

(c)  xL'n(x)  -  nLn{x)  +  nLn_i{x)  =  0. 


Warning:  These  formulas  will  be  different  if  the  factor  1  /n!  is  omitted  in  the  defi¬ 
nition  (22.18),  so  check  the  notation  of  any  reference  you  are  using  (computer,  text, 
tables) . 

Derivatives  of  the  Laguerre  polynomials  are  called  associated  Laguerre  polyno¬ 
mials;  they  may  be  found  by  differentiating  (22.18),  (22.19),  or  (22.20)  (Problem  20). 
We  define: 


(22.25)  Lk(x)  =  {-l)k—^Ln+k{x). 


Associated  Laguerre  polynomials 


Warning:  The  notation  in  various  references  may  be  confusing;  some  authors  define 
Lk(x)  as  (dk /dxk)Ln(x)  [compare  our  definition  in  (22.25)],  so  read  carefully  the 
definition  in  the  reference  you  are  using.  For  example,  associated  Laguerre  poly¬ 
nomials  are  used  in  the  theory  of  the  hydrogen  atom  in  quantum  mechanics.  In 
various  references  you  will  find  them  denoted  by  L^3_1( x)  and  by  L^1  (x):  both 
these  notations  mean  (except  for  sign)  ( d2l+1/dx2l+1)Ln+i(x ).  (See  Problems  26 
to  28.) 

By  differentiating  the  Laguerre  equation  (22.21),  we  find  the  differential  equation 
satisfied  by  the  polynomials  Lk(x)  (Problem  21): 


(22.26)  xy"  +  (k  +  1  —  x)y'  +  ny  =  0,  y  =  L^(  x). 
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The  polynomials  L^(x)  may  also  be  found  from  the  Rodrigues  formula  (Prob¬ 
lem  22): 

„-kpx  Jra 

(22.27)  Lkn(x)  =  (xn+ke~x). 

v  ’  nK  ’  n\  dxnK  ’ 

Note  that  in  this  form  k  does  not  have  to  be  an  integer;  in  fact,  (22.27)  is  used  to 
define  L^{x)  for  any  k  >  —  1. 

Recursion  relations  for  the  polynomials  L^{x)  may  be  found  by  differentiating 
recursion  relations  for  the  Laguerre  polynomials.  Some  examples  are  (Problem  23): 


(a)  (n  +  1  )Lk+1(x)  -  (2 n  +  k  +  1  -  x)L^(x)  +  (n  +  k)Lk_1(x)  =  0, 

(b)  x-^-Lk(x)  -  nLk{x)  +  (n  +  A)L^_1(a:)  =  0. 


Using  the  differential  equation  (22.26),  we  can  show  (Problem  24)  that  the  func¬ 
tions  Lk(x)  are  orthogonal  on  (0,oo)  with  respect  to  the  weight  function  xke~x. 
We  find  (Problem  25): 


(22.29) 


are  xLk(x)L^n(x)  dx 


0, 

(n+/c)! 


m  yF  n, 
m  =  n. 


The  normalization  integral  needed  in  the  theory  of  the  hydrogen  atom  is  not  (22.29), 
but  instead  has  the  factor  xk+1 .  We  find  (see  Problems  25  to  27): 

(22.30)  rxk+1e-x[Lkn{x)\2dx={2n+k+l)(^P^. 

J  o  n- 

Again  warning:  The  formulas  (22.28),  (22.29),  and  (22.30)  will  be  different  in  ref¬ 
erences  which  omit  the  l/n\  in  (22.18)  and/or  use  a  different  definition  of  L^{x)  in 
(22.25). 

PROBLEMS,  SECTION  22 

1.  Verify  equations  (22.2),  (22.3),  (22.4),  and  (22.8). 

2.  Solve  (22.9)  to  get  (22.10).  If  needed,  see  Chapter  8,  Section  2. 

3.  Show  that  ex  / 2D[e~x  f(x)]  =  (D  —  x)f{x).  Now  set 

f(x)  =  (D  -  x)g(x)  =  e*  /2D[e~x  /2g(x)} 

to  get 

(D  -  x)2g{ x)  =  ex  /2D2[e~x  /2g(x)}. 

Continue  this  process  to  show  that 

(. D  -  x)nF(x)  =  ex2/2Dn[e~x2/2F(x)\ 

for  any  F{x).  Then  let  F{x)  =  e~x  to  get  (22.11). 
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4. 


5. 


6. 

7. 


8. 


9. 


10. 


11. 


12. 

13. 

14. 


Using  (22.12)  find  the  Hermite  polynomials  given  in  (22.13).  Then  use  (22.17b)  to 
find  Hs(x)  and  H^x). 

By  power  series,  solve  the  Hermite  differential  equation 

y"  -  2xi {  +  2 py  =  0 


You  should  find  an  a o  series  and  an  ai  series  as  for  the  Legendre  equation  in  Section  2. 
Show  that  the  a o  series  terminates  when  p  is  an  even  integer,  and  the  ai  series 
terminates  when  p  is  an  odd  integer.  Thus  for  each  integer  n,  the  differential  equation 
(22.14)  has  one  polynomial  solution  of  degree  n.  These  polynomials  with  ao  or  ai 
chosen  so  that  the  highest  order  term  is  ( 2x)n  are  the  Hermite  polynomials.  Find 
Ho(x),  H i(x),  and  H2(x).  Observe  that  you  have  solved  an  eigenvalue  problem 
(see  end  of  Section  2),  namely  to  find  values  of  p  for  which  the  given  differential 
equation  has  polynomial  solutions,  and  then  to  find  the  corresponding  solutions 
(eigenfunctions) . 

Substitute  yn  =  e~x  ^ 2Hn(x )  into  (22.1)  to  show  that  the  differential  equation 
satisfied  by  Hn(x)  is  (22.14). 

Prove  that  the  functions  Hn{x)  are  orthogonal  on  (—00,00)  with  respect  to  the 
_  2 

weight  function  e  x  .  Hint:  Write  the  differential  equation  (22.14)  as 

ex  v)  +  2nV  =  °. 

and  see  Sections  7  and  19. 


In  the  generating  function  (22.16),  expand  the  exponential  in  a  power  series  and 
collect  powers  of  h  to  obtain  the  first  few  Hermite  polynomials.  Verify  the  identity 


a2$  3$  <9$  „ 

dx 2  2X  dx  +  2k  dh  ~  °' 


Substitute  the  series  in  (22.16)  into  this  identity  to  prove  that  the  functions  Hn(x) 
in  (22.16)  satisfy  equation  (22.14).  Verify  that  the  highest  term  in  Hn(x)  in  (22.16) 
is  ( 2x)n .  [You  have  then  proved  that  the  functions  called  Hn(x)  are  really  the 
Hermite  polynomials  since,  by  Problem  5,  (22.14)  has  just  one  polynomial  solution 
of  degree  n.[ 

Use  the  generating  function  to  prove  the  recursion  relations  in  (22.17).  Hint  for  (a): 
Differentiate  (22.16)  with  respect  to  x  and  equate  coefficients  of  hn.  Hint  for  (b): 
Differentiate  (22.16)  with  respect  to  h  and  equate  coefficients  of  hn . 

Evaluate  the  normalization  integral  in  (22.15).  Hint:  Use  (22.12)  for  one  of  the 
Hn(x)  factors,  integrate  by  parts,  and  use  (22.17a);  then  use  your  result  repeatedly. 

Show  that  we  have  solved  the  following  eigenvalue  problem  (see  Problem  5  and  end 
of  Section  2):  Given  the  differential  equation  y"  +  (e  —  x2)y  =  0  [compare  equation 
(22.1)].  find  the  possible  values  of  e  (eigenvalues)  such  that  the  solutions  y(x)  of  the 
given  differential  equation  tend  to  zero  as  x  — *  ±00;  for  these  values  of  e,  find  the 
eigenfunctions  y(x).  What  is  e,  and  what  are  the  eigenfunctions? 

Using  Leibniz’  rule  (Section  3),  carry  out  the  differentiation  in  (22.18)  to  obtain 
(22.19). 

Using  (22.19),  verify  (22.20)  and  also  find  L^{x)  and  L±(x). 

Show  that  y  =  Ln(x)  given  in  (22.18)  satisfies  (22.21).  Hint:  Follow  a  method  similar 
to  that  used  in  Section  4.  Let  v  =  xne~x  and  show  that  xv'  =  (n  —  x)v.  Differentiate 
this  last  equation  (n  +  1)  times  by  Leibniz’  rule,  and  use  dnv/dx n  =  n\  e~xLn(x) 
from  (22.18). 
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15.  Solve  the  Laguerre  differential  equation 

xy"  +  (1  -  x)y  +py  =  0 


by  power  series.  Show  that  the  a o  series  terminates  if  p  is  an  integer.  Thus  for  each 
integer  n,  the  differential  equation  (22.21)  has  one  solution  which  is  a  polynomial 
of  degree  n.  These  polynomials  with  ao  =  1  are  the  Laguerre  polynomials  Ln(x). 
Find  Lo(x),  L\{x),  L2(x),  and  Ls(x).  (This  is  an  eigenvalue  problem — compare 
Problem  5  and  Section  2.) 

16.  Prove  that  the  functions  1„(: r)  are  orthogonal  on  (0,  oo)  with  respect  to  the  weight 
function  e~x .  Hint:  Write  the  differential  equation  (22.21)  as 

ex-^-{xe~xy')  +  ny  =  0, 


and  see  Sections  7  and  19. 


17.  In  (22.23),  write  the  series  for  the  exponential  and  collect  powers  of  h  to  verify  the 
first  few  terms  of  the  series.  Verify  the  identity 


<92d> 


+  (1 


fl-F  ,  <9$  „ 

X)d^  +  hdh=°- 


Substitute  the  series  in  (22.23)  into  this  identity  to  show  that  the  functions  Ln(x) 
in  (22.23)  satisfy  Laguerre’s  equation  (22.21).  Verify  that  the  constant  term  is  1  by 
putting  x  =  0  in  the  generating  function.  [You  have  then  proved  that  the  functions 
called  Ln(x)  in  (22.23)  are  really  Laguerre  polynomials  since,  by  Problem  15,  (22.21) 
has  just  one  polynomial  solution  of  degree  n.\ 

18.  Verify  the  recursion  relations  (22.24)  as  follows: 

(a)  Differentiate  (22.23)  with  respect  to  x  to  get  hd>  —  (h  —  l)(9d?/cte);  equate 
coefficients  of  hn+1. 

(b)  Differentiate  (22.23)  with  respect  to  h  to  get  (1  —  h)2(d$/dh)  =  (1  —  h  —  *)'!>; 
equate  coefficients  of  hn. 

(c)  Combine  (a)  and  (b)  to  get  x(d&/dx)  +  hd?  —  h(  1  —  h)d$/dh  =  0.  Substitute 
the  series  for  d?  and  equate  coefficients  of  hn. 

19.  Evaluate  the  normalization  integral  in  (22.22).  Hint:  Use  (22.18)  for  one  of  the 
Ln(x)  factors;  integrate  by  parts  n  times.  Use  (22.19)  to  find  (dn / dxn)Ln(x)  and 
Chapter  11,  Section  3,  to  evaluate  J0°°  xne~x  dx. 

20.  Using  (22.25),  (22.20),  and  Problem  13,  find  L^(x)  for  n  =  0, 1,2,  and  k  =  1,2. 

21.  Verify  that  the  polynomials  L^{ x)  in  (22.25)  satisfy  (22.26).  Hint:  Write  (22.21) 
with  n  replaced  by  n  +  k  and  differentiate  k  times  by  Leibniz’  rule. 

22.  Verify  that  the  polynomials  given  by  (22.27)  are  the  same  as  the  L^(x)  defined  in 

(22.25) .  Hints:  Show  that  the  functions  in  (22.27)  satisfy  (22.26)  as  follows.  Let 
v  =  e~xxn+k  and  show  that  xv'  =  (n+k  —  x)v.  (Compare  Problem  14.)  Differentiate 
this  equation  n  +  1  times  by  Leibniz’  rule,  and  use  dnv/dxn  =  n!  e~x xk L^{x)  from 
(22.27).  Also  show  that  the  coefficient  of  xn  in  both  (22.25)  and  (22.27)  is  (— l)n/n! 
[Thus,  assuming  that  (22.26)  for  one  k  has  only  one  polynomial  solution  of  degree 
n  (which  can  be  shown  by  series  solution),  (22.27)  gives  the  same  polynomials  as 

(22.25)  for  integral  k.] 
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23.  Verify  the  recursion  relation  relations  (22.28)  as  follows: 

(a)  In  (22.24b),  replace  n  by  n  +  k  and  differentiate  k  times  by  Leibniz’  rule;  in 
(22.24a),  replace  k  by  n  +  k  and  differentiate  k  —  1  times.  Subtract  k  times 
the  second  result  from  the  first. 

(b)  In  (22.24c),  replace  n  by  n  +  k  and  differentiate  k  times. 

24.  Show  that  the  functions  Lt/(x)  are  orthogonal  on  (0,  oo)  with  respect  to  the  weight 
function  xke~x.  Hint:  Write  the  differential  equation  (22.26)  as 

— k  x  d  .  fc+l  — x  /\  . 

x  e— (ar  e  y  )  +  ny  =  0 
ax 

and  see  Sections  7  and  19. 


25. 


26. 


27. 


28. 

29. 


30. 


Evaluate  the  normalization  integrals  (22.29)  and  (22.30).  Hints:  Use  (22.27)  for 
one  of  the  L„(x)  factors  in  (22.29);  integrate  by  parts  n  times.  Use  (22.25)  and 
then  (22.19)  to  evaluate  dn /dxn L^{x).  Compare  Problem  19.  To  evaluate  (22.30), 
multiply  (22.28a)  by  xke~x  and  integrate;  use  (22.29)  both  for  m  ^  n  and  m  =  n. 

Solve  the  following  eigenvalue  problem  (see  end  of  Section  2  and  Problem  11):  Given 
the  differential  equation 


V  + 


Ki  +  m 

x2  ) 


y  =  o 


where  l  is  an  integer  >  0,  find  values  of  A  such  that  y  — >  0  as  x  — >  oo,  and  find 
the  corresponding  eigenfunctions.  Hint:  let  y  =  xl+1e~x^2v(x),  and  show  that  v(x) 
satisfies  the  differential  equation 


xv"  +  (21  +  2  —  x)v'  +  (A  —  l  —  l)v  =  0. 


Compare  (22.26)  to  show  that  if  A  is  an  integer  >  l,  there  is  a  polynomial  solution 
v{x)  =  Lf+^^x). 

The  functions  which  are  of  interest  in  the  theory  of  the  hydrogen  atom  are 

fn(x)  =  Q 

where  n  and  l  are  integers  with  0  <  l  <  n  —  1.  (Note  that  here  k  =  21  +  1,  and  we 
have  replaced  n  by  n  —  l  —  1;  in  this  problem  L\  ,  say,  means  l  =  l,n  =  4.)  For 
1  =  1,  show  that 

f2(x)  =  x2e~x/i,  f3(x)  =  x2e~x/6  (4—  ,  f4(x)  =  x2e~x/8  ^10  -  ^ j  . 

Hint:  Find  the  polynomials  Lq,Li,L2  as  in  Problem  20  (with  k  =  3)  and  then 
replace  x  by  x/n.  The  functions  fn(x)  are  very  different  from  those  in  (22.29) 
since  x/n  changes  from  one  function  to  the  next.  However,  it  can  be  shown  (Prob¬ 
lem  23.25)  that  for  one  fixed  l,  the  set  of  functions  fn(x),n  >  l  + 1,  is  an  orthogonal 
set  on  (0,oo).  Verify  this  for  these  three  functions.  Hint:  The  integrals  are  P 
functions — see  Chapter  11,  Section  3. 

Repeat  Problem  27  for  l  =  0,  n  =  1,  2,  3. 

Show  that  Rp  =  £  —  D  and  Lp  =  £  +  D  where  D  =  d/dx,  are  raising  and  lowering 
operators  for  Bessel  functions,  that  is,  show  that  RpJp(x)  =  Jp+i(x)  and  LpJp(x)  = 
Jp-i(x).  Hint:  Use  equations  (15.5).  Note  that  these  operators  depend  on  p  as 
well  as  x,  so  they  are  not  as  simple  as  the  Hermite  function  raising  and  lowering 
operators  (22.7)  and  (22.8).  If  you  want  to  operate,  say,  on  Jp+i,  you  must  change  p 
in  R  or  L  to  p+ 1,  etc.  Making  this  adjustment,  show  that  the  equations  LRJP  =  Jp 
and  RLJP  =  Jp  both  give  Bessel’s  equation. 

Find  raising  and  lowering  operators  (see  Problem  29)  for  spherical  Bessel  functions. 
Hint:  See  problems  17.15  and  17.16. 
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23.  MISCELLANEOUS  PROBLEMS 

1.  Use  the  generating  function  (5.1)  to  find  the  normalizing  factor  for  Legendre  poly¬ 
nomials.  Hint:  Square  equation  (5.2)  with  <£>  as  in  (5.1)  and  integrate  from  —1  to  1. 
Expand  the  integral  of  $2  (after  integrating)  in  powers  of  h  and  equate  coefficients. 

2.  Use  the  generating  function  to  show  that 

P2n+ 1(0)  =  0  and  P2„('  0)  =  =  (~ir(fV  1)!!; 

Hints:  Expand  (5.1)  for  x  =  0  in  powers  of  h  and  equate  coefficients  of  powers  of  h 
in  (5.2).  See  Chapter  1,  Section  13C. 

3.  Use  (5.8e)  to  show  that  f3  Pi(x)  dx  =  [P;_i(0)  —  Pi+i(0)]/(2l  +  1).  Then  use  the 
result  of  Problem  2  and  Chapter  1,  Section  13C  to  show  that 

l  P2n <*) ' dx  =  °’ n  >  0’  and  l  P2-+1  (X)  dx  =  =  (n  +  l)  ' 


4.  Obtain  the  binomial  coefficient  result  in  Problem  3  directly  by  integrating  the  gen¬ 
erating  function  from  0  to  1  and  expanding  the  result  in  powers  of  h.  Equate  the 
coefficients  of  hl  in  the  identity  obtained  by  integrating  (5.2)  from  0  to  1,  and  use 
Chapter  1,  Section  13C. 

5.  Show  that  Yloi.21  +  1  )Pi(x)  =  Pn(x)  +  Pn+i(x).  Hint:  Use  mathematical  induction 
as  follows: 


(a)  Verify  the  formula  for  n  =  0. 

(b)  Assuming  that  the  formula  is  true  for  l  =  n  —  1,  show  [using  (5.8e)[  that  it  is 
true  for  l  =  n. 


6.  Using  (10.6),  (5.8),  and  Problem  2,  evaluate  PL+ i(0). 

7.  Show  that,  for  l  >  0,  fb  Pi{x)  dx  =  0  if  a  and  b  are  any  two  maximum  or  minimum 
points  of  Pi{x),  or  ±1.  Hint:  Integrate  (7.2). 

8.  Show  that  (21  +  l)(x2  —  l)P[(x)  =  1(1  +  l)[P;+i(x)  —  P;_i(x)[.  Hint:  Integrate  (5. 8e) 
and  (7.2)  and  combine  the  results.  Thus  show  that  P;+i(x)  =  Pi-i(x)  at  maximum 
and  minimum  points  of  Pi[x)  and  at  ±1. 

9.  Evaluate  f^1xPi(x)Pn(x)  dx,  n  <  l.  Hint:  Write  (5.8a)  with  l  replaced  by  l  +  1, 
multiply  by  Pn{ x)  and  integrate. 

Use  the  recursion  relations  of  Section  15  (and,  as  needed,  Sections  12,  13,  17,  and  20)  to 
verify  the  formulas  in  Problems  10  to  14. 


10. 


11. 


12. 

13. 

14. 


f 

f 


x  pJp+i(x)dx  = 


X  njn+l(x)  dx  = 


2pr(i+P)' 

1 

(2n+  1)!!' 


— Ap(a;)  =  -\[KP-  ^(x)  +  Kp+1(x)\. 

=  \n3n- i(x)~  (n  +  l)jn+i(x)]/(2n  +  l). 


f 


x3  Jo  (x)  dx  =  x3  J\(x)  —  2x2  Ji  (x) . 
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15.  Use  the  result  of  Problem  18.4  and  equations  (17.4)  to  show  that 

3n(x)y'n(x)  -yn(x)j'n{x)  = 

xz 

Then  use  Problem  17.14  (for  y’s  as  well  as  fs)  to  show  that 
jn(x)y„~i(x)  -yn(x)jn- l(x)  = 

xz 


16.  Use  (15.2)  repeatedly  to  show  that 


Mx) = x  (4 £) Jo(x)’  Mx) = x2  (4£)  Mx) ■ 

and,  in  general, 

M*)  =  xn(-~)  Mx). 

17.  Let  a  be  the  first  positive  zero  of  J\{x)  and  let  /3n  be  the  zeros  of  Jo(x).  In  terms 
of  a  and  /3n,  find  the  values  of  x  at  the  maximum  and  minimum  points  of  the 
function  y  =  xJi(ax).  By  computer  or  tables,  find  the  needed  zeros  and  compute 
the  coordinates  of  the  maximum  and  minimum  points  on  the  graph  of  y( x)  for  x 
between  0  and  5.  Computer  plot  y  from  x  =  0  to  5  and  compare  your  computed 
maximum  and  minimum  points  with  what  the  plot  shows. 

18.  (a)  Make  the  change  of  variables  z  =  ex  in  the  differential  equation  y"  +  e2xy  =  0, 

and  so  find  a  solution  of  the  differential  equation  in  terms  of  Bessel  functions. 

(b)  Make  the  change  of  variables  z  =  ex  in  the  differential  equation  xy"  —  y'  + 
x3(ex  —  p2)y  =  0,  and  solve  the  equation  in  terms  of  Bessel  functions. 

19.  (a)  The  generating  function  for  Bessel  functions  of  integral  order  p  =  n  is 


OO 

$(£ ,/i)  =  e(1/2M'1-'l-1)  =  J2  hUMx)- 

n=  —  oo 


By  expanding  the  exponential  in  powers  of  x(h  —  h  x)  show  that  the  n  —  0 
term  is  Jo(x)  as  claimed. 


(b)  Show  that 


2d2$  9$  2 

I  TT  +  X  $  - 

ox2  ox 


$  =  0. 


Use  this  result  and  4?(x,  h)  =  '^2^L_oc  hn  Jn(x )  to  show  that  the  functions  Jn( x) 
satisfy  Bessel’s  equation.  By  considering  the  terms  in  hn  in  the  expansion  of 
e(1/2Mh-h  )  jn  part  (a),  show  that  the  coefficient  of  hn  is  a  series  starting 

with  the  term  (l/n!)(a;/2)n.  (You  have  then  proved  that  the  functions  called 
Jn(x)  in  the  expansion  of  <3>(a3,  /z)  are  indeed  the  Bessel  functions  of  integral 
order  previously  defined  by  (12.9)  and  (13.1)  with  p  =  n.) 
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20.  In  the  generating  function  equation  of  Problem  19,  put  h  =  el6  and  separate  real 
and  imaginary  parts  to  derive  the  equations 


cos(:rsin0)  =  Jo{x)  +  2J^{x)  cos  29  +  2J^(x)  cos  4 6  +  ■■■ 


=  Jo(x)  +  2  E^  J2n(x)  cos  2nd, 

n= 1 

sin(a;sin0)  =  2[J\(x)  sin0  +  J?,{x)  sin 30  H - ] 

OO 

=  ’E*  n+i  (*)  sin(2n  +  1)0. 

n= 0 


21. 


22. 


These  are  Fourier  series  with  Bessel  functions  as  coefficients.  (In  fact  the  Jn’s  for 
integral  n  are  often  called  Bessel  coefficients  because  they  occur  in  many  series  like 
these.)  Use  the  formulas  for  the  coefficients  in  a  Fourier  series  to  find  integrals 
representing  Jn  for  even  n  and  for  odd  n.  Show  that  these  results  can  be  combined 
to  give 

i  r 

Jn(x)  =  —  /  cos(n0  —  *sin0)  dd 
n  Jo 

for  all  integral  n.  These  series  and  integrals  are  of  interest  in  astronomy  and  in  the 
theory  of  frequency  modulated  waves. 


In  the  generating  function  equation,  Problem  19,  put  x  =  iy  and  h  =  —ik  and  show 


that 


e(l/2)i /(fc+fc-1)  = 


E  knI„(y). 


n=  —  oo 


In  the  cos(®sin0)  series  of  Problem  20,  let  0  =  0,  and  then  let  0  =  7r/2,  and  add  the 
results  to  show  that  (recall  Problem  13.2) 


23. 


24. 


25. 


E  Jin{x)  =  i(l  +COS2:). 

n=  —  oo 

Solve  by  power  series  (1  —  x2)y"  —  xy'  +  n2y  =  0.  The  polynomial  solutions  of 
this  equation  with  coefficients  determined  to  make  y(l)  =  1  are  called  Chebyshev 
polynomials  Tn(x).  Find  To,  Ti,  and  T2. 


(a)  The  following  differential  equation  is  often  called  a  Sturm-Liouville  equation: 

-^~[A(x)y']  +  [X  B{x)  +  C(x)]y  =  0 

(A  is  a  constant  parameter).  This  equation  includes  many  of  the  differential 
equations  of  mathematical  physics  as  special  cases.  Show  that  the  following 
equations  can  be  written  in  the  Sturm-Liouville  form:  the  Legendre  equation 
(7.2);  Bessel’s  equation  (19.2)  for  a  fixed  p,  that  is,  with  the  parameter  A 
corresponding  to  a2;  the  simple  harmonic  motion  equation  y"  =  —n2y;  the 
Hermite  equation  (22.14);  the  Laguerre  equations  (22.21)  and  (22.26). 

(b)  By  following  the  methods  of  the  orthogonality  proofs  in  Sections  7  and  19,  show 
that  if  y  1  and  2/2  are  two  solutions  of  the  Sturm-Liouville  equation  (correspond¬ 
ing  to  the  two  values  Ai  and  A2  of  the  parameter  A),  then  y\  and  1/2  are  orthogo¬ 
nal  on  (a,  b)  with  respect  to  the  weight  function  B(x )  if  A(x)(y[y2  —  J/2J/1 )  I  a.  =  0. 


In  Problem  22.26,  replace  x  by  x/n  in  the  y  differential  equation  and  set  A  =  n 
to  show  that  the  differential  equation  satisfied  by  the  functions  fn(x)  in  Prob¬ 
lem  22.27  is 


"  1 

y  + 


1 

4n2 


1(1  +  1) 


y  =  0. 


Hence  show  by  Problem  24  that  the  functions  fn(x)  are  orthogonal  on  (0,  00). 
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26.  Verify  Bauer’s  formula  elxw  =  ^2^(2l  +  l)ilji(x)Pi(w)  as  follows.  Write  the  integral 
for  the  coefficients  ci  in  the  Legendre  series  for  ezxw  =  ciPi(w).  You  want  to  show 
that  ci(x)  =  (21  +  l)ilji(x).  First  show  that  y  =  ci(x )  satisfies  the  differential 
equation  (Problem  17.6)  for  spherical  Bessel  functions.  Hints:  Differentiate  with 
respect  to  x  under  the  integral  sign  to  hnd  y'  and  y"  \  substitute  into  the  left  side 
of  the  differential  equation.  Now  integrate  by  parts  with  respect  to  w  to  show 
that  the  integrand  is  zero  because  Pi(w)  satisfies  Legendre’s  equation.  Thus  ci(x) 
must  be  a  linear  combination  of  ji(x)  and  ni(x).  Now  consider  the  q( x)  integral 
for  small  x ;  expand  elwx  in  series  and  evaluate  the  lowest  term  (which  is  xl  since 


J_[ !  wnPi(w)  dw  =  0  for  n  <  l).  Compare  with  the  approximate  formulas  for  ji(x) 


and  ni(x )  in  Section  20. 

27.  Show  that  R  =  lx  —  (1  —  x2)D  and  L  =  lx  +  (1  —  x2)D,  where  D  —  d/dx,  are  raising 
and  lowering  operators  for  Legendre  polynomials  [compare  Hermite  functions,  (22.1) 
to  (22.11)  and  Bessel  functions,  Problems  22.29  and  22.30].  More  precisely,  show 
that  RPi-i(x)  =  lPi(x)  and  LPi(x)  =  IPi- i(x).  Hint:  Use  equations  (5.8d)  and 
(5.8f).  Note  that,  unlike  the  raising  and  lowering  operators  for  Hermite  functions, 
here  R  and  L  depend  on  l  as  well  as  x,  so  you  must  be  careful  about  indices.  The  L 
operator  operates  on  Pi,  but  the  R  operator  as  given  operates  on  Pi- 1  to  produce 
IPi.  [If  you  prefer,  you  could  replace  l  by  l  +  1  to  rewrite  R  as  (l  +  l)x  —  (1  —  x2)D; 
then  it  operates  on  Pi  to  produce  ( l  +  l)P;+i.]  Assuming  that  all  Pi(l)  =  1,  solve 
LPq(x)  =  0  to  find  Pq(x )  =  1,  and  then  use  raising  operators  to  hnd  Pi(x)  and 


Pzix). 


28.  Show  that  the  functions  Jo(t)  and  Jo(n  —  t)  are  orthogonal  on  (0,7r).  Hints:  See 
the  Laplace  transform  table  (page  469),  L23  and  L24  with  g  =  h  =  Jq.  What  is  the 
inverse  transform  of  ( p 2  +  a2)-1? 

29.  Show  that  the  Fourier  cosine  transform  (Chapter  7,  Section  12)  of  Jo(x)  is 


Hence  show  that  Jo(x)dx  =  1.  Hints:  Show  that  the  integral  in  Problem  20 


Let  sin#  =  a  to  hnd  Jo  as  a  cosine  transform;  write  the  inverse  transform.  Now  let 
a  =  0. 

30.  Use  the  results  of  Chapter  7,  Problems  12.18  and  13.19  to  evaluate  ^^[^(a)]2  da. 
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Partial  Differential  Equations 


1.  INTRODUCTION 

Many  of  the  problems  of  mathematical  physics  involve  the  solution  of  partial  dif¬ 
ferential  equations.  The  same  partial  differential  equation  may  apply  to  a  variety 
of  physical  problems;  thus  the  mathematical  methods  which  you  will  learn  in  this 
chapter  apply  to  many  more  problems  than  those  we  shall  discuss  in  the  illustrative 
examples.  Let  us  outline  the  partial  differential  equations  we  shall  consider,  and 
the  kinds  of  physical  problems  which  lead  to  each  of  them. 

(1.1)  Laplace’s  equation  V2u  =  0 

The  function  u  may  be  the  gravitational  potential  in  a  region  containing  no  mass, 
the  electrostatic  potential  in  a  charge-free  region,  the  steady-state  temperature  (that 
is,  temperature  not  changing  with  time)  in  a  region  containing  no  sources  of  heat, 
or  the  velocity  potential  for  an  incompressible  fluid  with  no  vortices  and  no  sources 
or  sinks. 


(1.2)  Poisson’s  equation  V2u  =  f(x,y,z ) 


The  function  u  may  represent  the  same  physical  quantities  listed  for  Laplace’s 
equation,  but  in  a  region  containing  mass,  electric  charge,  or  sources  of  heat  or 
fluid,  respectively,  for  the  various  cases.  The  function  f{x ,  y,  z)  is  called  the  source 
density;  for  example,  in  electricity  it  is  proportional  to  the  density  of  the  electric 
charge. 

1  du 

(1.3)  The  diffusion  or  heat  flow  equation  V2u  =  — — 

a 1  at 

Here  u  may  be  the  non-steady-state  temperature  (that  is,  temperature  varying 
with  time)  in  a  region  with  no  heat  sources;  or  it  may  be  the  concentration  of  a 
diffusing  substance  (for  example,  a  chemical,  or  particles  such  as  neutrons).  The 
quantity  a2  is  a  constant  known  as  the  diffusivity. 


(1.4)  Wave  equation 


V2u 


1  d2u 


619 
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Here  u  may  represent  the  displacement  from  equilibrium  of  a  vibrating  string 
or  membrane  or  (in  acoustics)  of  the  vibrating  medium  (gas,  liquid,  or  solid);  in 
electricity  u  may  be  the  current  or  potential  along  a  transmission  line;  or  u  may 
be  a  component  of  E  or  B  in  an  electromagnetic  wave  (light,  radio  waves,  etc.). 
The  quantity  v  is  the  speed  of  propagation  of  the  waves;  for  example,  for  light  in 
a  vacuum  it  is  c,  the  speed  of  light,  and  for  sound  waves  it  is  the  speed  at  which 
sound  travels  in  the  medium  under  consideration.  The  operator  V2  —  is  called 
the  d’Alembertian. 


(1.5)  Helmholtz  equation  V2E  +  k2F  =  0 


As  you  will  see  later,  the  function  F  here  represents  the  space  part  (that  is,  the 
time-independent  part)  of  the  solution  of  either  the  diffusion  or  the  wave  equation. 


(1.6)  Schrodinger  equation 


% 2  Q 

- V2T  +  PT  =  ih— T 


2m 


dt 


This  is  the  wave  equation  of  quantum  mechanics.  In  this  equation,  h  is  Planck’s 
constant  divided  by  27t,  to  is  the  mass  of  a  particle,  i  =  a/ — 1,  and  V  is  the  potential 
energy  of  the  particle.  The  wave  function  T  is  complex,  and  its  absolute  square  is 
proportional  to  the  position  probability  of  the  particle. 

We  shall  be  principally  concerned  with  the  solution  of  these  equations  rather 
than  their  derivation.  If  you  like,  you  could  say  that  it  is  true  experimentally  that 
the  physical  quantities  mentioned  above  satisfy  the  given  equations.  However,  it  is 
also  true  that  the  equations  can  be  derived  from  somewhat  simpler  experimental 
assumptions.  Let  us  indicate  briefly  an  example  of  how  this  can  be  done.  In 
Chapter  6,  Sections  10  and  11,  we  considered  the  flow  of  a  fluid.  We  showed 
(Chapter  6,  Problem  10.15)  that  V  •  v  =  0  for  an  incompressible  fluid  in  a  region 
containing  no  sources  or  sinks.  If  it  is  also  true  that  there  are  no  vortices  (that  is,  the 
flow  is  irrotational),  then  curl  v  =  0,  and  v  can  be  written  as  the  gradient  of  a  scalar 
function:  v  =  V«.  Combining  these  two  equations,  we  have  V  •  Vit  =  V2u  =  0. 
The  function  u  is  called  the  velocity  potential  and  we  see  that  (under  the  given 
conditions)  it  satisfies  Laplace’s  equation  as  we  claimed.  A  few  more  examples  of 
such  derivations  are  outlined  in  the  problems. 

In  the  following  sections,  we  shall  consider  a  number  of  physical  problems  to 
illustrate  the  very  useful  method  of  solving  partial  differential  equations  known  as 
separation  of  variables  (no  relation  to  the  same  term  used  in  ordinary  differential 
equations,  Chapter  8).  In  Sections  2  to  4,  we  consider  problems  in  rectangular 
coordinates  leading  to  Fourier  series  solutions — problems  similar  to  those  solved  by 
Fourier.  In  later  sections,  we  consider  use  of  other  coordinate  systems  (cylindrical, 
spherical)  leading  to  solutions  using  Legendre  or  Bessel  series. 


PROBLEMS,  SECTION  1 

1.  Assume  from  electrostatics  the  equations  V  •  E  =  p/to  and  E  =  —'Vf  (E  =  electric 
field,  p  =  charge  density,  to  =  constant,  f  =  electrostatic  potential).  Show  that  the 
electrostatic  potential  satisfies  Laplace’s  equation  (1.1)  in  a  charge- free  region  and 
satisfies  Poisson’s  equation  (1.2)  in  a  region  of  charge  density  p. 

2.  (a)  Show  that  the  expression  u  =  sin  (x  —  vt)  describing  a  sinusoidal  wave  (see 

Chapter  7,  Figure  2.3),  satisfies  the  wave  equation  (1.4).  Show  that,  in  general, 


Section  2 
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u  =  /( x  —  vt )  and  u  =  f(x  +  vt)  satisfy  the  wave  equation,  where  /  is  any 
function  with  a  second  derivative.  This  is  the  d’Alembert  solution  of  the  wave 
equation.  (See  Chapter  4,  Section  11,  Example  1.)  The  function  /( x  —  vt) 
represents  a  wave  moving  in  the  positive  x  direction  and  f(x  +  vt)  represents 
a  wave  moving  in  the  opposite  direction. 

(b)  Show  that  u(r,t)  =  (1  /r)f(r  —  vt)  and  u(r,t)  =  (1  /r)f(r  +  vt)  satisfy  the  wave 
equation  in  spherical  coordinates.  [Use  the  first  term  of  (7.1)  for  V2u  since 
here  u  is  independent  of  9  and  </>.]  These  functions  represent  spherical  waves 
spreading  out  from  the  origin  or  converging  on  the  origin. 


3.  Assume  from  electrodynamics  the  following  equations  which  are  valid  in  free  space. 
(They  are  called  Maxwell’s  equations.) 


V  -E  =  0 
_  T,  <9B 

V  X  E  =  ~~dt 


V  •  B  =  0 

__  -j-.  1  <9E 

V  x  B  =  —  — 

c2  at 


where  E  and  B  are  the  electric  and  magnetic  fields,  and  c  is  the  speed  of  light  in  a 
vacuum.  From  them  show  that  any  component  of  E  or  B  satisfies  the  wave  equation 
(1.4)  with  v  =  c. 

4.  Obtain  the  heat  flow  equation  (1.3)  as  follows:  The  quantity  of  heat  Q  flowing  across 
a  surface  is  proportional  to  the  normal  component  of  the  (negative)  temperature 
gradient,  (  —  'VT)  -n.  Compare  Chapter  6,  equation  (10.4),  and  apply  the  discussion 
of  flow  of  water  given  there  to  the  flow  of  heat.  Thus  show  that  the  rate  of  gain 
of  heat  per  unit  volume  per  unit  time  is  proportional  to  V  ■  VT.  But  dT/dt  is 
proportional  to  this  gain  in  heat;  thus  show  that  T  satisfies  (1.3). 


2.  LAPLACE'S  EQUATION;  STEADY-STATE 
TEMPERATURE  IN  A  RECTANGULAR  PLATE 

We  want  to  solve  the  following  problem:  A  long  rectangular  metal  plate  has  its  two 
long  sides  and  the  far  end  at  0°  and  the  base  at  100°  (Figure  2.1).  The  width  of 
the  plate  is  10  cm.  Find  the  steady-state  temperature  distribution  inside  the  plate. 
(This  problem  is  mathematically  identical  to  the  problem  of  finding  the  electrostatic 
potential  in  the  region  0  <  a:  <  10,  j/  >  0,  if  the  given  temperatures  are  replaced  by 
potentials — see,  for  example,  Jackson,  3rd  edition,  p.73) 

To  simplify  the  problem,  we  shall  assume  at  first 
that  the  plate  is  so  long  compared  to  its  width  that 
we  may  make  the  mathematical  approximation  that 
it  extends  to  infinity  in  the  y  direction.  It  is  then 
called  a  semi-infinite  plate.  This  is  a  good  approxi¬ 
mation  if  we  are  interested  in  temperatures  not  too 
near  the  far  end. 

The  temperature  T  satisfies  Laplace’s  equation 
inside  the  plate  where  there  are  no  sources  of  heat, 
that  is. 


(2.1) 


V  T  =  0 


or 


d  2T  d2T 
dx2  dy‘ 


=  0 


We  have  written  V2  in  rectangular  coordinates 
because  the  boundary  of  the  plate  is  rectangular 


T=  0' 
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and  we  have  omitted  the  2  term  because  the  plate  is  in  two  dimensions.  To  solve 
this  equation,  we  are  going  to  try  a  solution  of  the  form 

(2.2)  T{x,y)  =  X{x)Y{y), 


where,  as  indicated,  X  is  a  function  only  of  x,  and  Y  is  a  function  only  of  y. 
Immediately  you  may  raise  the  question:  But  how  do  we  know  that  the  solution  is 
of  this  form?  The  answer  is  that  it  is  not!  However,  as  you  will  see,  once  we  have 
solutions  of  the  form  (2.2)  we  can  combine  them  to  get  the  solution  we  want.  [Note 
that  a  sum  of  solutions  of  (2.1)  is  a  solution  of  (2.1).]  Substituting  (2.2)  into  (2.1), 
we  have 


(2.3) 


Y 


d2X 

dx2 


•*S~ 


(Ordinary  instead  of  partial  derivatives  are  now  correct  since  X  depends  only  on  x , 
and  y  depends  only  on  y.)  Divide  (2.3)  by  XY  to  get 


1  d2X  1  d2 
Itf  +  Y~dy* 


The  next  step  is  really  the  key  to  the  process  of  separation  of  variables.  We  are 
going  to  say  that  each  of  the  terms  in  (2.4)  is  a  constant  because  the  first  term  is 
a  function  of  x  alone  and  the  second  term  is  a  function  of  y  alone.  Why  is  this 
correct?  Recall  that  when  we  say  u  =  sinf  is  a  solution  of  it  =  —u,  we  mean  that 
if  we  substitute  u  =  sin  t  into  the  differential  equation,  we  get  an  identity  (it  =  —u 
becomes  —  sin  t  =  —  sin  t),  which  is  true  for  all  values  of  t.  Although  we  speak  of  an 
equation ,  when  we  substitute  the  solution  into  a  differential  equation,  we  have  an 
identity  in  the  independent  variable.  (We  made  use  of  this  fact  in  series  solutions 
of  differential  equations  in  Chapter  12,  Sections  1  and  2.)  In  (2.1)  to  (2.4)  we  have 
two  independent  variables,  x  and  y.  Saying  that  (2.2)  is  a  solution  of  (2.1)  means 
that  (2.4)  is  an  identity  in  the  two  independent  variables  x  and  y  [recall  that  (2.4) 
was  obtained  by  substituting  (2.2)  into  (2.1)].  In  other  words,  if  (2.2)  is  a  solution 
of  (2.1),  then  (2.4)  must  be  true  for  any  and  all  values  of  the  two  independent 
variables  x  and  y.  Since  A  is  a  function  only  of  x  and  Y  of  y,  the  first  term  of 
(2.4)  is  a  function  only  of  x  and  the  second  term  is  a  function  only  of  y.  Suppose 
we  substitute  a  particular  x  into  the  first  term;  that  term  is  then  some  numerical 
constant.  To  have  (2.4)  satisfied,  the  second  term  must  be  minus  the  same  constant. 
While  x  remains  fixed,  let  y  vary  (remember  that  x  and  y  are  independent).  We 
have  said  that  (2.4)  is  an  identity;  it  is  then  true  for  our  fixed  x  and  any  y.  Thus 
the  second  term  remains  constant  as  y  varies.  Similarly,  if  we  fix  y  and  let  x  vary, 
we  see  that  the  first  term  of  (2.4)  is  a  constant.  To  say  this  more  concisely,  the 
equation  /( x)  =  g(y),  with  x  and  y  independent  variables,  is  an  identity  only  if 
both  functions  are  the  same  constant;  this  is  the  basis  of  the  process  of  separation 
of  variables.  From  (2.4)  we  then  write 


(2.5) 


1  d2X 


1  d2Y 


X  dx2  Y  dy2 
X”  =  -k2X 


=  const.  =  —  k2,  k  >  0, 

and  Y"  =  k2Y. 


or 


The  constant  k 2  is  called  the  separation  constant.  The  solutions  of  (2.5)  are 


(2.6) 


X  = 


sin  kx, 
cos  kx, 


Y  = 


zky , 

o~ky 
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and  the  solutions  of  (2.1)  [of  the  form  (2.2)]  are 


(2.7) 


T  =  XY  = 


eky 

e~ky 


sin  kx 
cos  kx 


None  of  the  four  solutions  in  (2.7)  satisfies  the  given  boundary  temperatures.  What 
we  must  do  now  is  to  take  a  combination  of  the  solutions  (2.7),  with  the  constant 
k  properly  selected,  which  will  satisfy  the  given  boundary  conditions.  [Any  linear 
combination  of  solutions  of  (2.1)  is  a  solution  of  (2.1)  because  the  differential  equa¬ 
tion  (2.1)  is  linear ;  see  Chapter  3,  Section  7,  and  Chapter  8,  Sections  1  and  6.]  We 
first  discard  the  solutions  containing  eky  since  we  are  given  T  — >  0  as  y  — >  oo.  (We 
are  assuming  k  >  0;  see  Problem  5.)  Next  we  discard  solutions  containing  cos  fear 
since  T  =  0  when  x  =  0.  This  leaves  us  just  e~ky  sinkx,  but  the  value  of  k  is 
still  to  be  determined.  When  x  =  10,  we  are  to  have  T  =  0;  this  will  be  true  if 
sin  (10/c)  =  0,  that  is,  if  k  =  rwr/lO  for  n  —  1, 2,  •  ■  • .  Thus  for  any  integral  n,  the 
solution 


(2.8) 


T  =  e-n*y/ 1°  si 


mrx 


sin  ■ 


10 


satisfies  the  given  boundary  conditions  on  the  three  T  =  0  sides. 

Finally,  we  must  have  T  =  100  when  y  =  0;  this  condition  is  not  satisfied  by 
(2.8)  for  any  n.  But  a  linear  combination  of  solutions  like  (2.8)  is  a  solution  of  (2.1); 
let  us  try  to  find  such  a  combination  which  does  satisfy  T  =  100  when  y  =  0.  In 
order  to  allow  all  possible  n’s  we  write  an  infinite  series  for  T,  namely 


(2.9) 


-nny/10  • 

16  s  10  ' 


For  y  =  0,  we  must  have  T  =  100;  from  (2.9)  with  y  =  0  we  get 

OO 

(2.10)  Ty= o  -  y  bn  sir 


mrx 

.  sin  — — —  =  100. 


10 


But  this  is  just  the  Fourier  sine  series  (Chapter  7,  Section  9)  for  f(x)  =  100  with 
l  =  10.  We  can  find  the  coefficients  bn,  as  we  did  in  Chapter  7;  we  get 


(2.11)  bn  = 


7  /'/(*)  ain^d*=  ^  [W 


/0 


/0 


mrx  ,  f  — ,  odd  n, 
100  sin  — — -  dx  =  <  n7r 

10  0,  even  n. 


Then  (2.9)  becomes 


(2.12) 


T  =  ±°°  (  e-7ry/10  sin  ^  +  -e~3ny/10  sin  ^ 
7 r  [  10  3  10 


37nr 


Equation  (2.12)  can  be  used  for  computation  if  7ry/10  is  not  too  small  since  then 
the  series  converges  rapidly.  (See  also  Problem  6.)  For  example,  at  x  =  5  (central 
line  of  the  plate)  and  y  =  5,  we  find 


(2.13) 


T  = 


400 


e  ’r/2  sin  ^  +  \e  3w^2sin 


37T 

T 


~  26.1° 
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To  see  how  the  temperature  varies  with  x  and  y  over  a  rectangle,  you  can  computer 
plot  a  3-dimensional  graph  of  several  terms  of  T(x,y)  in  (2.12).  Or  you  can  make 
a  2-dimensional  contour  plot  which  shows  the  isothermals  (curves  of  constant  T). 
If  the  temperature  on  the  bottom  edge  is  any  function  /(x)  instead  of  100°  (with 
the  other  three  sides  at  0°  as  before),  we  can  do  the  problem  by  the  same  method. 
We  have  only  to  expand  the  given  /(x)  in  a  Fourier  sine  series  and  substitute  the 
coefficients  into  (2.9). 

Next,  let  us  consider  a  finite  plate  of  height  30  cm  with  the  top  edge  at  T  =  0°, 
and  other  dimensions  and  temperatures  as  in  Figure  2.1.  We  no  longer  have  any 
reason  to  discard  the  eky  solution  since  y  does  not  become  infinite.  We  now  replace 
e~ky  by  a  linear  combination  ae~ky  +  beky  which  is  zero  when  y  =  30.  The  most 
convenient  way  to  do  this  is  to  use  the  combination 


1  k(30-y) 
2e 


1  -fc(30-y) 
2e 


(2.14) 

(that  is,  let  a  =  ^e30fe  and  b  =  —^e~30k).  Then,  when  y  =  30,  (2.14)  gives 
e°  —  e°  =  0  as  we  wanted.  Now  (2.14)  is  just  sinh  £:(30  —  y)  (see  Chapter  2,  Section 
12),  so  for  the  finite  plate,  we  can  write  the  solution  as  [compare  (2.9)] 


(2.15)  T=E^sinh^(30-y)sin^. 

n—  1 


Each  term  of  this  series  is  zero  on  the  three  T  =  0  sides  of  the  plate.  When  y  =  0, 
we  want  T  =  100: 


(2.16) 


T, 


y= o 


OO 

=  100  =  Bn  sinh  (3rm)  sin 

71=1 


nnx 

IF 


oo 

y  bn  sin 

71=1 


n'KX 

IF 


where  bn  =  sinh  3n7r  or  Bn  =  6ra/sinh3n7r.  We  find  bn ,  solve  for  Bn  and 
substitute  into  (2.15)  to  get  the  temperature  distribution  in  the  finite  plate: 


(2.17) 


odd  7i 


400 

mr  sinh  3n7r 


77/ 7T 

sinh  —  (30  —  y)  sin 


717TX 

FF' 


In  (2.12)  and  (2.17)  we  have  found  functions  T(x,y)  satisfying  both  (2.1)  and 
all  the  given  boundary  conditions.  For  a  bounded  region  with  given  boundary 
temperatures,  it  is  an  experimental  fact  (and  it  can  also  be  shown  mathematically — 
see  Problem  16  and  Chapter  14,  Problem  11.38)  that  there  is  only  one  T(x,y) 
satisfying  Laplace’s  equation  and  the  given  boundary  conditions.  Thus  (2.17)  is  the 
desired  solution  for  the  rectangular  plate.  It  can  also  be  shown  that  there  is  only 
one  solution  for  the  semi-infinite  plate  provided  T  — >  0  at  oo;  thus  (2.12)  is  the 
solution  for  that  case. 

It  may  have  occurred  to  you  to  wonder  why  we  took  the  constant  in  (2.5)  to  be 
—k2  and  what  would  happen  if  we  took  +k2  instead.  As  far  as  getting  solutions  of 
the  differential  equation  is  concerned  it  would  be  perfectly  correct  to  use  +fc2;  we 
would  get  instead  of  (2.7): 

[We  are  assuming  that  k  is  real;  an  imaginary  k  in  (2.18)  would  simply  give  combi¬ 
nations  of  the  solutions  (2.7)  over  again.  Also  see  Problem  5.]  The  solutions  (2.18) 
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would  not  be  of  any  use  for  the  semi-infinite  plate  problem  since  none  of  them  tends 
to  zero  as  y  — >  oo,  and  a  linear  combination  of  ekx  and  e~kx  cannot  be  zero  both  at 
x  =  0  and  at  x  =  10.  However,  if  we  had  considered  a  semi-infinite  plate  with  its 
long  sides  parallel  to  the  x  axis  instead  of  the  y  axis,  and  T  =  100°  along  the  short 
end  on  the  y  axis,  the  solutions  (2.18)  would  have  been  the  ones  needed.  Or,  for 
the  finite  plate,  if  the  100°  side  were  along  the  y  axis,  then  we  would  want  (2.18). 

Finally,  let  us  see  how  to  find  the  temperature  distribution  in  a 
plate  if  two  adjacent  sides  are  held  at  100°  and  the  other  two  at  0° 

(or,  in  general,  if  any  values  are  given  for  the  four  sides).  We  can 
find  the  solution  to  this  problem  by  a  combination  of  the  results 
we  have  already  obtained.  Let  us  call  the  sides  of  the  rectangular 
plate  A,  B ,  C ,  D  (Figure  2.2).  If  sides  A,  B ,  and  C  are  held  at  0°, 
and  D  at  100°,  we  can  find  the  temperature  distribution  by  the 
same  method  we  used  in  finding  (2.17)  if  we  take  the  x  axis  along 
D.  Next  suppose  that  for  the  same  plate  (Figure  2.2)  sides  A,  B, 
and  D  are  held  at  0°  and  C  at  100°.  This  is  the  same  kind  of  problem  over  again, 
but  this  time  we  want  to  use  the  solutions  (2.18).  [Or  to  shorten  the  work,  we  could 
write  the  solution  like  (2.17)  with  the  x  axis  taken  along  C  and  then  interchange 
x  and  y  in  the  result  to  agree  with  Figure  2.2.]  Having  obtained  the  two  solutions 
(one  for  C  at  100°  and  one  for  D  at  100°),  let  us  add  these  two  answers.  The 
result  is  a  solution  of  the  differential  equation  (2.1)  (linearity:  the  sum  of  any  two 
solutions  is  a  solution).  The  temperatures  on  the  boundary  (as  well  as  inside)  are 
the  sums  of  the  temperatures  in  the  two  solutions  we  added,  that  is,  0°  on  A,  0°  on 
B ,  0°  +  100°  on  C,  and  100°  +  0°  on  D.  These  are  the  given  boundary  conditions 
we  wanted  to  satisfy.  Thus  the  sum  of  the  solutions  of  two  simple  problems  gives 
the  answer  to  the  more  complicated  one  (see  Problems  11  to  13). 

Before  solving  more  problems,  let  us  stop  for  a  moment  to  summarize  this  pro¬ 
cess  of  separation  of  variables  which  is  basically  the  same  for  all  the  partial  differ¬ 
ential  equations  we  shall  discuss.  We  first  assume  a  solution  which  is  a  product 
of  functions  of  the  independent  variables  [like  (2.2)],  and  separate  the  partial  dif¬ 
ferential  equation  into  several  ordinary  differential  equations  [like  (2.5)].  We  solve 
these  ordinary  differential  equations;  the  solutions  may  be  exponential  functions, 
trigonometric  functions,  powers  (positive  or  negative),  Bessel  functions,  Legendre 
polynomials,  etc.  Any  linear  combination  of  these  solutions,  with  any  values  of  the 
separation  constants,  is  a  solution  of  the  partial  differential  equation.  The  problem 
is  to  determine  both  the  values  of  the  separation  constants  and  the  correct  linear 
combination  to  fit  the  given  boundary  or  initial  conditions. 

The  problem  of  finding  the  solution  of  a  given  differential  equation  subject  to 
given  boundary  conditions  is  called  a  boundary  value  problem.  Such  problems  often 
lead  to  eigenvalue  problems.  Recall  (Chapter  3,  Section  11,  and  Chapter  12,  end  of 
Section  2)  that  in  an  eigenvalue  (or  characteristic  value)  problem,  there  is  a  param¬ 
eter  whose  values  are  to  be  selected  so  that  the  solutions  of  the  problem  meet  some 
given  requirements.  The  separation  constants  we  have  been  using  are  just  such 
parameters;  their  values  are  determined  by  demanding  that  the  solutions  satisfy 
some  of  the  boundary  conditions.  [For  example,  we  found  k  =  mr/10  just  before 
(2.8)  by  requiring  that  T  =  0  when  x  =  10.]  The  resulting  values  of  the  separation 
constants  are  called  eigenvalues  and  the  solutions  of  the  differential  equation  [for 
example  (2.8)]  corresponding  to  the  eigenvalues  are  called  eigenfunctions.  It  may 
also  happen  that  in  addition  to  the  separation  constants  there  is  a  parameter  in  the 


Figure  2.2 
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original  partial  differential  equation  [for  example,  E  in  the  Schrodinger  equation 
(1.6)].  Again  the  possible  values  of  this  parameter  (for  which  the  equation  has  solu¬ 
tions  meeting  specified  requirements)  are  called  eigenvalues,  and  the  corresponding 
solutions  are  called  eigenfunctions. 

Having  found  the  eigenfunctions,  the  next  step  is  to  expand  the  given  function 
(boundary  or  initial  conditions)  in  terms  of  them.  [See,  for  example  (2.10)  and 
(2.16)  and  many  examples  in  later  sections.]  As  we  have  discussed  (see  Chapter  7, 
Section  8,  and  Chapter  12,  Section  6),  the  eigenfunctions  are  a  set  of  basis  functions 
for  this  expansion.  Thus  we  select  the  functions  [for  example  e~ky  sinkx  in  (2.7)] 
and  values  of  the  separation  constants  (eigenvalues)  to  fit  the  given  boundary  (or 
initial)  conditions;  this  determines  the  basis  functions  for  a  problem. 

PROBLEMS,  SECTION  2 

After  you  find  the  series  solution  of  a  problem,  make  computer  plots  of  your  results 
as  discussed  just  after  equation  (2.13). 

1.  Find  the  steady-state  temperature  distribution  for  the  semi-infinite  plate  problem 
if  the  temperature  of  the  bottom  edge  is  T  =  f(x)  =  x  (in  degrees;  that  is,  the 
temperature  at  x  cm  is  x  degrees),  the  temperature  of  the  other  sides  is  0°,  and  the 
width  of  the  plate  is  10  cm. 

A  rn  20  \  -  (— 1)"+1  -niry/10  ■  ,  nn\ 

Answer:  T  =  —  >  - - - e  sin  Innx  W). 

7r  '  n  ' 

n= 1 

2.  Solve  the  semi-infinite  plate  problem  if  the  bottom  edge  of  width  20  is  held  at 

0°,  0  <  a:  <  10, 

100°,  10  <*<20, 

and  the  other  sides  are  at  0°. 

3.  Solve  the  semi-infinite  plate  problem  if  the  bottom  edge  of  width  7r  is  held  at 
T  =  cos  x  and  the  other  sides  are  at  0° . 

4  ^ — '  71  —  ny 

Answer:  T  =  —  >  — r - e  sinni. 

7T  '  n2  —  1 
even  n 

4.  Solve  the  semi-infinite  plate  problem  if  the  bottom  edge  of  width  30  is  held  at 

x,  0  <  x  <  15, 

30  —  15  <  x  <  30, 

and  the  other  sides  are  at  0°. 

5.  Show  that  the  solutions  of  (2.5)  can  also  be  written  as 

sinh  ky, 
cosh  ky. 

Also  show  that  these  solutions  are  equivalent  to  (2.7)  if  k  is  real  and  equivalent 
to  (2.18)  if  k  is  pure  imaginary.  (See  Chapter  2,  Section  12.)  Also  show  that 
X  =  sin  k(x  —  a),  Y  =  sinh  k(y  —  b)  are  solutions  of  (2.5). 
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6.  Show  that  the  series  in  (2.12)  can  be  summed  to  get 

200  f  sin  (to/ 10) 

T=— arCtan^s.nh(^/io) 

(with  the  arc  tangent  in  radians).  Use  this  formula  to  check  the  value  T  =  26.1° 
at  x  =  y  =  5.  Hints  for  summing  the  series:  Use  sin  (n-ro/lO)  =  Ime’"“',1°  to 
write  the  series  as  Im5/)odd  n  zn /n.  (What  is  zl)  Compare  this  with  the  series  for 
ln[(l  +  z)/{  1  —  z)]  (see  Chapter  1,  Problem  13.17).  Then  use  (13.5)  of  Chapter  2. 

7.  Solve  Problem  3  if  the  plate  is  cut  off  at  height  1  and  the  temperature  at  y  =  1  is 
held  at  0°. 

Answer:  T=  —  Y"'  - U.  .  , —  sinhn(l  —  y)  sin  to. 

tt  ^  (n2  —  1)  sinh  n  v 

even  n  v  ' 

8.  Find  the  steady-state  temperature  distribution  in  a  rectangular  plate  30  cm  by  40  cm 
given  that  the  temperature  is  0°  along  the  two  long  sides  and  along  one  short  end; 
the  other  short  end  along  the  x  axis  has  temperature 

100°,  0  <  a:  <  10, 

0°,  10  <  x  <30. 


9.  Solve  Problem  2  if  the  plate  is  cut  off  at  height  10  and  the  temperature  of  the  top 
edge  is  0° . 

10.  Find  the  steady-state  temperature  distribution  in  a  metal  plate  10  cm  square  if  one 
side  is  held  at  100°  and  the  other  three  sides  at  0°.  Find  the  temperature  at  the 
center  of  the  plate. 


Answer: 


T=  E 

odd  n 

T{ 5,5)  ~  25°. 


400  .  ,  m r,,.  .  .  mvx 

- ■  i - smh  777 ( 10  “  Vi  sm  ~ iTT’ 

mrsmhn7r  10  10 


11.  Find  the  steady-state  temperature  distribution  in  the  plate  of  Problem  10  if  two 
adjacent  sides  are  at  100°  and  the  other  two  at  0°.  Hint:  Use  your  solution  of 
Problem  10.  You  should  not  have  to  do  any  calculation-  just  write  down  the  answer! 

12.  Find  the  temperature  distribution  in  a  rectangular  plate  10  cm  by  30  cm  if  two 
adjacent  sides  are  held  at  100°  and  the  other  two  sides  at  0°. 

13.  Find  the  steady-state  temperature  distribution  in  a  rectangular  plate  covering  the 
area  0  <  x  <  10,  0  <  y  <  20,  if  the  two  adjacent  sides  along  the  axes  are  held  at 
temperatures  T  =  x  and  T  =  y  and  the  other  two  sides  at  0°. 

14.  In  the  rectangular  plate  problem,  we  have  so  far  had  the  temperature  specified  all 
around  the  boundary.  We  could,  instead,  have  some  edges  insulated.  The  heat 
flow  across  an  edge  is  proportional  to  dT /dn,  where  n  is  a  variable  in  the  direction 
normal  to  the  edge  (see  normal  derivatives,  Chapter  6,  Section  6).  For  example, 
the  heat  flow  across  an  edge  lying  along  the  x  axis  is  proportional  to  dT/dy.  Since 
the  heat  flow  across  an  insulated  edge  is  zero,  we  must  have  not  T,  but  a  partial 
derivative  of  T,  equal  to  zero  on  an  insulated  boundary.  Use  this  fact  to  find  the 
steady-state  temperature  distribution  in  a  semi-infinite  plate  of  width  10  cm  if  the 
two  long  sides  are  insulated,  the  far  end  (at  oo  as  in  Figure  2.1)  is  at  0°,  and  the 
bottom  edge  is  at  T  =  f(x)  =  x  —  5. 

Note  that  you  used  T  — >  0  as  y  — >  oo  only  to  discard  the  solutions  e+ky‘,  it  would 
be  just  as  satisfactory  to  say  that  T  does  not  become  infinite  as  y  — >  oo.  Actually, 
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the  temperature  (assumed  finite)  as  y  — *  oo  in  this  problem  is  determined  by  the 
given  temperature  at  y  =  0.  Let  T  =  f(x)  =  x  at  y  =  0,  repeat  your  calculations 
above  to  find  the  temperature  distribution,  and  find  the  value  of  T  for  large  y.  Don’t 
forget  the  k  =  0  term  in  the  series! 

15.  Consider  a  finite  plate,  10  cm  by  30  cm,  with  two  insulated  sides,  one  end  at  0°  and 
the  other  at  a  given  temperature  T  =  f(x).  Try  f(x)  =  100°;  f{x)  =  x.  You  should 
convince  yourself  that  this  problem  cannot  be  done  using  just  the  solutions  (2.7). 
To  see  what  is  wrong,  go  back  to  the  differential  equations  (2.5)  and  solve  them 
if  k  =  0.  You  should  find  solutions  x,  y,  xy  and  constant  [the  constant  is  already 
contained  in  (2.7)  for  k  =  0,  but  the  other  three  solutions  are  not].  Now  go  back 
over  each  of  the  problems  we  have  done  so  far  and  see  why  we  could  ignore  these 
k  =  0  solutions;  then  including  the  k  =  0  solutions,  finish  the  problem  of  the  finite 
plate  with  insulated  sides. 

For  the  case  f(x)  =  x,  the  answer  is: 


odd  n 


1  .  ,  nn  mrx 

2-xQ —  smh  To  (30  ”  V)  cos  T7T ' 
n2  smh  3n7r  10  10 


16.  Show  that  there  is  only  one  function  u  which  takes  given  values  on  the  (closed) 
boundary  of  a  region  and  satisfies  Laplace’s  equation  V2u  =  0  in  the  interior  of 
the  region.  Hints:  Suppose  ui  and  U2  are  both  solutions  with  the  same  boundary 
conditions  so  that  U  =  ui  —  U2  =  0  on  the  boundary.  In  Green’s  first  identity 
(Chapter  6,  Problem  10.16),  let  <j>  =  \E'  =  U  to  show  that  VC/  =  0.  Thus  show 
U  =  0  everywhere  inside  the  region. 


3.  THE  DIFFUSION  OR  HEAT  FLOW  EQUATION;  THE  SCHR0DINGER  EQUATION 

The  heat  flow  equation  is 


(3.1) 


V2u  = 


1  du 
a 2  dt 1 


where  u  is  the  temperature  and  a2  is  a  constant  characteristic  of  the  material 
through  which  heat  is  flowing.  It  is  worthwhile  to  do  first  a  partial  separation  of 

(3.1)  into  a  space  equation  and  a  time  equation;  the  space  equation  in  more  than 
one  dimension  then  must  be  further  separated  into  ordinary  differential  equations 
in  x  and  y,  or  x,  y,  and  z,  or  r,  9,  <p,  etc.  We  assume  a  solution  of  (3.1)  of  the  form 

(3.2)  u  =  F(x,y,z)T{t). 


(Note  the  change  in  meaning  of  T;  we  have  previously  used  it  for  temperature;  here 
u  is  temperature  and  T  is  the  time-dependent  factor  in  u.)  Substitute  (3.2)  into 
(3.1);  we  get 

(3.3)  TV2F=\f^. 

or  at 

Next  divide  (3.3)  by  FT  to  get 

^2f  =  ^F 

F  a2  T  dt 


(3.4) 
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The  left  side  of  this  identity  is  a  function  only  of  the  space  variables  x,  y ,  z,  and  the 
right  side  is  a  function  only  of  time.  Therefore  both  sides  are  the  same  constant 
and  we  can  write 


(3.5) 


1  9 

-V2F 

1  1  dT 
c^T~dt 


-k2 

-k2 


or  \72F  +  k2F  =  0 

dT  2  2 
or  —  =  -k2a2T. 
dt 


The  time  equation  can  be  integrated  to  give 


and 


(3.6) 


yi  _  e-k2a2t 


We  can  see  a  physical  reason  here  for  choosing  the  separation  constant  (— k2)  to  be 
negative.  As  t  increases,  the  temperature  of  a  body  might  decrease  to  zero  as  in 
(3.6),  but  it  could  not  increase  to  infinity  as  it  would  if  we  had  used  +k 2  in  (3.5) 
and  (3.6).  The  space  equation  in  (3.5)  is  the  Helmholtz  equation  (1.5)  as  promised. 
You  will  find  (Problem  10)  that  the  space  part  of  the  wave  equation  is  also  the 
Helmholtz  equation. 

y 

Example  1.  Let  us  now  consider  the  flow  of  heat  through  a  slab 
of  thickness  l  (for  example,  the  wall  of  a  refrigerator).  We 
shall  assume  that  the  faces  of  the  slab  are  so  large  that  we 
may  neglect  any  end  effects  and  assume  that  heat  flows  only 
in  the  x  direction  (Figure  3.1).  This  problem  is  then  iden¬ 
tical  to  the  problem  of  heat  flow  in  a  bar  of  length  l  with 
insulated  sides,  because  in  both  cases  the  heat  flow  is  just  in 
the  x  direction.  Suppose  the  slab  has  initially  a  steady-state 
temperature  distribution  with  the  x  =  0  wall  at  0°  and  the 
x  =  l  wall  at  100°.  From  t  =  0  on,  let  the  x  =  l  wall  (as 
well  as  the  x  =  0  wall)  be  held  at  0°.  We  want  to  find  the 
temperature  at  any  x  (in  the  slab)  at  any  later  time. 

First,  we  find  the  initial  steady-state  temperature  distribution.  You  probably 
already  know  that  this  is  linear,  but  it  is  interesting  to  see  this  from  our  equations. 
The  initial  steady-state  temperature  uq  satisfies  Laplace’s  equation,  which  in  this 
one-dimensional  case  is  d2uo/dx2  =  0.  The  solution  of  this  equation  is  uq  =  ax  +  6, 
where  a  and  b  are  constants  which  must  be  found  to  fit  the  given  conditions.  Since 
uq  =  0  at  x  =  0  and  u o  =  100  at  x  =  l,  we  have 


(3.7) 


100 

u  o  =  —f~x- 


From  t  =  0  on,  u  satisfies  the  heat  flow  equation  (3.1).  We  have  already  sep¬ 
arated  this;  the  solutions  are  (3.2)  where  T(t)  is  given  by  (3.6)  and  F( x)  satisfies 
the  first  of  equations  (3.5),  namely 


d?F 

(3.8)  V2F  +  k2F  =  0  or  — —  +  k2F  =  0. 

dxz 

(For  this  one-dimensional  problem,  F  is  a  function  only  of  x.)  The  solutions  of  (3.8) 
are 


F(x)  = 


(3.9) 


sin  kx , 
cos  kx 


630  Partial  Differential  Equations 


Chapter  13 


and  the  solutions  (3.2)  are 


(3.10) 


_ •  7 

e  sin  kx 

e-k2a2t  cog 


We  discard  the  cos  kx  solution  for  this  problem  because  we  are  given  u  =  0  at  x  =  0. 
Also  we  want  u  =  0  at  x  =  Z;  this  will  be  true  if  sin  kl  =  0,  that  is,  kl  =  nn:  or 
k  =  mr/l  (eigenvalues).  Our  basis  functions  (eigenfunctions)  are  then 


u  =  e  (nna'l>  'sin  — — 


(3.11) 

and  the  solution  of  our  problem  will  be  the  series 

OO 

(3.12)  u  =  J2  bne~{n™/l)2t  sin 

n—  1 

At  t  =  0,  we  want  u  =  Uq  as  in  (3.7),  that  is, 

(3.13) 


v-^  .  rmx  100 

=  2 *  On  Sin  — - —  =  Uq  =  —^X. 


I 


l 


This  means  finding  the  Fourier  sine  series  for  (100/Z)x  on  (0, Z);  the  result  (from 
Problem  1)  for  the  coefficients  is 


(3.14) 


k  =  =  mtvn. 

Inn  n  n 


Then  we  get  the  final  solution  by  substituting  (3.14)  into  (3.12);  this  gives 


(3.15) 


200 


7 r 


E 


n 


(mra/l)2t  gjn 


nnx 

~T 


Example  2.  We  can  now  do  some  variations  of  this  problem.  Suppose  the  final  temper¬ 
atures  of  the  faces  are  given  as  two  different  constant  values  different  from  zero. 
Then,  as  for  the  initial  steady  state,  the  final  steady  state  is  a  linear  function  of 
distance.  The  series  (3.12)  tends  to  a  final  steady  state  of  zero;  to  obtain  a  solution 
tending  to  some  other  final  steady  state,  we  add  to  (3.12)  the  linear  function  Uf 
representing  the  correct  final  steady  state.  Thus  we  write  instead  of  of  (3.12) 

OO 

(3.16)  u=  J2bne-{n"a/l)2tsm^  +  uf. 

n=  1 

Then  for  t  =  0,  the  equation  corresponding  to  (3.13)  is 


(3.17) 


E717TX 

bn  sin— - 1-  Uf 

n= 1 


i 
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or 


(3.18) 


U0  -  Uf  =  ^2  bn  si 


mrx 


sin  ■ 


TL—l 


l 


Thus  when  Uf  ^  0,  it  is  uo~  Uf  rather  than  uq  which  must  be  expanded  in  a  Fourier 
series. 


Insulated  boundaries  So  far  we  have  had  the  boundary  temperatures  given. 
We  could,  instead,  have  the  faces  insulated;  then  no  heat  flows  in  or  out  of  the 
body.  This  will  be  true  if  the  normal  derivative  du/dn  (see  Problem  2.14)  of  the 
temperature  is  zero  at  the  boundary.  (When  the  boundary  values  of  u  are  given, 
the  problem  is  called  a  Dirichlet  problem;  when  the  boundary  values  of  the  normal 
derivative  du/dn  are  given,  the  problem  is  called  a  Neumann  problem.)  For  the  one¬ 
dimensional  case  we  have  considered,  we  replace  the  condition  u  =  0  at  x  =  0  and 
l  by  the  condition  du/dx  =  0  at  x  =  0  and  l  if  the  faces  are  insulated.  This  means 
that  the  useful  solution  in  (3.10)  is  now  the  one  containing  coskx',  note  carefully 
that  we  must  include  the  constant  term  (corresponding  to  k  =  0).  See  Problem  7. 


The  Schrodinger  Equation  Compare  equations  (1.3)  and  (1.6).  If  V  =  0  in 
(1.6),  the  two  equations  have  the  same  form  (a  V2  term  and  a  first  partial  with 
respect  to  t).  For  future  reference  (see  problems,  Section  7),  let’s  first  separate 
variables  in  the  general  equation  (1.6).  We  assume  [compare  (3.2)] 

(3.19)  'F  =  ip{x,y,z)T(t). 


Substitute  (3.19)  into  (1.6)  and  divide  by  T  to  get 


(3.20) 


2  to  V’ 


v2v>  +  p  =  in 


1  dT 
T~dt 


=  E 


where  E  is  the  separation  constant  [compare  (3.5)].  (In  quantum  mechanics,  E  has 
the  meaning  of  energy  of  the  particle.)  Then  integrating  the  time  equation  gives 
(compare  3.6) 

(3.21)  T  =  e~iEt/n 


and  the  space  equation  (called  the  time-independent  Schrodinger  equation )  is 
h2 

(3.22)  —  - — V2r/>  +  Vif)  =  Eip.  Time-independent  Schrodinger  equation 

2m 

For  the  one-dimensional  problems  that  we  consider  in  this  section,  and  with 
V  =  0,  we  have 


(3.23) 


h2  d2ip 
2m  dx2 


Eip 


or 


d2% b 
dx2 


2  mE 


=  0 


which  is  (3.8)  with  k2  =  '2r//F .  Thus  the  solutions  of  (3.23)  are  the  same  as  in  (3.9) 
and  the  corresponding  T  solutions  are 


—iEt/h 


(3.24) 


T  =  ip(x)T(t) 


sin  kx 
cos  kx 
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Example  3.  The  “particle  in  a  box”  problem  in  quantum  mechanics  requires  the  solution 
of  the  Schrodinger  equation  with  V  =  0  on  (0,  l)  and  T  =  0  at  the  endpoints  x  =  0 
and  x  =  l  for  all  t.  (The  wave  function  T  then  describes  a  particle  trapped  between 
0  and  l.)  As  in  the  heat  flow  problem,  T  =  0  at  x  =  0  requires  the  sine  solutions 
in  (3.24)  and  T  =  0  at  x  =  l  requires  k  =  mr/l.  Since  k 2  =  2mE/h2,  we  find 

t2  2  2 

E  =  which  we  will  call  En.  (The  meaning  of  this  equation  in  quantum 

mechanics  is  that  the  energy  of  a  particle  trapped  between  0  and  l  can  have  only  a 
discrete  set  of  values  called  eigenvalues.  We  say  that  the  energy  is  quantized.)  The 
basis  functions  for  this  problem  are  then  the  eigenfunctions 

(3.25)  Tn  =  sin 

and  we  write  ty(x,t)  as  a  linear  combination  of  them. 

OO 

(3.26)  ^(rr,  t)  =  ^bn  sin  !E£e-Wnt/Ht 

n—  1 

(compare  (3.12)  for  the  heat  flow  problem).  If  the  initial  state  ^(a^O)  is  the  same 
function  as  in  (3.7),  the  bn  coefficients  are  the  same  as  in  (3.14),  so  we  have 


(3.27) 


See  Problems  11  and  12;  also  see  Problems  6.6  to  6.8,  and  7.17  to  7.22. 


PROBLEMS,  SECTION  3 

As  in  Section  2,  make  computer  plots  of  your  results. 

1.  Verify  the  coefficients  in  equation  (3.14). 

2.  A  bar  10  cm  long  with  insulated  sides  is  initially  at  100°.  Starting  at  t  =  0,  the 
ends  are  held  at  0°.  Find  the  temperature  distribution  in  the  bar  at  time  t. 

A  400  \r-^  1  —  (nna /10)2£  •  nnx 

Answer:  u= -  >  — e  sm - . 

7r  '  n  10 

odd  n 

3.  In  the  initial  steady  state  of  an  infinite  slab  of  thickness  l,  the  face  x  =  0  is  at  0° 
and  the  face  x  =  l  is  at  100° .  From  t  =  0  on,  the  x  =  0  face  is  held  at  100°  and  the 
x  =  l  face  at  0°.  Find  the  temperature  distribution  at  time  t. 

A  mo  100x  400  1  -(.nna/l)2t  ■  nrrx 

Answer:  u  =  100 - >  —  e  sm - . 

I  7T  '  n  l 

even  n 

4.  At  t  =  0,  two  flat  slabs  each  5  cm  thick,  one  at  0°  and  one  at  20°,  are  stacked 
together,  and  then  the  surfaces  are  kept  at  0°.  Find  the  temperature  as  a  function 
of  x  and  t  for  t  >  0. 

5.  Two  slabs,  each  1  inch  thick,  each  have  one  surface  at  0°  and  the  other  surface  at 
100°.  At  t  =  0,  they  are  stacked  with  their  100°  faces  together  and  then  the  outside 
surfaces  are  held  at  100°.  Find  u(x,t)  for  t  >  0. 

6.  Show  that  the  following  problem  is  easily  solved  using  (3.15):  The  ends  of  a  bar 
are  initially  at  20°  and  150°;  at  t  =  0  the  150°  end  is  changed  to  50°.  Find  the 
time-dependent  temperature  distribution. 
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7.  A  bar  of  length  l  with  insulated  sides  has  its  ends  also  insulated  from  time  t  =  0  on. 
Initially  the  temperature  is  u  =  x,  where  x  is  the  distance  from  one  end.  Determine 
the  temperature  distribution  inside  the  bar  at  time  t.  Hints  and  comments:  See  the 
discussion  above  and  also  Problem  2.14.  Show  that  the  k  =  0  solutions  are  x  and 
constant  (time  independent).  Note  that  here  (unlike  Problem  2.15)  you  do  not  need 
the  extra  solution  (namely  x)  for  k  =  0  since  the  final  steady  state  is  a  constant 
and  this  is  included  in  the  solutions  (3.10).  Also  note  that  we  did  need  the  k  =  0 
solutions  in  the  discussion  following  (3.15)  but  were  able  to  simplify  the  work  by 
observing  that  these  linear  solutions  simply  give  the  final  steady  state. 


Answer: 


4 1 


E 

odd  n 


1  nnx  . 

~  cos  — — e 
n i  l 


(nna/l)^  t 


8.  A  bar  of  length  2  is  initially  at  0°.  From  t  =  0  on,  the  x  =  0  end  is  held  at  0°  and 
the  x  =  2  end  at  100°.  Find  the  time-dependent  temperature  distribution. 

9.  Solve  Problem  8  if,  for  t  >  0,  the  x  =  0  end  of  the  bar  is  insulated  and  the  x  =  2 
end  is  held  at  100°.  See  Problem  7  above,  and  Chapter  7,  end  of  Section  11. 

10.  Separate  the  wave  equation  (1.4)  into  a  space  equation  and  a  time  equation  as  we  did 
the  heat  flow  equation,  and  show  that  the  space  equation  is  the  Helmholtz  equation 
for  this  case  also. 

11.  Solve  the  “particle  in  a  box”  problem  to  find  ^(xH)  if  4/(a;,0)  =  1  on  (0, 7r).  What 
is  B„?  The  function  of  interest  here  which  you  should  plot  is  |'P(a;,t)|2. 

12.  Do  Problem  11  if  ^(a^O)  =  sin2  nx  on  (0, 1). 


4.  THE  WAVE  EQUATION;  THE  VIBRATING  STRING 

Let  a  string  (for  example,  a  piano  or  violin  string)  be  stretched  tightly  and  its  ends 
fastened  to  supports  at  x  =  0  and  x  =  l.  When  the  string  is  vibrating,  its  vertical 
displacement  y  from  its  equilibrium  position  along  the  x  axis  depends  on  x  and  t. 
We  assume  that  the  displacement  y  is  always  very  small  and  that  the  slope  dy/dx 
of  the  string  at  any  point  at  any  time  is  small.  In  other  words,  we  assume  that 
the  string  never  gets  very  far  away  from  its  stretched  equilibrium  position;  in  fact, 
we  do  not  distinguish  between  the  length  of  the  string  and  the  distance  between 
the  supports,  although  it  is  clear  that  the  string  must  stretch  a  little  as  it  vibrates 
out  of  its  equilibrium  position.  Under  these  assumptions,  the  displacement  y(x,t) 
satisfies  the  (one-dimensional)  wave  equation 


(4.1) 


d2y  1  d2y 
dx2  v 2  dt2 


The  constant  v  depends  on  the  tension  and  the  linear  density  of  the  string;  it  is 
called  the  wave  velocity  because  it  is  the  velocity  with  which  a  disturbance  at  one 
point  of  the  string  would  travel  along  the  string.  To  separate  the  variables,  we 
substitute 


(4.2)  y  =  X(x)T(t) 

into  (4.1)  and  get  (Problem  3.10) 

1  d2X  1  1  d2T 
X~d^  ~  v* T~d: ¥ 
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or 

(4.3)  X"  +  k2X  =  0, 

f  +  jfeVfT  =  0. 


We  can  see  from  the  physical  problem  why  we  use  a  negative  separation  constant 
here;  the  solutions  are  to  describe  vibrations  which  are  represented  by  sines  and 
cosines,  not  by  real  exponentials.  Of  course,  if  we  tried  using  +k2  with  k  real, 
we  would  also  discover  mathematically  that  we  could  not  satisfy  the  boundary 
conditions. 

Recall  the  following  notation  used  in  discussing  wave  phenomena  (see  Chapter  7, 
Problem  2.17): 


V  =  frequency  (sec  x)  u  =  2lru 

A  =  wavelength  2ir 

v  =  Xu  k=~  = 

=  angular  frequency  (radians) 

2ttv  lu 

- =  —  =  wave  number 

V  V 

The  solutions  of  the  two  equations  in  (44 

(4.4)  x  =  j™  fa,  r  1 

1  cos  k  x, 

and  so  the  solutions  (4.2)  for  y  are  are 

)  are 

sin  kvt  =  sin  uit, 
cos  kvt  =  cos  cot, 

II 

LO" 

sin  kx  1  J  sin  cut 
cos  kx  J  \  cos  tot 

where  ui  =  kv. 

Since  the  string  is  fastened  at  x  =  0  and  x  =  l,  we  must  have  y  =  0  for  these  values 
of  x  and  all  t.  This  means  that  we  want  only  the  sin kx  factors  in  (4.5),  and  also 
we  select  k  so  that  sinfcZ  =  0  or  k  =  nir/l.  The  solutions  then  become 

.  nirx  nirvt 

sin  — —  sin  — - — , 

nirx  nirvt 
sin  — 
l 

The  particular  combination  of  solu¬ 
tions  (4.6)  that  we  should  take  to  solve 
a  given  problem  depends  on  the  initial 
conditions.  For  example,  suppose  the 
string  is  started  vibrating  by  plucking 
(that  is,  pulling  it  aside  a  small  distance 
h  at  the  center  and  letting  go).  Then 
we  are  given  the  shape  of  the  string  at  t  =  0,  namely  yo  =  f(x)  as  in  Figure 
4.1,  and  also  the  fact  that  the  velocity  dy/dt  of  points  on  the  string  is  zero  at 
t  =  0.  (Do  not  confuse  dy/dt  with  the  wave  velocity  v;  there  is  no  relation  between 


l 
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them.)  In  (4.6)  we  must  then  discard  the  term  containing  sin  ( mrvt/l )  since  its  time 
derivative  is  not  zero  when  t  =  0.  Thus  the  basis  functions  for  this  problem  are 
sin  ( mrx/l )  cos  ( mrvt/l )  and  we  write  the  solution  in  the  form 

.  .  ,  .  mrx  mrvt 

(4.7)  y=  y  j  bn  sin  — —  cos  — — . 

n=  1 

The  coefficients  bn  are  to  be  determined  so  that  at  t  =  0  we  have  yo  =  f(x),  that 
is, 

OO 

,  j  \ >  ,  .  TL7TX  . 

(4.8)  y0  =  2J  K  sin  — —  =  /( x). 

n= 1 

As  in  previous  problems,  we  find  the  coefficients  in  the  Fourier  sine  series  for  the 
given  f(x)  and  substitute  them  into  (4.7).  The  result  is  (Problem  1) 

8h  (  nx  nvt  1  37ra;  3nvt  \ 

(4.9)  »=^(S1“T“«— -9Sm“rC““r  +  "T 

Another  way  to  start  the  string  vibrating  is  to  hit  it  (a  piano  string,  for  example). 
In  this  case  the  initial  conditions  would  be  y  =  0  at  t  =  0,  with  the  velocity  dy/dt 
at  t  =  0  given  as  a  function  of  x  (that  is,  the  velocity  of  each  point  of  the  string 
is  given  at  t  =  0).  This  time  we  discard  in  (4.6)  the  term  containing  cos  (mrvt/l) 
because  it  is  not  zero  at  t  =  0.  Then,  for  this  problem,  the  basis  functions  are 
sin  (mrx/l)  sin  ( mrvt/l )  and  the  solution  is  of  the  form 


(4.10) 


y  = 


oo 

y,  Bn  sin 

n—1 


TITTX  . 

— - —  sin 


nTTVt 

l 


Here  the  coefficients  must  be  determined  so  that 


(4.11) 


oy  \  yr-^  nnv  .  nnx  7 

Wl  =2^B”—Sm—  =  2^bn  SI 


t=0 


nnx 

sm  — - —  =  V  (x) , 


that  is,  V(x),  the  given  initial  velocity,  must  be  expanded  in  a  Fourier  sine  series 
(see  Problems  5  to  8). 

Suppose  the  string  is  vibrating  in  such  a  way  that,  instead  of  an  infinite  series 
for  y,  we  have  just  one  of  the  solutions  (4.6),  say 


(4.12) 


.  mrx  mrvt 

y  =  sm  — - —  sm  — - — 


for  some  one  value  of  n.  The  largest  value  of  sm(mrvt/l),  for  any  t,  is  1,  and  the 
shape  of  the  string  then  is 


(4.13) 


.  mrx 
y  =  sm  — — 


Graphs  of  (4.13)  are  sketched  in  Figure  4.2  for  n  =  1,2,  3, 4.  (The  graphs  are  ex¬ 
aggerated!  Remember  that  the  displacements  are  actually  very  small.)  Consider 
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Figure  4.2 


a  point  x  on  the  string;  for  this  point  sin  (mrx/l)  is  some  number,  say  A.  Then  the 
displacement  of  this  point  at  time  t  is  [from  (4.12)] 


(4.14) 


m rvt 

y  =  A  sin  — - — 


As  time  passes,  this  point  of  the  string  oscillates  up  and  down  with  frequency  vn 
given  by  w„  =  mrv/l  =  2 irvn  or  vn  =  nv/(2l);  the  amplitude  of  the  oscillation  at 
this  point  is  A  =  sin(mrx/l)  (see  Figure  4.2).  Other  points  of  the  string  oscillate 
with  different  amplitudes  but  the  same  frequency.  This  is  the  frequency  of  the 
musical  note  which  the  string  is  producing.  (See  Chapter  7,  Section  10.)  If  n  =  1 
(see  Figure  4.2),  the  frequency  is  v/(2l)\  in  music  this  tone  is  called  the  fundamental 
or  first  harmonic.  If  n  =  2,  the  frequency  is  just  twice  that  of  the  fundamental;  this 
tone  is  called  the  first  overtone  or  the  second  harmonic;  etc.  All  the  frequencies 
which  this  string  can  produce  are  multiples  of  the  fundamental.  These  frequencies 
are  called  the  characteristic  frequencies  of  the  string.  (They  are  proportional  to  the 
characteristic  values  or  eigenvalues,  k  =  mr/I.)  The  corresponding  ways  in  which 
the  string  may  vibrate  producing  a  pure  tone  of  just  one  frequency  [that  is,  with  y 
given  by  (4.12)  for  one  value  of  n]  are  called  the  normal  modes  of  vibration.  The 
first  four  normal  modes  are  indicated  in  Figure  4.2.  Any  vibration  is  a  combination 
of  these  normal  modes  [for  example,  (4.9)  or  (4.10)].  The  solution  (4.12)  (for  one 
n )  describing  one  normal  mode,  is  a  characteristic  function  or  eigenfunction. 

The  waves  in  Figure  4.2  are  called  standing  waves.  The  d’Alembert  solution 
of  the  wave  equation  (see  Problem  1.2)  represents  traveling  waves.  Suppose  we 
combine  two  traveling  waves  moving  in  opposite  directions  as  follows: 


(4.15) 


cos  fc(  x  —  vt)  —  cos  k(x  +  vt)  =  2  sin  kx  sin  kvt 


(by  a  trigonometry  formula).  This  is  one  of  the  solutions  (4.5)  so  we  see  that  this 
combination  of  two  traveling  waves  produces  a  standing  wave.  Suppose  these  two 
traveling  waves  are  moving  along  a  string  which  is  fastened  at  x  =  0  and  at  x  =  l. 
First  consider  the  wave  cos  k(x  +  vt)  which  is  moving  in  the  negative  x  direction 
toward  x  =  0.  When  it  reaches  x  =  0,  it  will  be  reflected,  and  the  combination  of 
the  incident  and  reflected  waves  must  equal  zero  at  x  =  0  for  all  t.  We  see  that 
this  is  true  in  (4.15),  so  the  wave  cos k(x  —  vt)  is  the  reflection  of  —  cos k(x  +  vt). 
Now  consider  cos  k(x  —  vt)  traveling  toward  x  =  l.  When  it  reaches  x  =  l  and  is 
reflected,  we  can  verify  (Problem  10)  that,  if  k  =  nir/l,  then  the  reflection  at  x  =  l 
is  —  cos  (x  +  vt) .  We  can  think  of  a  wave  traveling  back  and  forth  between  x  =  0 
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and  x  =  l,  being  reflected  at  each  end.  The  net  result  as  we  see  from  (4.15)  is  a 
standing  wave. 

So  far  we  have  been  considering  problems  in  which  a  string  is  pinned  at  both 
ends.  We  could,  instead,  have  a  “free”  end;  this  means  free  to  move  up  and  down 
along  x  =  0  or  x  =  l,  say  by  allowing  the  end  to  slide  along  a  frictionless  track. 
The  mathematical  condition  for  this  is  dy/dx  =  0  at  the  free  end  (compare  the 
condition  for  an  insulated  face  in  Section  3).  If  the  x  =  0  end  is  free,  we  choose 
the  solution  containing  coskx  (since  ^  c oskx  =  —ksinkx  =  0  at  x  =  0).  Then,  if 
the  string  is  pinned  at  x  =  l,  we  want  cos  kl  =  0,  so  kl  =  (n  +  ^)7r.  Thus  the  basis 
functions  when  the  x  =  0  end  is  free,  the  x  =  l  end  is  pinned,  and  the  initial  string 
velocity  is  zero,  are 


(4.16) 


y  =  cos  ■ 


>+*)■ 


■  cos  ■ 


TTVt 


l  l 

For  a  discussion  of  these  functions,  see  Chapter  7,  Section  11  and  Problem  11.11. 


PROBLEMS,  SECTION  4 

As  in  Sections  2  and  3,  use  a  computer  to  plot  your  answers. 

1.  Complete  the  plucked  string  problem  to  get  equation  (4.9). 

2.  A  string  of  length  l  has  a  zero  initial  velocity 
and  a  displacement  yo(x)  as  shown.  (This  ini-  yoU) 
tial  displacement  might  be  caused  by  stopping 
the  string  at  the  center  and  plucking  half  of  it.) 

Find  the  displacement  as  a  function  of  x  and  t. 


Answer: 


8h  vA  „  •  n7ra:  nnvt 

y  =  —z  >  Bn  sm  — —  cos  — : — 
7 XZ  z '  l  l 

n= 1 


where  Bn 


(2  sin  nn /4  —  sin  nn /2)  jr? . 


3.  Solve  Problem  2  if  the  initial 
displacement  is: 


//4 


4.  Solve  Problem  2  if  the  initial 
displacement  is: 


5.  A  string  of  length  l  is  initially  stretched  straight;  its 
ends  are  fixed  for  all  t.  At  time  t  =  0,  its  points  are 
given  the  velocity  V(x )  =  (dy/dt)t= o  as  indicated 
in  the  diagram  (for  example,  by  hitting  the  string). 

Determine  the  shape  of  the  string  at  time  t,  that  is, 
find  the  displacement  y  as  a  function  of  x  and  t  in  the  form  of  a  series  similar  to 
(4.9).  Warning:  What  basis  functions  do  you  need  here? 

8 hi  (  .  nx  .  7 rut  1  .  37ra;  .  3nvt  1  .  57ra:  .  5nvt 

Answer:  y  =  — -  sm  — —  sm  — - —  sm  — —  sm  — ; - h  —  sm  — —  sm  — ; - - 

nJv  \  l  l  3J  l  l  l  l 
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6.  Do  Problem  5  if  the  initial  velocity  V(x)  =  (dy / dt)t=o  is  as  shown. 


V(x) 


2  w 


1/2 


■X 


4 hi  (  .  7YW  .  TTX  .  7 YVt  1  .  37 TW  .  3nX  .  3lVVt 

Answer:  y  =  — sm  — —  sm  —  sm  — - —  sm  — ; —  sm  — ; —  sm  — ; - p  •  •  • 

7 rJv  \  l  l  l  9  l  l  l 


7.  Solve  Problem  5  if  the  initial  velocity  is: 


8.  Solve  Problem  5  if  the  initial  velocity  is 


V(x) 


sin  2nx/l, 

0, 


0  <  x  <  1/2, 
1/2  <  x  <  l. 


9.  In  each  of  the  Problems  1  to  8,  find  the  frequency  of  the  most  important  harmonic. 

10.  Verify  that,  if  k  =  then  the  sum  of  the  two  traveling  waves  in  equation  (4.15)  is 
zero  at  x  =  l,  for  all  t. 

11.  Verify  (4.16)  and  find  a  similar  formula  for  a  string  pinned  at  x  =  0  and  free  at 

x  =  l.  Solve  Problems  2,  3,  and  4,  for  a  string  with  a  free  end  (a)  at  x  =  0;  (b)  at 

x  =  l. 

12.  In  Sections  2,  3,  4,  we  have  solved  a  number  of  physics  problems  which  led  to 
the  expansion  of  a  given  f(x)  in  a  Fourier  sine  series.  Look  at  (2.9)  and  (2.25), 
temperature  in  a  plate;  (3.12),  heat  flow;  (3.26),  wave  function  for  a  particle  in  a 
box;  (4.7)  and  (4.10),  displacement  of  a  vibrating  string  plucked  or  struck.  If  we 
have  expanded  a  given  f(x )  in  a  Fourier  sine  series  on  (0,/),  we  can  immediately 
write  the  corresponding  solutions  for  these  six  different  physics  problems  on  the  same 
interval.  Do  this  for  f(x)  =  x  —  x2  on  (0, 1),  that  is  with  1  =  1.  Make  computer 
plots  of  your  results. 

13.  Do  Problem  12  for  f(x)  =  1  —  cos  2x  on  (0, 7r). 

14.  Do  Problem  12  for  f(x)  =  x  —  x3  on  (0, 1). 

5.  STEADY-STATE  TEMPERATURE  IN  A  CYLINDER 

Consider  the  following  problem.  Find  the  steady-state 
temperature  distribution  u  in  a  semi-infinite  solid  cylin¬ 
der  (Figure  5.1)  of  radius  a  if  the  base  is  held  at  100° 
and  the  curved  sides  at  0°.  This  sounds  very  much 
like  the  problem  of  the  temperature  distribution  in  a 
semi-infinite  plate.  However,  it  is  not  convenient  here 
to  use  the  solutions  in  rectangular  coordinates,  because 
the  boundary  condition  u  =  0  is  given  for  r  =  a  rather 
than  for  constant  values  of  x  or  y.  The  natural  vari¬ 
ables  for  this  problem  are  the  cylindrical  coordinates 
r,  9,  z.  The  temperature  u  inside  the  cylinder  satisfies 
Laplace’s  equation  since  there  are  no  sources  of  heat 
there. 


Figure  5.1 
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Laplace’s  equation  in  cylindrical  coordinates  is  (see  Chapter  10,  Section  9) 


(5.1) 


V  M  = 


1  d 


r  dr  V  dr 


du 


1  d2u  d2u 
r 2  dd 2  dz2 


=  0. 


To  separate  the  variables,  we  assume  a  solution  of  the  form 


(5.2)  u  =  R(r)Q(0)Z(z). 

Substitute  (5.2)  into  (5.1)  and  divide  by  RQZ  to  get 

II A  (  1  d2Z 

Rr  dr  \  dr  )  +  0  r2  d02  +  Z  dz2 

The  last  term  is  a  function  only  of  z ,  while  the  other  two  terms  do  not  contain  z. 
Therefore  the  last  term  is  a  constant  and  the  sum  of  the  first  two  terms  is  minus 
the  same  constant.  Notice  that  neither  of  the  first  two  terms  is  constant  alone  since 
both  contain  r. 


In  order  to  say  that  a  term  is  constant,  we  must  be  sure  that: 

(a)  it  is  a  function  of  only  one  variable,  and 

(b)  that  variable  does  not  appear  elsewhere  in  the  equation. 


Thus  we  have 
(5.4) 


1  d2Z 

Zl^ 


=  K 


Z  = 


aKz 


a-Kz 


Since  we  want  the  temperature  u  to  tend  to  zero  as  z  tends  to  infinity,  we  call  the 
separation  constant  +K2  ( K  >  0)  and  then  use  only  the  e~Kz  solution.  Next  write 
(5.3)  with  the  last  term  replaced  by  K2 — see  (5.4). 

11  d  (  dR\  1  1  d2Q  o  „ 

Rr  dr  \  dr  )  0  r2  dd 2 


We  can  separate  the  variables  by  multiplying  by  r2. 


(5.5) 


rdf  dR \ 
Rdr  \  dr  ) 


1  d2Q 
Q~dB2  + 


K2r2  =  0. 


In  (5.5)  the  second  term  is  a  function  of  9  only,  and  the  other  terms  are  independent 
of  9.  Thus  we  have 


(5.6) 


i  d2e  2  J  sin  n9, 

0  d62  1  ~  ]  cos  r i9. 


Here  we  must  use  —n2  as  the  separation  constant  and  then  require  n  to  be  an 
integer  for  the  following  reason.  When  we  locate  a  point  using  polar  coordinates, 
we  can  choose  the  angle  as  9  or  as  9  +  2rmr  where  m  is  any  integer.  But  regardless 
of  the  value  of  m,  there  is  one  physical  point  and  one  temperature  there.  The 
mathematical  formula  for  the  temperature  at  the  point  must  give  the  same  value  at 
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9  as  at  9+2mn1  that  is,  the  temperature  must  be  a  periodic  function  of  9  with  period 
27 r.  This  is  true  only  if  the  0  solutions  are  sines  and  cosines  instead  of  exponentials 
(hence  the  negative  separation  constant)  and  the  constant  n  is  an  integer  (to  give 
period  27t).  The  solutions  of  (5.6)  when  n  =  0  are  9  and  constant.  Since  9  is  not 
periodic,  we  can  use  only  the  constant  solution  which  is  already  contained  in  the 
cosnf?  solution  when  n  =  0. 

Finally,  the  r  equation  is 


This  is  a  Bessel  equation  with  solutions  Jn(Kr)  and  Nn(Kr)  [see  Chapter  12, 
equation  (16.5)].  Since  the  base  of  the  cylinder  contains  the  origin,  we  can  use  only 
the  Jn  and  not  the  Nn  solutions  since  Nn  becomes  infinite  at  the  origin.  Hence  we 
have 

(5.8)  R(r)  =  Jn(Kr). 

We  can  find  the  possible  values  of  K  from  the  condition  that  the  temperature  is 
zero  on  the  curved  surface  of  the  cylinder.  Thus  u  =  0  when  r  =  a  (for  all  9  and 
z)  or  R(r)  =  0  when  r  =  a.  So  from  (5.8)  we  see  that  Jn(Ka )  =  0,  that  is,  the 
possible  values  oi  K a  are  the  zeros  of  Jn.  If  we  define  k  =  Ka,  or  K  =  k/ a,  then 

(5.9)  R(r)  =  Jn(kr/a)  and  Z(z)  =  e~kz^a . 

Thus  the  solutions  for  u  are 


(5.10) 


u  = 


Jn{kr/a)  sin  n9  e  kz^a , 
Jn(kr/a)  cos  n9  e~kz/a, 


where  k  is  a  zero  of  Jn. 


For  our  problem,  the  base  of  the  cylinder  is  held  at  a  constant  temperature  of 
100°.  If  we  turn  the  cylinder  through  any  angle  the  boundary  conditions  are  not 
changed;  thus  the  solution  does  not  depend  on  the  angle  9.  This  means  that  we  use 
cosnf?  with  n  =  0  in  (5.10).  The  possible  values  of  k  are  the  zeros  of  Jo;  call  these 
zeros  km,  where  m  =  l,2,3,---.  Thus  we  have  the  basis  functions  for  the  problem 
and  write  the  solution  in  terms  of  them: 

OO 

(5.11)  u=  y  cmMkmr/a)e-k-z/a. 

771=1 

When  2  =  0,  we  want  u  =  100,  that  is, 

OO 

uz= o  =  ^2  cmJo{kmr/a)  =  100. 

771=1 


(5.12) 
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This  should  remind  you  of  a  Fourier  series;  here  we  want  to  expand  100  in  a  series 
of  Bessel  functions  instead  of  a  series  of  sines  or  cosines.  We  proved  [see  Chapter 
12,  equation  (19.11)]  that  the  functions  J0(kmr/a)  are  orthogonal  on  (0,a)  with 
respect  to  the  weight  function  r.  We  can  then  find  the  coefficients  cm  in  (5.12)  by 
the  same  method  used  in  finding  the  coefficients  in  a  Fourier  sine  or  cosine  series. 
(In  fact,  series  like  (5.12)  are  often  called  Fourier-Bessel  series.)  Multiply  (5.12) 
by  rJo(kfj,r/a),iJ,  =  1,2,3,---,  and  integrate  term  by  term  from  r  =  0  to  r  =  a. 
Because  of  the  orthogonality  [see  Chapter  12,  equation  (19.11)],  all  terms  of  the 
series  drop  out  except  the  term  with  m  =  yt,  and  we  have 

pa  pa 

(5.13)  /  r[Jo(k^r/a)]2  dr  =  /  100 rJ0{k^r/a)  dr. 

Jo  Jo 

For  each  value  of  yt  =  1,2,3,---,  equation  (5.13)  gives  one  of  the  coefficients  in 
(5.11)  and  (5.12);  thus  any  cm  in  (5.11)  is  given  by  (5.13)  with  yt  replaced  by  m. 

We  need  to  evaluate  the  integrals  in  (5.13).  Equation  (19.11)  of  Chapter  12 
gives  (for  p  =  0,  a  =  (3  =  km) 


(5.14) 


fa  a2 

J  r  [Jo{kmr/a)}2  dr  =  —  J2{km). 


By  equation  (15.1)  of  Chapter  12 

d 

dx 


[xJi(x)]  =  xJq(x). 


If  we  put  x  =  kmr/a  in  this  formula,  we  get 


T—-J-  [{kmr / a)Ji{kmr / a)]  =  (kmr / a)J0(kmr / a), 
km.  dr 


Cancelling  one  km/a  factor  and  integrating  from  0  to  a,  we  have 


(5.15) 


fa  CL 

/  rJ0(kmr/a)  dr  =  - — rJi(kmr/a) 
Jo  km 


km. 


Now  we  write  (5.13)  for  cm,  substitute  the  values  of  the  integrals  from  (5.14)  and 
(5.15),  and  solve  for  cm.  The  result  is 


(5.16) 


100a2Ji(fcm) 


200 


O'2  J\[km)  kmJlikm) 


The  solution  of  our  problem  is  now  (5.11)  with  the  values  of  cm  given  by  (5.16). 
The  numerical  value  of  the  temperature  at  any  point  can  be  found  by  computing  a 
few  terms  of  the  series  (Problem  1).  The  values  of  the  zeros  and  of  the  Bessel  func¬ 
tions  can  be  found  either  from  your  computer  or  from  tables.  Warning:  Remember 
that  km  is  a  zero  of  Jq  ,  not  of  J\ . 

Suppose  the  given  temperature  of  the  base  of  the  cylinder  is  more  complicated 
than  just  a  constant  value,  say  f(r,6),  some  function  of  r  and  6.  Down  to  (5.10) 
we  proceed  as  before.  But  now  the  series  solution  is  more  complicated  than  (5.11) 
since  we  must  include  all  Jn’ s  instead  of  just  Jo-  We  need  a  double  subscript  on  the 
numbers  k  which  are  the  zeros  of  the  Bessel  functions;  by  kmn  we  shall  mean  the 
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mth  positive  zero  of  Jn ,  where  n  =  0, 1,  2,  •  •  •  and  m  =  l,2,3,---.  The  temperature 
u  is  a  double  infinite  series,  summed  over  the  indices  m,  n  of  all  zeros  of  all  the  Jn’s: 

oo  oo 

(5.17)  u=  Jn{kmnr  /  a)  (Amn  cos  n8  +  Bmnsmn9)e~kmnZ/a . 

m—1  n=0 

At  z  =  0,  we  want  u  =  f{r,  9).  Thus  we  write 

OO  OO 

(5.18)  itz=0  =  ^2  Jn{kmnr  /  a){Amn  cos  n9  +  Bmn  sin  nO)  =  fir, 9). 

771=1  71=0 


To  determine  the  coefficients  Amn ,  multiply  this  equation  by  J v(k M„r / a)  cos  v9  and 
integrate  over  the  whole  base  of  the  cylinder,  (0  to  27t  for  9,  0  to  a  for  r).  Because 
of  the  orthogonality  of  the  functions  sin  n9  and  cos  n9  on  (0,27t),  all  the  Bmn  terms 
drop  out,  and  only  the  Amn  terms  for  n  =  v  remain.  Because  of  the  orthogonality 
of  the  functions  Jn{kmnr  /  a)  (one  n,  all  m),  only  the  one  term  A remains.  Thus 
we  have 


n27 r 

/(r,  9)  Ju  (k^r/a)  cos  v9  r  dr  dO 


=  A„ 


Jl{k^r/a)  cos2  v9rdr  d9  =  A ^ 


Jv+l  (kfii/)  •  tt. 


o  Jo 


[The  r  integral  is  given  by  (19.11)  of  Chapter  12,  and  the  9  integral  by  Chapter  7, 
Section  4].  Notice  how  the  weight  function  r  in  the  Bessel  function  integral  arises 
here  as  part  of  the  polar  coordinate  area  element.  Similarly,  we  can  find 


(5.20)  na2j. 


+  1  J o  Jo 


n2,7T 

f(r,  9)Jv(kfj,vr /a)  sin  v9r  dr  d9. 


By  substituting  the  values  of  the  A  and  B  coefficients  from  (5.19)  and  (5.20)  into 
(5.17),  we  find  the  solution  to  the  problem. 


PROBLEMS,  SECTION  5 

1.  (a)  Compute  numerically  the  coefficients  (5.16)  of  the  first  three  terms  of  the  series 

(5.11)  for  the  steady-state  temperature  in  a  solid  semi-infinite  cylinder  when 
u  =  0  at  r  =  1,  and  u  —  100  at  z  —  0.  Find  u  at  r  =  i, z  =  1. 

(b)  In  part  (a),  if  u  =  0  at  r  =  10  and  u  =  100  at  z  =  0,  find  u  at  r  =  5,z  =  10. 
What  is  the  relation  between  parts  (a)  and  (b)?  Hint:  Suppose  in  part  (a)  that 
the  length  units  for  r  and  z  are  centimeters.  Consider  the  identical  physics 
problem  but  with  distances  measured  in  millimeters,  and  compare  part  (b). 
Note  that  in  equation  (5.10),  r/a  and  z/a  are  just  measurements  as  multiples 
of  the  radius  a. 

2.  (a)  Find  the  steady-state  temperature  distribution  in  a  solid  semi-infinite  cylinder 

if  the  boundary  temperatures  are  u  =  0  at  r  =  1  and  u  =  y  =  rsin0  at  2  =  0. 
Hints:  In  (5.10)  you  want  the  solution  containing  sin#;  therefore  you  want 
the  functions  Ji.  You  will  need  to  integrate  r2Ji;  follow  the  text  method  of 
integrating  r  Jo  just  before  (5.15). 
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(b)  Do  part  (a)  if  the  cylinder  radius  is  r  =  a. 


3. 

4. 

5. 

6. 

7. 

8. 


9. 


If  a  =  2,  find  u  when  r  =  1,  2  =  1,  9  =  n/2. 

(a)  Find  the  steady-state  temperature  distribution  in  a  solid  cylinder  of  height  10 
and  radius  1  if  the  top  and  curved  surface  are  held  at  0°  and  the  base  at  100° . 
Hint :  See  Section  2. 

(b)  Generalize  part  (a)  to  a  cylinder  of  height  H  and  radius  a. 

A  flat  circular  plate  of  radius  a  is  initially  at  temperature  100°.  From  time  t  =  0  on, 
the  circumference  of  the  plate  is  held  at  0°.  Find  the  time-dependent  temperature 
distribution  u(r,  9,  t).  Hint:  Separate  variables  in  equation  (3.1)  in  polar  coordinates. 

Do  Problem  4  if  the  initial  temperature  distribution  is  u(r,9,t  =  0)  =  lOOr  sin  (9. 

Consider  Problem  4  if  the  initial  temperature  distribution  is  given  as  some  function 
f(r,9).  The  solution  is,  in  general,  a  double  infinite  series  similar  to  (5.17).  Find 
formulas  for  the  coefficients  in  the  series. 

Find  the  steady-state  temperature  distribution  in  a  solid  cylinder  of  height  20  and 
radius  3  if  the  flat  ends  are  held  at  0°  and  the  curved  surface  at  100°.  Hints:  Use 
— K 2  in  (5.4).  Also  see  Chapter  12,  Sections  17  and  20. 

Water  at  100°  is  flowing  through  a  long  pipe  of  radius  1  rapidly  enough  so  that  we 
may  assume  that  the  temperature  is  100°  at  all  points.  At  t  =  0,  the  water  is  turned 
off  and  the  surface  of  the  pipe  is  maintained  at  40°  from  then  on  (neglect  the  wall 
thickness  of  the  pipe).  Find  the  temperature  distribution  in  the  water  as  a  function 
of  r  and  t.  Note  that  you  need  only  consider  a  cross  section  of  the  pipe. 


Find  the  steady-state  distribution  of  temperature  in  a  cube  of  side  10  if  the  tem¬ 
perature  is  100°  on  the  face  z  =  0  and  0°  on  the  other  five  faces.  Hint:  Separate 
Laplace’s  equation  in  three  dimensions  in  rectangular  coordinates,  and  follow  the 
methods  of  Section  2.  You  will  want  to  expand  100  in  the  double  Fourier  series 


oo  oo 


The  coefficients  anm  are  determined  by  using  the  orthogonality  of  the  functions 
sin  ( nnx/l )  sin  (miry /l)  over  the  square,  that  is, 


unless 


A  cube  is  originally  at  100°.  From  t  =  0  on,  the  faces  are  held  at  0°.  Find  the 
time-dependent  temperature  distribution.  Hint:  This  problem  leads  to  a  triple 
Fourier  series;  see  the  double  Fourier  series  in  Problem  9  and  generalize  it  to  three 
dimensions. 


3. 

4. 

5. 

6. 

7. 

8. 


9. 
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11.  The  following  two  R{r)  equations  arise  in  various  separation  of  variables  problems 
in  polar,  cylindrical,  or  spherical  coordinates: 


d  (  dR. 


r  dr  V  dr 


d 


=  n2  R, 


dr 


>  dR 


-\r  =Ki+m 


There  are  various  ways  of  solving  them:  They  are  a  standard  kind  of  equation  (often 
called  Euler  or  Cauchy  equations — see  Chapter  8,  Section  7d);  you  could  use  power 
series  methods;  given  the  fact  that  the  solutions  are  just  powers  of  r,  it  is  easy  to 
find  the  powers.  Choose  any  method  you  like,  and  solve  the  two  equations  for  future 
reference.  Consider  the  case  n  =  0  separately.  Is  this  necessary  for  l  =  0? 

12.  Separate  Laplace’s  equation  in  two  dimensions  in  polar  coordinates 
[equation  (5.1)  without  the  z  term]  and  solve  the  r  and  9  equations. 

(See  Problem  11.)  Remember  that  for  the  9  equation,  only  periodic 
solutions  are  of  interest.  Use  your  results  to  solve  the  problem  of  the 
steady-state  temperature  in  a  circular  plate  if  the  upper  semicircular 
boundary  is  held  at  100°  and  the  lower  at  0°. 

Comment:  Another  physical  problem  whose  mathematical  solution  is  identical 
with  this  temperature  problem  is  this:  Find  the  electrostatic  potential  inside  a 
capacitor  formed  by  two  half-cylinders,  insulated  from  each  other  and  maintained 
at  potentials  0  and  100. 


Answer: 


u  =  50  + 


200 


£(:) 

odd  n 


sin  n6 
n 


13.  Find  the  steady-state  distribution  of  temperature  in  the  sector 
of  a  circular  plate  of  radius  10  and  angle  7r/4  if  the  temperature 
is  maintained  at  0°  along  the  radii  and  at  100°  along  the  curved 
edge.  Hint:  See  Problem  12. 


14.  Find  the  steady  state  temperature  distribution  in  a  circular  annulus 
(shaded  area)  of  inner  radius  1  and  outer  radius  2  if  the  inner  circle 
is  held  at  0°  and  the  outer  circle  has  half  its  circumference  at  0° 
and  half  at  100°.  Hint:  Don’t  forget  the  r  solutions  corresponding 
to  k  =  0. 


15.  Solve  Problem  14  if  the  temperatures  of  the  two  circles  are  interchanged. 


6.  VIBRATION  OF  A  CIRCULAR  MEMBRANE 


A  circular  membrane  (for  example,  a  drumhead)  is  attached  to  a  rigid  support  along 
its  circumference.  Find  the  characteristic  vibration  frequencies  and  the  correspond¬ 
ing  normal  modes  of  vibration. 

Take  the  ( x ,  y)  plane  to  be  the  plane  of  the  circular  support  and  take  the  origin 
at  its  center.  Let  z(x,y,t)  be  the  displacement  of  the  membrane  from  the  (x,y) 
plane.  Then  z  satisfies  the  wave  equation 


(6.1) 


1  d2z 
v2  dt 2 


Putting 

(6.2) 


2  =  F(x,y)T(t), 
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we  separate  (6.1)  into  a  space  equation  (Helmholtz)  and  a  time  equation  (see  Prob¬ 
lem  3.10  and  Section  3).  We  get  the  two  equations 

(6.3)  \72F  +  K2F=  0  and  T  +  K2v2T=0. 

Because  the  membrane  is  circular  we  write  V2  in  polar  coordinates  (see  Chapter  10, 
Section  9);  then  the  F  equation  is 


(6.4) 


r  dr  \  dr  J 


1  d2F 
r2  d62 


+  K2F  =  0. 


When  we  put 


(6.5)  F  =  R(r)9(6), 

(6.4)  becomes  (5.5),  and  the  separated  equations  and  their  solutions  are  just  (5.6), 
(5.7),  and  (5.8).  The  solutions  of  the  time  equation  (6.3)  are  sin  Kvt  and  cos  Kvt. 
Thus  the  solutions  for  z  are  z  =  R(r)Q(6)T(t),  where  R(r)  =  Jn(Kr),Q(9)  = 
{sin  n(2,  cos  n0}  and  T(t)  =  {sin  Kvt,  cos  Kvt}.  Just  as  in  Section  5,  n  must  be  an 
integer.  To  find  possible  values  of  K,  we  use  the  fact  that  the  membrane  is  attached 
to  a  rigid  frame  at  r  =  a,  so  we  must  have  z  =  0  at  r  =  a  for  all  values  of  6  and  t. 
Thus  Jn(Ka)  =  0  so  the  possible  values  of  Ka  are  the  zeros  of  Jn.  As  in  Section  5, 
let  k  =  Ka7  that  is,  K  =  k/a.  Then  the  possible  values  of  k  for  each  Jn  are  kmn, 
the  zeros  of  Jn.  We  can  now  write  the  solutions  for  z  as 

,  ,  .  ,  ,  /  sin  n9  \  /  sin  kvt/a  \ 

(6-6)  ^  =  J^kr/a)  {  cos  n6  j  {  coskvt/a  /  ' 

For  a  given  initial  displacement  or  velocity  of  the  membrane,  we  could  find  z  as 
a  double  series  as  we  found  (5.17)  in  the  cylinder  temperature  problem.  However, 
here  we  shall  do  something  different,  namely  investigate  the  separate  normal  modes 
of  vibration  and  their  frequencies.  Recall  that  for  the  vibrating  string  (Section  4), 
each  n  gives  a  different  frequency  and  a  corresponding  normal  mode  of  vibration 
(Figure  4.2).  The  frequencies  of  the  string  are  v  =  nv/(2l );  all  frequencies  are 
integral  multiples  of  the  frequency  V\  =  v/{2 1)  of  the  fundamental.  For  the  circular 
membrane,  the  frequencies  are  [from  (6.6)] 

u>  kv 

2n  27 ra 

The  possible  values  of  k  are  the  zeros  kmn  of  the  Bessel  functions.  Each  value  of 
kmn  gives  a  frequency  umn  =  kmnv/(2ira ),  so  we  have  a  doubly  infinite  set  of  char¬ 
acteristic  frequencies  and  the  corresponding  normal  modes  of  vibration.  All  these 
frequencies  are  different,  and  they  are  not  integral  multiples  of  the  fundamental  as 
is  true  for  the  string.  This  is  why  a  drum  is  less  musical  than  a  violin.  From  your 
computer  or  tables  you  can  find  several  kmn  values  (Problem  2)  and  find  the  fre¬ 
quencies  as  (nonintegral)  multiples  of  the  fundamental  (which  corresponds  to  k\Q, 
the  first  zero  of  Jo).  Let  us  sketch  a  few  graphs  (Figure  6.1)  of  the  normal  vibration 
modes  corresponding  to  those  in  Figure  4.2  for  the  string,  and  write  the  correspond¬ 
ing  formulas  (eigenfunctions)  for  the  displacement  z  given  in  (6.6).  (For  simplicity, 
we  have  used  just  the  cos  n6  cos  kvt /a  solutions  in  Figure  6.1.)  In  the  fundamental 
mode  of  vibration  corresponding  to  fcio,  the  membrane  vibrates  as  a  whole.  In  the 
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Figure  6.1 

&20  mode,  it  vibrates  in  two  parts  as  shown,  the  +  part  vibrating  up  while  the 
—  part  vibrates  down,  and  vice  versa,  with  the  circle  between  them  at  rest.  We 
can  show  that  there  is  such  a  circle  (called  a  nodal  line)  and  find  its  radius.  Since 
&20  >  fcio  >  the  circle  r  =  a/cio/fc20  is  a  circle  of  radius  less  than  a.  Hence  it  is  a  circle 
on  the  membrane.  For  this  value  of  r,  Jo(k2or/a)  =  Jo(fc20^io/^2o))  =  A)(fcio)  =  0, 
so  points  on  this  circle  are  at  rest.  For  the  kn  mode,  cos  9  =  0  when  9  =  ±7t/2  and 
is  positive  or  negative  as  shown.  Continuing  in  this  way  you  can  sketch  any  normal 
mode  (Problem  1). 

It  is  difficult  experimentally  to  obtain  pure  normal  modes  of  a  vibrating  object. 
However,  a  complicated  vibration  will  have  nodal  lines  of  some  kind  and  it  is  easy 
to  observe  these.  Fine  sand  sprinkled  on  the  vibrating  object  will  collect  along  the 
nodal  lines  (where  there  is  no  vibration)  so  that  you  can  see  them  clearly — but 
see  Am.  J.  Phys.  72,  1345-1346,  (2004).  [For  experimental  work  on  the  vibrating 
circular  membrane,  see  Am.  J.  Phys.  35,  1029-1031,  (1967);  Am.  J.  Phys.  40, 
186-188,  (1972);  Am.  J.  Phys.  59,  376-377,  (1991).  Also  see  Problem  1(b).] 

PROBLEMS,  SECTION  6 

1.  (a)  Continue  Figure  6.1  to  show  the  fundamental  modes  of  vibration  of  a  circular 

membrane  for  n  =  0, 1,  2,  and  m  =  1,2,  3.  As  in  Figure  6.1,  write  the  formula 
for  the  displacement  z  under  each  sketch. 

(b)  Use  a  computer  to  set  up  animations  of  the  various  modes  of  vibration  of  a 
circular  membrane.  [This  has  been  discussed  in  a  number  of  places.  See,  for 
example,  Am.  J.  Phys.  67,  534-537,  (1999).] 

2.  Find,  from  computer  or  tables,  the  first  three  zeros  of  each  of  the  Bessel  func¬ 
tions  Jo,  Ji,  J2,  and  J3.  Find  the  first  six  frequencies  of  a  vibrating  circular  mem¬ 
brane  as  (non-integral)  multiples  of  the  fundamental  frequency. 

3.  Separate  the  wave  equation  in  two-dimensional  rectangular  coordinates  x,y.  Con¬ 
sider  a  rectangular  membrane  as  shown,  rigidly  attached  to  supports  along  its  sides. 
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Show  that  its  characteristic  frequencies  are 

Vnm  =  (v/2)y/  ( n/a )2  +  (m/6)2,  y _ 

where  n  and  m  are  positive  integers,  and  sketch  the  normal  b 

modes  of  vibration  corresponding  to  the  first  few  frequen- - - - - * 

cies.  That  is,  indicate  the  nodal  lines  as  we  did  for  the 
circular  membrane  in  Figure  6.1  and  Problem  1. 

Next  suppose  the  membrane  is  square.  Show  that  in  this  case  there  may  be 
two  or  more  normal  modes  of  vibration  corresponding  to  a  single  frequency.  ( Hint 
for  one  example:  72  +  l1  =  1“  +  72  =  52  +  52.)  This  is  an  example  of  what  is 
called  degeneracy ;  we  say  that  there  is  degeneracy  when  several  different  solutions 
of  the  wave  equation  (eigenfunctions)  correspond  to  the  same  frequency  (eigenvalue). 
Sketch  several  normal  modes  giving  rise  to  the  same  frequency.  Comment:  Compare 
Chapter  3,  Section  11,  where  an  eigenvalue  of  a  matrix  is  called  degenerate  if  several 
eigenvectors  correspond  to  it. 

4.  Find  the  characteristic  frequencies  for  sound  vibration  in  a  rectangular  box  (say 
a  room)  of  sides  a,  b,  c.  Hint:  Separate  the  wave  equation  in  three  dimensions  in 
rectangular  coordinates.  This  problem  is  like  Problem  3  but  for  three  dimensions 
instead  of  two.  Discuss  degeneracy  (see  Problem  3). 

5.  A  square  membrane  of  side  l  is  distorted  into  the  shape 

f(x,y)  =  xy(l  -  x)(l  -  y) 

and  released.  Express  its  shape  at  subsequent  times  as  an  infinite  series.  Hint:  Use 
a  double  Fourier  series  as  in  Problem  5.9. 

6.  Let  V  =  0  in  the  Schrodinger  equation  (3.22)  and  separate  variables  in  2-dimensional 
rectangular  coordinates.  Solve  the  problem  of  a  particle  in  a  2-dimensional  square 
box,  0  <  x  <  1,0  <  y  <  l.  This  means  to  find  solutions  of  the  Schrodinger  equation 
which  are  0  for  x  =  0,  x  =  l,  y  =  0,  y  =  l,  that  is,  on  the  boundary  of  the  box,  and 
to  find  the  corresponding  energy  eigenvalues.  Comments:  If  we  extend  the  idea  of 
a  “particle  in  a  box”  (see  Section  3,  Example  3)  to  two  or  three  dimensions,  the 
box  in  2D  might  be  a  square  (as  in  this  problem)  or  a  circle  (Problem  8);  in  3D 
it  might  be  a  cube  (Problem  7.17)  or  a  sphere  (Problem  7.19).  In  all  cases,  the 
mathematical  problem  is  to  find  solutions  of  the  Schrodinger  equation  with  V  =  0 
inside  the  box  and  4/  =  0  on  the  boundary  of  the  box,  and  to  find  the  corresponding 
energy  eigenvalues.  In  quantum  mechanics,  4*  describes  a  particle  trapped  inside  the 
box  and  the  energy  eigenvalues  are  the  possible  values  of  the  energy  of  the  particle. 

7.  In  your  Problem  6  solutions,  find  some  examples  of  degeneracy.  (See  Problem  3. 
Degeneracy  means  that  several  eigenfunctions  correspond  to  the  same  energy  eigen¬ 
value.) 

8.  Do  Problem  6  in  polar  coordinates  to  find  the  eigenfunctions  and  energy  eigenvalues 
of  a  particle  in  a  circular  box  r  <  a.  You  want  4/  =  0  when  r  =  a. 

7.  STEADY-STATE  TEMPERATURE  IN  A  SPHERE 

Find  the  steady-state  temperature  inside  a  sphere  of  radius  a  when  the  surface  of 
the  upper  half  is  held  at  100°  and  the  surface  of  the  lower  half  at  0°. 

Inside  the  sphere,  the  temperature  u  satisfies  Laplace’s  equation.  In  spherical 
coordinates  this  is  (see  Chapter  10,  Section  9) 

1  d  f  2du\  1  9  (  .  9u\  1  d2u 

r2  dr  \  dr  )  ^  r2  sin  9  dd  \  d9  )  r2  sin2  8  dcj>2 


(7.1)  V2u 
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We  separate  this  equation  following  our  standard  procedure.  Substitute 


(7.2) 


u  =  R(r)  0(0)  <&((/)) 


into  (7.1)  and  multiply  by  r2 /ROQ  to  get 


(7.3) 


j__d 

R  dr 


2  dR\ 
r  ~cRj 


1  1 


0  sin  9  dO 


sin  9 


dQ 

~d9 


1  d2$ 


4> 


sin 


G?</>2 


=  0. 


If  we  multiply  (7.3)  by  sin2  91  the  last  term  becomes  a  function  of  cf>  only  and  the 
other  terms  do  not  contain  </>.  Thus  we  obtain  the  (f)  equation  and  its  solutions: 


(7.4) 


1  d24» 


sin  rruj), 
cos  m<f>. 


The  separation  constant  must  be  negative  and  m  an  integer  to  make  4>  a  periodic 
function  of  </>  [see  the  discussion  after  (5.6)]. 

Equation  (7.3)  can  now  be  written  as 


(7.5) 


II  (r^)  +  1  J_A 

Rdr  \  dr  J  0  sin  9  d9  \  d9  )  sin2  9 


The  first  term  is  a  function  of  r  and  the  last  two  terms  are  functions  of  0,  so  we 
have  two  equations 


(7.6) 

(7.7) 


1  d 
sin  9  d9 


1  ±  (r2^\  =k 

Rdr  \  dr  ) 


Q  +  kQ  =  0. 


If  you  compare  (7.7)  with  the  equation  of  Problem  10.2  in  Chapter  12,  you  will 
see  that  (7.7)  is  the  equation  for  the  associated  Legendre  functions  if  k  =  l  (l  +  1). 
Recall  that  l  must  be  an  integer  in  order  for  the  solution  of  Legendre’s  equation  to 
be  finite  at  x  =  cos0  =  ±1,  that  is,  at  9  =  0  or  7r;  the  same  statement  is  true  for 
the  equation  for  the  associated  Legendre  functions.  The  corresponding  result  for 
(7.7)  is  that  k  must  be  a  product  of  two  successive  integers;  it  is  then  convenient 
to  replace  k  by  1(1  +  1),  where  l  is  an  integer.  The  solutions  of  (7.7)  are  then  the 
associated  Legendre  functions  (see  Problem  10.2,  Chapter  12) 


(7.8) 


0  =  if*  (cos  0). 


In  (7.6),  we  put  k  =  1(1  +  1);  you  can  then  easily  verify  (Problem  5.11)  that  the 
solutions  of  (7.6)  are 


(7.9) 


Since  we  are  interested  in  the  interior  of  the  sphere,  we  discard  the  solutions  r 
because  they  become  infinite  at  the  origin.  If  we  were  discussing  a  problem  (say 
about  water  flow  or  electrostatic  potential)  outside  the  sphere,  we  would  use  the 
r~i_1  solutions  and  discard  the  solutions  rl  because  they  become  infinite  at  infinity. 
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The  basis  functions  for  our  problem  are  then 


(7.10) 

«  =  r‘Pr(cos9)|Si,lm^ 

1  cos  rruj}. 

[The  functions  PJn{ cos  (9)  sin  rrup  and  P[n(cos9)  cos  m<t>  are  called  spherical  harmon- 
zcsand  are  often  denoted  by  Ylm(9,(j))-,  also  see  Problem  16.]  If  the  surface  tem¬ 
perature  at  r  =  a  were  given  as  a  function  of  9  and  (f>,  we  would  have  a  double 
series  (summed  on  l  and  to).  For  the  given  surface  temperatures  in  our  problem 
(100°  on  the  top  hemisphere  and  0°  on  the  lower  hemisphere),  the  temperature  is 
independent  of  (j>)  thus  in  (7.10)  we  must  have  m  =  0,cos  rruf)  =  1.  The  solutions 

(7.10)  then  reduce  to  rlPi(cos9).  We  write  the  solution  of  the  problem  as  a  series 
of  these  basis  functions: 

OO 

(7.11)  u  =  y>yp;(c0Sg). 

1=0 

We  determine  the  coefficients  c;  by  using  the  given  temperatures  when  r  =  a;  that 
is,  we  must  have 

OO 

(7.12)  ur=a  =  J2  ci  a1  Pi  (cos  9) 

1=0 

_  f  100,  0  <9  <  |,  that  is,  0  <  cos  9  <  1, 

\  0,  \  <9  <  7T,  that  is,  —1  <  cos  9  <  0, 

or,  with  x  =  cos  9, 

OO 

(7.13)  ur=a  =  J2  cialPi(x)  =  100 f(x) 

;=o 


where 


/(z) 


0,  — 1  <  x  <  0, 

1,  0  <  x  <  1. 


(Note  that  here  x  just  stands  for  cos  9  and  is  not  the  coordinate  x.)  In  Section  9  of 
Chapter  12,  we  expanded  this  f(x)  in  a  series  of  Legendre  polynomials  and  obtained: 


(7.14) 


/O)  =  \po(x)  +  ^Pi(x)  -  yjrp3(a0  +  ^P^x)  +  •  •  •  • 


The  coefficients  c;  in  (7.13)  are  just  these  coefficients  times  100/a*.  Substituting 
the  c’s  into  (7.11),  we  get  the  final  solution: 


(7.15)  u  =  100  iP0(cos6»)  +  |^Pi(cos6>)  -  ^  (£)3  P3(cos6») 

+  11  (L' 
'  32  Va, 


(cos  9) 
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We  can  do  variations  of  this  problem.  Notice  that  we  have  not  even  mentioned 
so  far  what  temperature  scale  we  are  using  (Celsius,  Fahrenheit,  absolute,  etc.). 
This  is  a  very  easy  adjustment  to  make  once  we  have  a  solution  in  any  one  scale. 
To  see  why,  observe  that  if  u  is  a  solution  of  Laplace’s  equation  V2u  =  0  or  of  the 
heat  flow  equation  V2u  =  (1  /a2)(du/dt),  then  u  +  C  and  Cu  are  also  solutions  for 
any  constant  C.  If  we  add,  say,  50°  to  the  solution  (7.15),  we  have  the  temperature 
distribution  inside  a  sphere  with  the  top  half  of  the  surface  at  150°  and  the  lower  half 
at  50°.  If  we  multiply  the  solution  (7.15)  by  2,  we  find  the  temperature  distribution 
with  given  surface  temperatures  of  200°  and  0°,  and  so  on. 

The  temperature  of  the  equatorial  plane  9  =  7r/2  or  cos  9  =  0  as  given  by  equa¬ 
tions  (7.11)  to  (7.15)  is  halfway  between  the  top  and  bottom  surface  temperatures, 
because  Legendre  series,  like  Fourier  series,  converge  to  the  midpoint  of  a  jump  in 
the  function  which  was  expanded  to  get  the  series.  To  solve  the  problem  of  the 
temperature  in  a  hemisphere  given  the  temperatures  of  the  curved  surface  and  of 
the  equatorial  plane,  we  need  only  imagine  the  lower  hemisphere  in  place  and  at 
the  proper  temperature  to  give  the  desired  average  on  the  equatorial  plane.  When 
the  temperature  of  the  equatorial  plane  is  0°,  this  amounts  to  defining  the  function 
f(x)  in  (7.13)  on  (—1,0)  to  make  it  an  odd  function. 


PROBLEMS,  SECTION  7 


Find  the  steady-state  temperature  distribution  inside  a  sphere  of  radius  1  when  the 
surface  temperatures  are  as  given  in  Problems  1  to  10. 


1. 

3. 

5. 


35  cos4  9 
cos  8  —  3  sin2  6 
|cos  6\ 


2.  cos  6  —  cos3  6 
4.  5  cos3  9  —  3  sin2  9 

6.  7t/2 —9.  See  Chapter  12,  Problem  9.4. 


7. 


8. 


f  cos  8,  0  <  9  <  n/2,  that 

(  0,  7t/2  <  9  <  7r,  that 

f  100°,  0  <  9  <  7t/3, 

i  0°,  otherwise.  Hlnt: 


is,  upper  hemisphere, 
is,  lower  hemisphere. 


See  Problem  9.8  of  Chapter  12. 


9.  3  sin  9  cos  8  sin  (j>.  Hint:  See  equation  (7.10)  and  Chapter  12,  equation  (10.6). 

10.  sin2  8  cos  8  cos  2 (ft  —  cos  8.  (See  Problem  9.) 

11.  Find  the  steady-state  temperature  distribution  inside  a  hemisphere  if  the  spherical 
surface  is  held  at  100°  and  the  equatorial  plane  at  0° .  Hint:  See  the  last  paragraph 
of  this  section  above. 

12.  Do  Problem  11  if  the  curved  surface  is  held  at  cos 2  8  and  the  equatorial  plane  at 
zero.  Careful:  The  answer  does  not  involve  P2;  read  the  last  sentence  of  this  section. 

13.  Find  the  electrostatic  potential  outside  a  conducting  sphere  of  radius  a  placed  in 
an  originally  uniform  electric  field,  and  maintained  at  zero  potential.  Hint:  Let 
the  original  field  E  be  in  the  negative  0  direction  so  that  E  =  — Eok.  Then  since 
E  =  —  V4>,  where  is  the  potential,  we  have  $  =  Eqz  =  Eorcos9  (Verify  this!) 
for  the  original  potential.  You  then  want  a  solution  of  Laplace’s  equation  V2u  =  0 
which  is  zero  at  r  =  a  and  becomes  u  ~  for  large  r  (that  is,  far  away  from  the 
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sphere).  Select  the  solutions  of  Laplace’s  equation  in  spherical  coordinates  which 
have  the  right  9  and  <j>  dependence  (there  are  just  two  such  solutions)  and  find  the 
combination  which  reduces  to  zero  for  r  =  a. 


14.  Find  the  steady-state  temperature  distribution  in  a  spherical  shell  of  inner  radius 
1  and  outer  radius  2  if  the  inner  surface  is  held  at  0°  and  the  outer  surface  has  its 
upper  half  at  100°  and  its  lower  half  at  0°.  Hint:  r  =  0  is  not  in  the  region  of 
interest,  so  the  solutions  r_!_1  in  (7.9)  should  be  included.  Replace  cir1  in  (7.11) 
by  ( ctr 1  +  6ir_i_1). 

15.  A  sphere  initially  at  0°  has  its  surface  kept  at  100°  from  t  =  0  on  (for  example,  a 
frozen  potato  in  boiling  water!).  Find  the  time-dependent  temperature  distribution. 
Hint:  Subtract  100°  from  all  temperatures  and  solve  the  problem;  then  add  the  100° 
to  the  answer.  Can  you  justify  this  procedure?  Show  that  the  Legendre  function 
required  for  this  problem  is  Pq  and  the  r  solution  is  (l/^/r)Ji/2  or  jo  [see  (17.4)  in 
Chapter  12] .  Since  spherical  Bessel  functions  can  be  expressed  in  terms  of  elementary 
functions,  the  series  in  this  problem  can  be  thought  of  as  either  a  Bessel  series  or  a 
Fourier  series.  Show  that  the  results  are  identical. 


16.  Separate  the  wave  equation  in  spherical  coordinates,  and  show  that  the  6,  <j>  solutions 
are  the  spherical  harmonics  Y™(9,  cp)  =Pjm(cos  0)e±*m<#>  and  the  r  solutions  are  the 
spherical  Bessel  functions  ji{kr)  and  yi(kr)  [Chapter  12,  equations  (17.4)]. 

17.  Do  Problem  6.6  in  3  dimensional  rectangular  coordinates.  That  is,  solve  the  “particle 
in  a  box”  problem  for  a  cube. 


18. 


Separate  the  time-independent  Schrodinger  equation  (3.22)  in  spherical  coordinates 
assuming  that  V  =  V(r)  is  independent  of  6  and  <p.  (If  V  depends  only  on  r,  then 
we  are  dealing  with  central  forces,  for  example,  electrostatic  or  gravitational  forces.) 
Hints:  You  may  find  it  helpful  to  replace  the  mass  m  in  the  Schrodinger  equation 
by  M  when  you  are  working  in  spherical  coordinates  to  avoid  confusion  with  the 
letter  m  in  the  spherical  harmonics  (7.10).  Follow  the  separation  of  (7.1)  but  with 
the  extra  term  [V(r)  —  75]  47.  Show  that  the  8,  <p  solutions  are  spherical  harmonics 
as  in  (7.10)  and  Problem  16.  Show  that  the  r  equation  with  k  =  1(1  +  1)  is  [compare 
(7.6)] 


Id/  2dR\ 
R  dr  \  dr  ) 


2  Mr2 


[V(r)  -  E ) 


1(1  +  1). 


19.  Find  the  eigenfunctions  and  energy  eigenvalues  for  a  “particle  in  a  spherical  box” 
r  <  a.  Hints:  See  Problem  6.6.  Write  the  R  equation  from  Problem  18  with  V  =  0, 
and  compare  Chapter  12,  Problem  17.6,  with  y  =  R,  x  =  f3r  where  j3  =  \J2 ME/ti2, 
and  n  =  l. 


20.  Write  the  Schrodinger  equation  (3.22)  if  -0  is  a  function  of  x,  and  V  =  \muj2x2 
(this  is  a  one-dimensional  harmonic  oscillator).  Find  the  solutions  ipn(x)  and  the 
energy  eigenvalues  En.  Hints:  In  Chapter  12,  equation  (22.1)  and  the  first  equation 
in  (22.11),  replace  x  by  ax  where  a  =  \J muj/h.  (Don’t  forget  appropriate  factors 
of  a  for  the  a;’s  in  the  denominators  of  D  =  d/dx  and  ip"  =  d2ip/dx2.)  Compare 
your  results  for  equation  (22.1)  with  the  Schrodinger  equation  you  wrote  above  to 
see  that  they  are  identical  if  En  =  (n  +  |)7iu.  Write  the  solutions  ipn(x)  of  the 
Schrodinger  equation  using  Chapter  12,  equations  (22.11)  and  (22.12). 

21.  Separate  the  Schrodinger  equation  (3.22)  in  rectangular  coordinates  in  3  dimensions 
assuming  that  V  =  \ mu!2(x2+y2+z 2).  (This  is  a  3-dimensional  harmonic  oscillator). 
Observe  that  each  of  the  separated  equations  is  of  the  form  of  the  one- dimensional 
oscillator  equation  in  Problem  20.  Thus  write  the  solutions  ipn(x,y,z)  for  the  3- 
dimensional  problem,  where  n  =  nx  +  ny  +  nz.  Find  the  energy  eigenvalues  En  and 
their  degree  of  degeneracy  (see  Problem  6.7  and  Chapter  15,  Problem  4.21). 
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22.  Find  the  energy  eigenvalues  and  eigenfunctions  for  the  hydrogen  atom.  The  potential 
energy  is  V ( r )  =  — e2/r  in  Gaussian  units,  where  e  is  the  charge  of  the  electron  and  r 
is  in  spherical  coordinates.  Since  V  is  a  function  of  r  only,  you  know  from  Problem  18 
that  the  eigenfunctions  are  R(r)  times  the  spherical  harmonics  V;m(#,  </>),  so  you  only 
have  to  find  R(r).  Substitute  V ( r )  into  the  R  equation  in  Problem  18  and  make  the 
following  simplifications:  Let  x  =  2 r/a,  y  =  rR ;  show  that  then 


r  =  ax/2,  R{r) 


-y{x), 


d 

dr 


2d  d  f  2  dR\  2  u 
a  chi’  d/\~cb:)  =  aXV 


Let  a2  =  —2 ME/Ti2  (note  that  for  a  bound  state,  E  is  negative,  so  a 2  is  positive) 
and  A  =  Me2a/h2,  to  get  the  first  equation  in  Problem  22.26  of  Chapter  12.  Do 
this  problem  to  find  y(x),  and  the  result  that  A  is  an  integer,  say  n.  [Caution:  not 
the  same  n  as  in  equation  (22.26)].  Hence  find  the  possible  values  of  a  (these  are 
the  radii  of  the  Bohr  orbits),  and  the  energy  eigenvalues.  You  should  have  found  a 
proportional  to  n;  let  a  =  na,  where  a  is  the  value  of  a  when  n  —  1,  that  is,  the 
radius  of  the  first  Bohr  orbit.  Write  the  solutions  R(r)  by  substituting  back  y  =  rR, 
and  x  =  2 r/(na),  and  find  En  from  a. 


8.  POISSON'S  EQUATION 

We  are  going  to  derive  Poisson’s  equation  (1.2)  for  a  simple  problem  whose  answer 
we  know  in  advance.  Using  our  known  solution,  we  shall  be  able  to  see  a  method 
of  solving  more  difficult  problems. 

Recall  from  Chapter  6,  Section  8,  that  the  gravitational  field  is  conservative, 
that  is,  curlF  =  0,  and  there  is  a  potential  function  V  such  that  F  =  —  VU.  If 
we  consider  the  gravitational  field  at  a  point  P  due  to  a  point  mass  m  a  distance  r 
away,  we  have 


(8.1) 


Gm 

and  F  = - — u 


where  u  is  a  unit  vector  along  r  toward  P.  It  is  straightforward  to  show  that 
divF  =  0  and  V  satisfies  Laplace’s  equation  (Problem  1),  that  is, 

(8.2)  V  F  =  -V  •  VU  =  -V2U  =  0. 


Now  suppose  there  are  many  masses  rrii  at  distances  r,  from  P.  The  total 
potential  at  P  is  the  sum  of  the  potentials  due  to  the  individual  rrii,  that  is, 

\  -v  Grrii 

“  n 

i 

and  the  total  gravitational  field  at  P  is  the  vector  sum  of  the  fields  Fj,  that  is, 

F  =  -^VV)  =  -  VU. 

i 

Note  that  we  are  taking  it  for  granted  that  none  of  the  masses  to*  are  at  P,  that  is, 
that  no  ri  is  zero.  Since 


i 


V  •  Fj 


-V2U  =  0, 


Section  8 


Poisson's  Equation  653 


we  have  also 


V  •  F  =  -V2V  =  0. 


Instead  of  a  number  of  masses  rrii,  we  can 
consider  a  continuous  distribution  of  mass 
inside  a  volume  r  (Figure  8.1).  Let  p  be  the 
mass  density  of  the  distribution;  then  the 
mass  in  an  element  dr  is  pdr.  The  gravita¬ 
tional  potential  at  P  due  to  this  mass  pdr  is 
—  ( Gp/r )  dr.  Then  the  total  gravitational  po¬ 
tential  at  P  due  to  the  whole  mass  distribution 
is  the  triple  integral  over  the  volume  r: 


volume  r 


As  before,  the  contribution  to  V  at  P  due  to  each  bit  of  mass  satisfies  Laplace’s 
equation  and  therefore  V  satisfies  Laplace’s  equation.  Also  the  total  field  F  at  P  is 
the  vector  sum  of  the  fields  due  to  the  elements  of  mass,  and  as  before  we  have 

V  F  =  -V2V  =  0. 


Again  note  that  we  are  implicitly  assuming  that  none  of  the  mass  distribution 
coincides  with  P,  that  is,  that  r  /  0,  which  means  that  point  P  is  not  a  point  of 
the  region  r. 

Now  let  us  investigate  what  happens  if  P  is  a  point  of  r.  Can  we  find  V  from  (8.3) 
and  does  V  satisfy  Laplace’s  equation?  Let  S'  be  a  small  sphere  of  radius  a  about 
P;  imagine  all  the  mass  removed  from  inside  S  (Figure  8.2).  Then  our  previous  dis¬ 


cussion  holds  at  points  inside  S  since 
these  points  are  not  in  the  mass  distri¬ 
bution.  If  F'  and  V'  are  the  new  field 
and  potential  (with  the  matter  inside 
S  removed),  then  V  •  F'  =  —  V2!?'  =  0 
at  points  inside  S.  Now  restore  the  mass 
inside  S;  let  F  and  V  represent  the  field 
and  potential  due  to  the  whole  distri¬ 
bution  and  let  Fg  and  Vs  represent  the 
field  and  potential  due  to  just  the  mass 
inside  S.  Then  F  =  F'  +  Fg  and  at 
points  inside  S 

(8.4)  V-F  =  V-F' 


V  •  Fg  =  V  •  Fg 


since  V  •  F'  =  0  inside  S. 

By  the  divergence  theorem  (see  Figure  8.2  and  Chapter  6,  Section  10) 


(8.5) 


V  •  Fg  dr  = 


Fg  •  n  da. 


volume  of  S 


surface  of  S 


If  we  let  the  radius  a  of  S  tend  to  zero,  the  density  p  of  matter  inside  S  tends  to 
its  value  at  P;  thus  for  small  a,  S  contains  a  total  mass  M  approximately  equal  to 
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|7r a3p,  where  p  is  evaluated  at  P.  The  gravitational  field  at  the  surface  of  S  due 
to  this  mass  is  of  magnitude 


p  GM  4 

Fs  =  — =  G-nap 
a  o 

directed  toward  P.  Thus  in  (8.5),  Fg  •  n  =  —  | Gnap  because  Fs  and  n  are  antipar¬ 
allel.  Since  Fs  is  constant  over  the  surface  S,  the  right-hand  side  of  (8.5)  is  Fs  •  n 
times  the  area  of  the  sphere.  The  left-hand  side  is,  for  small  a,  approximately  the 
value  of  V  •  Fs  at  P  times  the  volume  of  S.  Then  we  have 

(V  •  Fs)(^7ra3)  =  (-^G7rap)(47ra2) 


or 

(8.6) 

V  •  Fg  =  — 47t Gp  at  P. 

Since 

V  •  Fs  =  V  •  F  =  -  V  •  VP  =  -V2V, 

we  have 

(8.7) 

V2V  =  47 iGp. 

This  is  Poisson’s  equation;  we  see  that  the  gravitational  potential  in  a  region  con¬ 
taining  matter  satisfies  Poisson’s  equation  as  claimed  in  (1.2).  Note  that  if  p  =  0, 
(8.7)  becomes  (8.2)  as  it  should. 

Next  we  must  consider  whether  our  formula  (8.3)  for  V  is  valid  when  P  is  a 
point  of  the  mass  distribution.  The  integral  appears  to  diverge  at  r  =  0,  but  this 
is  not  really  so  as  we  see  most  easily  by  using  spherical  coordinates.  Then  (8.3) 
becomes 

V  =  —  J j J  r2  sin  8  dr  dO  dcj) 

volume  r 

and  we  see  that  there  is  no  trouble  when  r  =  0.  Thus  (8.3)  is  valid  in  general  and 
gives  a  solution  for  (8.7). 

Using  the  notation  of  (1.2)  for  Poisson’s  equation  [that  is,  replacing  AirGp  by  / 
and  V  by  u  in  (8.7)  and  (8.3)]  we  can  write 


(8.8) 


f  dr 


is  a  solution  of  V2u  =  /. 


In  the  more  detailed  notation  needed  when  we  use  this  solution  in  a  problem,  (8.8) 
becomes  (see  Figure  8.3): 
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In  (8.9)  and  Figure  8.3,  the  point  ( x,y,z )  is  the 
point  at  which  we  are  calculating  the  potential  u\ 
the  point  (x',y',z')  is  a  point  in  the  mass  distri¬ 
bution  over  which  we  integrate;  r  in  (8.8)  is  the 
distance  between  these  two  points  and  is  written 
out  in  full  in  (8.9). 

Equations  (8.8)  or  (8.9)  actually  give  a  very 
special  solution  of  Poisson’s  equation.  Recall  that 
it  is  customary  to  take  the  zero  point  for  gravita¬ 
tional  (and  electrostatic)  potential  energy  at  infin¬ 
ity,  and  this  is  what  we  have  done.  Thus  (8.8)  or  (8.9)  gives  a  solution  of  Poisson’s 
equation  which  tends  to  zero  at  infinity.  In  another  problem  this  may  not  be  what 
we  want.  For  example,  suppose  we  have  an  electrostatic  charge  distribution  near 
a  grounded  plane.  The  electrostatic  potential  satisfies  Poisson’s  equation,  but  here 
we  want  a  solution  which  is  zero  on  the  grounded  plane  rather  than  at  infinity.  To 
see  how  we  might  find  such  a  solution,  observe  that  if  u  is  a  solution  of  Poisson’s 
equation,  and  w  is  any  solution  of  Laplace’s  equation  (V2ui  =  0),  then 

(8.10)  V2(u  +  w)  =  V2u  +  V2w  =  V2u  =  /; 

thus  u  +  w  is  a  solution  of  Poisson’s  equation.  Then  we  can  add  to  the  solution 
(8.9)  any  solution  of  Laplace’s  equation;  the  combination  must  be  adjusted  to  fit 
the  given  boundary  conditions  just  as  we  have  done  in  the  problems  in  previous 
paragraphs. 

Example  1.  Let  us  do  the  following  simple  problem  to  illustrate  this  process.  In  Figure 
8.4,  a  point  charge  q  at  (0,  0,  a)  is  outside  a  grounded  sphere  of  radius  R  and  center 


(x'.y'.z) 


Figure  8.3 


Figure  8.4 

at  the  origin.  Our  problem  is  to  find  the  electrostatic  potential  V  at  points  outside 
the  sphere.  The  potential  V  and  the  charge  density  p  are  related  by  Poisson’s 
equation 

(8.11)  V2P  =  —47 ip  (in  Gaussian  units). 

The  potential  at  (x,y,z)  due  to  a  given  charge  distribution  p  is  given  by  (8.8)  or 
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(8.9)  with  /  =  —4np: 


(8.12)  V(x,  y,z)  =  —  — 


-4tt  p(x',y',z') 


\/{x  —  x')2  +  (y  —  y')2  +  (z  —  z')2 


dx'  dy '  dz' . 


For  a  given  space-charge  distribution,  we  would  next  evaluate  this  integral.  For  the 
single  point  charge  q ,  we  have  (x',y',z')  =  (0,0,  a)  and  we  replace  fff  pdx'  dy'  dz' 
(which  is  simply  the  total  charge)  by  q  to  obtain 


(8.13) 


V  = 


i/a :2  +y2  +  (z  —  a)2 


[We  could,  of  course,  simply  have  written  down  (8.13)  without  using  (8.8);  (8.13)  is 
just  the  electrostatic  formula  corresponding  to  the  gravitational  formula  (8.1)  with 
which  we  started.] 

Now  we  want  to  add  to  (8.13)  a  solution  of  Laplace’s  equation  such  that  the 
combination  is  zero  on  the  given  sphere  (Figure  8.4).  It  will  be  convenient  to  change 
to  spherical  coordinates  and  to  use  solutions  of  Laplace’s  equation  in  spherical 
coordinates.  [Note  a  change  in  the  meaning  of  r  from  now  on.  We  have  been  using 
r  to  mean  the  distance  from  q  at  (x',y',z')  to  ( x,y,z );  from  now  on  we  want  to 
use  it  to  mean  the  distance  from  (0,0,0)  to  (a :,y,z).  See,  for  example,  Figures  8.3 
and  8.4.]  Writing  Vq  for  V  in  (8.13)  (to  distinguish  it  from  our  final  answer  which 
will  be  a  sum  of  Vq  and  a  solution  of  Laplace’s  equation)  and  changing  to  spherical 
coordinates,  we  get 


(8.14) 


Vq  =  = 

V  r2  —  2  ar  cos  6  +  a2 


The  solutions  of  Laplace’s  equation  in  spherical  coordinates  are  (Section  7): 

j 


(8.15) 


_7_i  \P["'(cos0)  f  Smm7 
r  I  1  cos  mcp 


Since  we  are  interested  in  the  region  outside  the  sphere,  we  want  r  solutions  which 
do  not  become  infinite  at  infinity;  thus  we  use  r-^1  and  discard  the  rl  solutions. 
Because  the  physical  problem  is  symmetric  about  the  z  axis,  we  look  for  solutions 
independent  of  <(>;  that  is,  we  choose  m  =  0,cos  mcj)  =  1.  Then  the  basis  functions 
for  our  problem  are  r~l~1Pi(cos0)  and  we  try  to  find  a  solution  of  the  form 


(8.16) 


V  =  Vq  +  cif  1  1P;(cos6l). 


We  must  satisfy  the  boundary  condition  V  =  0  when  r  =  R.  This  gives 

q 


(8.17) 


Vr=R  = 


\/R2  —  2aR  cos  6  +  a2 


°iR  1  1Pi{cos  9)  =  0. 


Thus  we  want  to  expand  Vq  in  a  Legendre  series.  Since  Vq  is  essentially  the  gener¬ 
ating  function  for  Legendre  polynomials,  this  is  very  easy.  Comparing  (8.17)  and 
the  formulas  of  Chapter  12,  Section  5  [(5.1)  and  (5.2),  or  more  simply,  (5.12)  and 
(5.17)],  we  find 


(8.18) 


\J  R2  —  2ai?.cos  9  +  a2 


RlPi  (cos  9) 


7^+1 


Section  8 


Poisson's  Equation  657 


Thus  the  coefficients  c;  in  (8.17)  are  given  by 

qRl 


(8.19) 


ciR 


-i- 1 


■)!+ 1 


or  ci  =  -- 


qi?2Z+1 


il+t 


Substituting  (8.19)  into  (8.16),  we  obtain  the  final  solution  for  V: 


(8.20) 


V  = 


yjr2  —  2  ar  cos  9  +  a2 


f?2!+1r  *  1P/(cos0) 


7^+1 


Since  the  second  term  in  (8.20)  is  of  the  same  general  form  as  (8.18),  we  can 
simplify  (8.20)  by  summing  the  series  to  get  (Problem  2) 


(8.21) 


V  = 


_ Q _ 

\Jr2  —  2 ar  cos  9  +  a2 


_ (R/a)q _ 

yjr2  +  ( R2/a )2  —  2 r{R2/a)  cos  6 


Formula  (8.21)  has  a  very  interesting  physical  interpretation.  The  second  term  is 
the  potential  of  a  charge  —(R/a)q  at  the  point  (0,  0,  i?2/a);  thus  we  could  replace  the 
grounded  sphere  by  this  charge  and  have  the  same  potential  for  r  >  R.  This  result 
can  be  shown  also  by  elementary  analytic  geometry  and  is  known  as  the  “method 
of  images.”  For  problems  with  simple  geometry  (involving  planes,  spheres,  circular 
cylinders),  it  may  offer  a  simpler  method  of  solution  than  the  one  we  have  discussed; 
however,  our  purpose  was  to  illustrate  the  more  general  method. 


Use  of  Green  Functions  In  Chapter  8,  Section  12,  we  used  Green  functions 
to  solve  ordinary  differential  equations  with  a  nonzero  right-hand  side.  Here  we 
consider  the  use  of  Green  functions  to  solve  a  corresponding  partial  differential 
equation  in  three  dimensions,  namely  Poisson’s  equation 

(8.22)  V2u  =  /(r)  =  f(x,y,z). 

Suppose  that  we  have  a  solution  of  Poisson’s  equation  when  the  right  hand  side  is 
a  3-dimensional  5  function  (see  Chapter  8,  Sections  11  and  12): 

(8.23)  V2G(r,  r')  =  5(r  —  r')  =  S(x  —  x')5(y  —  y')6(z  —  z'). 

The  three-dimensional  S  function  has  the  property  that 

(8.24)  J  jj  f{x' ,  y ',  z')6(r  -  r')  dr'  =  f(x,  y,  z) 

if  the  volume  of  integration  includes  the  point  {x,  y,  z)  (and  the  integral  is  zero 
otherwise).  Recall  that  the  right-hand  side  of  Poisson’s  equation  is  proportional  to 
the  mass  density  or  the  charge  density.  The  volume  integral  of  the  density  gives  the 
total  mass  or  total  charge.  Since  fff  S(r  —  r')  dr'  =  1,  the  right-hand  side  of  (8.23) 
corresponds  to  a  point  mass  or  point  charge.  That  is,  the  Green  function  in  (8.23) 
is  the  potential  due  to  a  point  source.  Just  as  we  showed  in  Chapter  8,  Section  12, 
that  (12.4)  is  a  solution  of  (12.1),  we  find  here  that  a  solution  of  (8.22)  is  given  by 
(see  Problem  6) 


u(r) 


(8.25) 


G{v,  r')/(r')  dr' . 
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In  equation  (8.9)  we  found  that  a  solution  of  (8.22)  is 


(8.26) 


i(r)  =  - 


1 

47T 


/  CO 

Ir  —  r'l 


dr' . 


Comparing  (8.25)  and  (8.26),  we  conclude  that  a  solution  of  (8.23)  is 

1 


(8.27) 


G(r,r')  =  - 


47r|r  —  r'j 


Now  (8.26)  and  (8.27)  give  solutions  which  are  zero  at  infinity;  usually  we  want 
solutions  which  are  zero  on  some  given  surface  (for  example,  zero  electrostatic  po¬ 
tential  on  a  grounded  sphere  or  plane).  In  order  to  obtain  such  a  solution,  we  add 
to  (8.27)  a  solution  F( r,r')  of  Laplace’s  equation  chosen  so  that  the  new  Green 
function 

(8.28)  G(ry)  =  ^_i__  +  .F(1,r') 


satisfies  the  desired  zero  boundary  conditions.  Then  (8.25)  with  G(r,  r')  as  in  (8.28) 
gives  a  solution  of  (8.22)  which  is  zero  of  the  boundary.  For  example,  in  equation 
(8.21),  V  is  the  potential  outside  the  grounded  sphere  r  =  R  due  to  a  point  charge 
at  r  =  a  >  R.  Rewriting  that  result  in  our  present  notation  gives  the  Green  function 
(8.28)  which  satisfies  (8.23)  and  is  zero  on  the  sphere  r  =  R,  namely  (Problem  7) 


(8.29) 


G(r,r') 


1 

47r|r  —  r'j 


R/r' 

Att\v  —  R2r' /r'2\ 


(Also  see  Problems  8  and  9.) 


PROBLEMS,  SECTION  8 

1.  Show  that  the  gravitational  potential  V  =  —Gm/r  satisfies  Laplace’s  equation,  that 
is,  show  that  V2(l/r)  =  0  where  r2  =  x2  +  y2  +  z2,  r  ^  0. 

2.  Using  the  formulas  of  Chapter  12,  Section  5,  sum  the  series  in  (8.20)  to  get  (8.21). 

3.  Do  the  problem  in  Example  1  for  the  case  of  a  charge  q  inside  a  grounded  sphere 
to  obtain  the  potential  V  inside  the  sphere.  Sum  the  series  solution  and  state  the 
image  method  of  solving  this  problem. 

4.  Do  the  two-dimensional  analogue  of  the  problem  in  Example  1.  A  “point  charge” 
in  a  plane  means  physically  a  uniform  charge  along  an  infinite  line  perpendicular  to 
the  plane;  a  “circle”  means  an  infinitely  long  circular  cylinder  perpendicular  to  the 
plane.  However,  since  all  cross  sections  of  the  parallel  line  and  cylinder  are  the  same, 
the  problem  is  a  two-dimensional  one.  Hint:  The  potential  must  satisfy  Laplace’s 
equation  in  charge-free  regions.  What  are  the  solutions  of  the  two-dimensional 
Laplace  equation? 

5.  Find  the  method  of  images  for  problem  4. 

6.  Substitute  (8.25)  into  (8.22)  and  use  (8.23)  and  (8.24)  to  show  that  (8.25)  is  a 
solution  of  (8.22). 

7.  Verify  that  the  Green  function  in  (8.29)  is  zero  when  r  =  R.  Also  verify  that  the 
point  at  which  the  second  term  becomes  infinite  is  inside  the  sphere,  so  outside  the 
sphere  this  term  satisfies  Laplace’s  equation  as  required.  Thus  write  a  triple  integral 
for  the  solution  of  (8.22)  for  r  >  R  which  is  zero  on  the  sphere  r  —  R. 
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8.  Show  that  the  Green  function  (8.28)  which  is  zero  on  the  piane  2  =  0  is 

G(r,  r')  =  -  -t-  [(a;  -  x'f  +  (y  -  y'f  +  (z  -  z'f]  1/2 

+  -^[{x-  x')2  +  {y-  y'f  +  (z  +  z')2]  ~1/2  . 

Hence  write  a  triple  integral  for  the  solution  of  (8.22)  for  2  >  0  which  is  zero  for 
2  =  0. 

9.  Show  that  our  results  can  be  extended  to  find  the  following  solution  of  (8.22)  which 
satisfies  given  nonzero  boundary  conditions: 

u(r)  =  JJJ  G(r,r')/(r')dr'  +  JJ  da' 

where  G(r,  r')  is  the  Green  function  (8.28)  which  is  zero  on  the  surface  a,  and 
dG/dn'  =  VG  •  n'  is  the  normal  derivative  of  G  (see  Chapter  6,  Section  6).  Hints: 
In  Green’s  second  identity  (Chapter  6,  Problem  10.16)  let  <j>  =  u( r)  and  ip  =  G(r,  r'), 
and  use  (8.22)  and  (8.23)  to  find  \72<f>  and  V2ip.  Comment:  Although  we  derived  the 
divergence  theorem  and  so  Green’s  identities  only  for  bounded  regions  in  Chapter  6, 
they  are  valid  for  unbounded  regions  if  the  functions  involved  tend  to  zero  sufficiently 
rapidly. 

9.  INTEGRAL  TRANSFORM  SOLUTIONS  OF  PARTIAL  DIFFERENTIAL  EQUATIONS 

Laplace  Transform  Solutions  We  have  seen  (Chapter  8,  Section  9)  that  tak¬ 
ing  the  Laplace  transform  of  an  ordinary  differential  equation  converts  it  into  an 
algebraic  equation.  Taking  the  Laplace  transform  of  a  partial  differential  equation 
reduces  the  number  of  independent  variables  by  one,  and  so  converts  a  two- variable 
partial  differential  equation  into  an  ordinary  differential  equation.  To  illustrate  this, 
we  solve  the  following  problem. 


Example  1.  A  semi-infinite  bar  (extending  from  a;  =  0tox  =  oo),  with  insulated  sides,  is 
initially  at  the  uniform  temperature  u  =  0°.  At  t  =  0,  the  end  at  x  =  0  is  brought 
to  it  =  100°  and  held  there.  Find  the  temperature  distribution  in  the  bar  as  a 
function  of  x  and  t. 

The  differential  equation  satisfied  by  u  is 


(9.1) 


d2u  1  du 
dx 2  a 2  dt ' 


We  are  going  to  take  the  t  Laplace  transform  of  (9.1);  the  variable  x  will  just  be  a 
parameter  in  this  process.  Let  U  be  the  Laplace  transform  of  u ,  that  is, 


(9.2) 


U{x,p) 


By  Chapter  8,  equation  (9.1)  we  have 


L 


^  =pU  -  ut=o  =  pU 


since  u  =  0  when  t  =  0.  Also 


du 

~dt 
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(remember  that  x  is  just  a  parameter  here;  we  are  taking  a  t  Laplace  transform). 
The  transform  of  (9.1)  is  then 


(9.3) 


d2U 
dx 2 


Now  if  we  think  of  p  as  a  constant  and  x  as  the  variable,  this  is  an  ordinary  differ¬ 
ential  equation  for  U  as  a  function  of  x.  Its  solutions  are 


(9.4) 


U  = 


g  ~(Vp/a)x_ 


To  find  the  correct  combination  of  these  solutions  to  fit  our  problem,  we  need  the 
Laplace  transforms  of  the  boundary  conditions  on  u  since  these  give  the  conditions 
on  U .  Using  LI  (see  Laplace  Transform  Table,  page  469)  to  find  the  transforms, 
we  have 

u=  100  at  x  =  0,  U  =  L(100)  =  ±f-  at  x  =  0; 

u  — »  0  as  i-too ,  U  — >  L(0)  =0  as  nco. 

Since  U  — >  0  as  x  — >  oo,  we  see  that  we  must  use  the  solution  gUvWoO*  from  (9.4) 
and  discard  the  positive  exponential  solution.  We  determine  the  constant  multiple 
of  this  solution  which  fits  our  problem  from  the  condition  that  U  =  100/p  at  x  =  0. 
Thus  we  find  that  the  U  solution  satisfying  the  given  boundary  conditions  is 

(9.6)  U=  —  e-(vW°9*. 

P 

We  find  u  by  looking  up  the  inverse  transform  of  (9.6);  it  is,  by  L22 


(9.7) 


u  =  100 


1  —  erf 


x 

2a\fi 


and  this  is  the  solution  of  the  problem. 


Fourier  Transform  Solutions  In  the  examples  in  Sections  2,  3  and  4,  we  ex¬ 
panded  a  given  function  in  a  Fourier  series.  This  was  possible  because  the  function 
was  to  be  represented  by  a  series  over  a  finite  interval.  We  could  then  take  that 
interval  as  the  period  for  the  Fourier  series.  If  we  are  dealing  with  a  function  which 
is  given  over  an  infinite  interval  (and  not  periodic),  then  instead  of  representing  it 
by  a  Fourier  series  we  represent  it  by  a  Fourier  integral  (Chapter  7,  Section  12). 
Let  us  do  this  for  a  specific  problem. 


Example  2.  An  infinite  metal  plate  (Figure  9.1)  covering  the  first  quadrant  has  the  edge 
along  the  y  axis  held  at  0°,  and  the  edge  along  the  x  axis  held  at 


(9.8) 


u(x, 0) 


100°,  0  <  x  <  1, 
0°,  x  >  1. 


Find  the  steady-state  temperature  distribution  as  a  function  of  x  and  y. 
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The  differential  equation  and  its  solutions  are  the  y 

same  as  in  the  semi-infinite  plate  problem  dis- 

cussed  in  Section  2,  equations  (2.1),  (2.6),  and 

(2.7).  As  in  that  problem,  we  assume  u  — >  0  as  u_0  P,ate 

y  — >  oo,  and  use  only  the  e~ky  terms.  Since  u  —  0 

when  x  =  0,  we  use  only  the  sine  solutions.  The 

basis  functions  we  want  are  then  u  =  e~ky  sin  kx.  u  =  ioo°  l  u  =  o°  x 

We  do  not  have  any  requirement  here  which  de¬ 
termines  k  as  we  did  in  Section  2.  We  must  then  Figure  9.1 

allow  all  fc’s  and  try  to  find  a  solution  in  the  form 

of  an  integral  over  k.  Instead  of  coefficients  bn  in  a  series,  we  have  a  coefficient 
function  B(k)  to  determine.  Remember  that  k  >  0  since  e~ky  must  tend  to  zero  as 
y  — >  oo.  Thus  we  try  to  find  a  solution  of  the  form 


Figure  9.1 


When  y  =  0,  we  have 


pOO 

u(x,y)  =  /  B(k)e~ky  sin  kxdk. 

Jo 

pOO 

u(x,  0)  =  /  B(k)  sin  kx  dk. 

Jo 


This  is  the  first  of  equations  (12.14)  in  Chapter  7,  if  we  identify  k  with  a,  u(x,  0) 
with  fs(x),  and  B(k)  with  y/2/wgs(a).  Thus  the  given  temperature  on  the  x  axis 
is  a  Fourier  sine  transform  of  the  desired  coefficient  function,  so  B{k)  can  be  found 
as  the  inverse  transform.  Using  the  second  of  equations  (12.14)  in  Chapter  7,  we 
get 

/2~  2  f°°  2  f°° 

—gs(k)  =  —  /  fs (x)  sin  kx  dx  =  —  /  u(x,  0)  sin  kx  dx. 

n  K  J  o'  7T  J0 

For  the  given  u(x,  0)  in  (9.8),  we  find 


2  T1  200 

B(k)  =  —  /  100  sin  kxdx  = - 

n  Jo  71 


200  cos  kx 

7T  k 


=  ^(l-cos  *). 

7TK 


Finding  B(k)  corresponds  to  evaluating  the  coefficients  in  a  Fourier  series.  Substi¬ 
tuting  (9.12)  into  (9.9),  we  get  the  solution  to  our  problem  in  the  form  of  an  integral 
instead  of  a  series: 


u(x,y)  =  —  [ 

77  Jo 


1  —  cos  k 


e  ky  sin  kx  dk. 


An  integral  can,  of  course,  be  evaluated  numerically  just  as  a  convergent  series 
can  be  approximated  by  calculating  a  few  terms.  However,  (9.13)  can  be  integrated; 
a  convenient  way  to  do  it  is  to  recognize  that  it  is  a  Laplace  transform  of  f{k)  = 
[(1  —  cos  k)  sin  kx]/k,  where  x  is  just  a  parameter  and  y  corresponds  to  p  and  k  to 
t.  From  L19  and  L20 

.  .  .  .  200  I"  x  1  £  +  11  x  —  1 

(9.14)  u(x,  y)  = -  arc  tan - arc  tan - arc  tan -  . 

7T  [  V  2  y  2  y 

This  can  also  be  written  in  polar  coordinates  as  (Problem  1) 

100  fir  r2  —  cos 20 \ 

(9.15)  u= -  —  —  arctan - ; — — —  . 

v  ’  7r  \  2  sin  20  ) 
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PROBLEMS,  SECTION  9 
i. 


2. 


Verify  that  (9.15)  follows  from  (9.14).  Hint:  Use  the  formulas  for  tan  (a  ±  /?),  tan  2a, 
etc.,  to  condense  (9.14)  and  then  change  to  polar  coordinates.  You  may  find 

100  sin  28 

u  =  - arc  tan  — - . 

7r  r2  —  cos  28 

Show  that  if  you  use  principal  values  of  the  arc  tangent,  this  formula  does  not  give 
the  correct  boundary  conditions  on  the  x  axis,  whereas  (9.15)  does. 

A  metal  plate  covering  the  first  quadrant  has  the  edge  which  is  along  the  y  axis 
insulated  and  the  edge  which  is  along  the  x  axis  held  at 


u(x,  0)  = 


f  100(2  -  x) 

I  0, 


for  0  <  x  <  2, 
for  x  >  2. 


3. 


Find  the  steady-state  temperature  distribution  as  a  function  of  x  and  y.  Hint: 
Follow  the  procedure  of  Example  2,  but  use  a  cosine  transform  (because  du/dx  =  0 
for  x  =  0).  Leave  your  answer  as  an  integral  like  (9.13). 

Consider  the  heat  flow  problem  of  Section  3.  Solve  this  by  Laplace  transforms  (with 
respect  to  t)  by  starting  as  in  Example  1.  You  should  get 


d2U  _  p_  _  100 

dx2  a2  a2l 

Solve  this  differential  equation  to  get 


and  [7(0,  p)  =  U(l,p)  =  0. 


U(x,p)  =  - 


100  sinh  (p1t2/a)x  100 

psinh  (p1/2/®)?  +  pi 


Assume  the  following  expansion,  and  find  u  by  looking  up  the  inverse  Laplace  trans¬ 
forms  of  the  individual  terms  of  U: 


sinh  (p1/2 /a)x 
psinh  ( p1!2  ja)l 


x 

pi 


sin  (nx/l) 


sin  (2tvx/1) 


sin  ( 3nx/l ) 


p  +  (n2a2 /l2)  2  [p  +  (47r2a2/£2)]  3  [p  +  (97r2a2/i2)] 


Your  answer  should  be  (3.15). 

4.  A  semi- infinite  bar  is  initially  at  temperature  100°  for  0  <  x  <  1,  and  0°  for  x  >  1. 
Starting  at  t  =  0,  the  end  x  =  0  is  maintained  at  0°  and  the  sides  are  insulated. 
Find  the  temperature  in  the  bar  at  time  t,  as  follows.  Separate  variables  in  the 

2  j  2  t.2  y. 

heat  flow  equation  and  get  elementary  solutions  e_“  sinfca;  and  e_“  cos  kx. 
Discard  the  cosines  since  u  =  0  at  x  =  0.  Look  for  a  solution 

r°°  2  2 

u{x,t)  =  /  B(k)e~a  k  4  sin  kxdk. 

Jo 


and  proceed  as  in  Example  2.  Leave  your  answer  as  an  integral. 

5.  A  long  wire  occupying  the  x  axis  is  initially  at  rest.  The  end  x  =  0  is  oscillated  up 
and  down  so  that 

p(0,t)  =  2sin3t,  t  >  0. 

Find  the  displacement  y(x,t).  The  initial  and  boundary  conditions  are  y(0,t)  = 
2  sin  3 1,  y(x,  0)  =  0,  dy/dt | t=o  =  0.  Take  Laplace  transforms  of  these  conditions 
and  of  the  wave  equation  with  respect  to  t  as  in  Example  1.  Solve  the  resulting 
differential  equation  to  get 

Qe  ~(p/v)x 


Y(x,p) 


p2  +  9 
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Use  L3  and  L28  to  find 


y{x,t) 


2  sin  3  (t  — 

0, 


x  <  vt, 
X  >  vt. 


6.  Continue  the  problem  of  Example  2  in  the  following  way:  Instead  of  using  the 
explicit  form  of  B(k )  from  (9.12),  leave  it  as  an  integral  and  write  (9.13)  in  the  form 


u(x,y) 


200 

7T 


sin  kx  dk 


l 


i 

sin  kt  dt. 


Change  the  order  of  integration  and  evaluate  the  integral  with  respect  to  k  first. 
(Hint:  Write  the  product  of  sines  as  a  difference  of  cosines.)  Now  do  the  t  integration 
and  get  (9.14). 

7.  Continue  with  Problem  4  as  in  Problem  6. 


10.  MISCELLANEOUS  PROBLEMS 

1.  Find  the  steady-state  temperature  distribution  in  a  rectangular  plate  covering  the 
area  0  <  a:  <  1,  0  <  y  <  2,  if  T  =  0  for  x  =  0,  x  =  1,  y  =  2,  and  T  =  1  —  x  for  y  =  0. 

2.  Solve  Problem  1  if  T  =  0  for  x  =  0,  x  =  1,  y  =  0,  and  T  =  1  —  x  for  y  =  2.  Hint: 
Use  sinh  ky  as  the  y  solution;  then  T  =  0  when  y  =  0  as  required. 

3.  Solve  Problem  1  if  the  sides  x  =  0  and  x  =  1  are  insulated  (see  Problems  2.14  and 

2.15),  and  T  =  0  for  y  =  2,  T  =  1  —  x  for  y  =  0. 

4.  Find  the  steady-state  temperature  distribution  in  a  plate  with  the  boundary  tem¬ 
peratures  T  =  30°  for  x  =  0  and  y  =  3;  T  =  20°  for  y  —  0  and  x  =  5.  Hint: 

Subtract  20°  from  all  temperatures  and  solve  the  problem;  then  add  20°.  (Also  see 

Problem  2.) 

5.  A  bar  of  length  l  is  initially  at  0°.  From  t  =  0  on,  the  ends  are  held  at  20°.  Find 
u(x,  t )  for  t  >  0. 

6.  Do  Problem  5  if  the  x  =  0  end  is  insulated  and  the  x  =  l  end  held  at  20°  for  t  >  0. 
(See  Problem  3.9.) 

7.  Solve  Problem  2  if  the  sides  x  =  0  and  x  =  1  are  insulated. 

8.  A  slab  of  thickness  10  cm  has  its  two  faces  at  10°  and  20°.  At  t  =  0,  the  face 
temperatures  are  interchanged.  Find  u(x,  t )  for  t  >  0. 

9.  A  string  of  length  l  has  initial  displacement  j/o  =  x(l  —  x).  Find  the  displacement  as 
a  function  of  x  and  t. 

10.  Solve  Problem  5.7  if  half  the  curved  surface  of  the  cylinder  is  held  at  100°  and  the 
other  half  at  —100°  with  the  ends  at  0°. 

11.  The  series  in  Problem  5.12  can  be  summed  (see  Problem  2.6).  Show  that 

„„  100  2ar  sin  9  y 

u  =  50  H - arc  tan  — - — .  a 

7r  a*  —  rz 

12.  A  plate  in  the  shape  of  a  quarter  circle  has  boundary  temper-  o° 
atures  as  shown.  Find  the  interior  steady-state  temperature 
u(r,6).  (See  Problem  5.12.) 

"0 
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13.  Sum  the  series  in  Problem  12  to  get 

200  2a2r2  sin  29 

u  =  - arc  tan - - - : — . 

7T  a4  -r4 

Hint:  See  Problem  2.6. 

14.  A  long  cylinder  has  been  cut  into  quarter  cylinders  which  are  insulated  from  each 
other;  alternate  quarter  cylinders  are  held  at  potentials  +100  and  —100.  Find  the 
electrostatic  potential  inside  the  cylinder.  Hints:  Do  you  see  a  relation  to  Problem 
12  above?  Also  see  Problem  5.12. 

15.  Repeat  Problems  12  and  13  for  a  plate  in  the  shape  of  a  circular  sector  of  angle  30° 
and  radius  10  if  the  boundary  temperatures  are  0°  on  the  straight  sides  and  100° 
on  the  circular  arc.  Can  you  then  state  and  solve  a  problem  like  14? 

16.  Consider  the  normal  modes  of  vibration  for  a  square  membrane  of  side  n  (see  Prob¬ 
lem  6.3).  Sketch  the  2,  1  and  1,  2  modes.  Show  that  the  line  y  =  x  is  a  nodal  line  for 
the  combination  sin  x  sin  2y  —  sin  2x  sin  y  of  these  two  modes.  Thus  hnd  a  vibration 
frequency  of  a  membrane  in  the  shape  of  a  45°  right  triangle. 

17.  Sketch  some  of  the  normal  modes  of  vibration  for  a  semicircular  drumhead  and 
find  the  characteristic  vibration  frequencies  as  multiples  of  the  fundamental  for  the 
corresponding  circular  drumhead. 

18.  Repeat  Problem  17  for  a  membrane  in  the  shape  of  a  circular  sector  of  angle  60°. 

19.  A  long  conducting  cylinder  is  placed  parallel  to  the  2  axis  in  an  originally  uniform 
electric  field  in  the  negative  x  direction.  The  cylinder  is  held  at  zero  potential.  Find 
the  potential  in  the  region  outside  the  cylinder.  Hints:  See  Problem  7.13.  You  want 
solutions  of  Laplace’s  equation  in  polar  coordinates  (Problem  5.12). 

20.  Use  Problem  7.16  to  find  the  characteristic  vibration  frequencies  of  sound  in  a  spher¬ 
ical  cavity. 

21.  The  surface  temperature  of  a  sphere  of  radius  1  is  held  at  u  =  sin2  9  +  cos3  9.  Find 
the  interior  temperature  u(r,  9,  </>). 

22.  Find  the  interior  temperature  in  a  hemisphere  if  the  curved  surface  is  held  at  u  = 
cos  9  and  the  equatorial  plane  at  u  =  1. 

23.  Find  the  steady-state  temperature  in  the  region  between  two  spheres  r  =  1  and 
r  =  2  if  the  surface  of  the  outer  sphere  has  its  upper  half  held  at  100°  and  its  lower 
half  at  —100°  and  these  temperatures  are  reversed  for  the  inner  sphere.  Hint:  See 
Problem  7.14.  Here  you  will  need  to  find  two  Legendre  series  (when  r  =  1  and  when 
r  =  2)  and  solve  for  a;  and  6;. 

24.  Find  the  general  solution  for  the  steady-state  temperature  in  Figure  2.2  if  the  bound¬ 
ary  temperatures  are  the  constants  T  =  A,  T  =  B ,  etc.,  on  the  four  sides,  and  the 
rectangle  covers  the  area  0<x<a,  0<y<b.  Hints:  You  can  subtract,  say,  A 
from  all  four  temperatures,  solve  the  problem,  and  then  add  A  back  again.  Thus 
a  solution  with  one  side  at  T  —  0  and  the  other  three  at  given  temperatures  solves 
the  general  problem.  You  have  previously  solved  problems  (Section  2)  with  temper¬ 
atures  C  and  D  given.  For  B,  see  Problem  2. 

25.  The  Klein-Gordon  equation  is  V2u  =  ( l/v2)d2u/dt 2  +  \2u.  This  equation  is  of 
interest  in  quantum  mechanics,  but  it  also  has  a  simpler  application.  It  describes,  for 
example,  the  vibration  of  a  stretched  string  which  is  embedded  in  an  elastic  medium. 
Separate  the  one-dimensional  Klein-Gordon  equation  and  find  the  characteristic 
frequencies  of  such  a  string. 

Answer:  vn=l-  sj ( n/l )2  +  (A/7t)2. 
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26.  Find  the  characteristic  frequencies  of  a  circular  membrane  which  satisfies  the  Klein- 
Gordon  equation  (Problem  25).  Hint:  Separate  the  equation  in  two  dimensions  in 
polar  coordinates. 

27.  Do  Problem  26  for  a  rectangular  membrane. 

28.  Find  the  steady-state  temperature  in  a  semi-infinite  plate  covering  the  region  a:  >  0, 
0  <  y  <  1,  if  the  edges  along  the  x  axis  and  y  axis  are  insulated  (see  Problem  2.14) 
and  the  top  edge  is  held  at 


00°,  0  <  x  <  1, 

0°,  x  >  1. 


Hint:  Look  for  a  solution  as  a  Fourier  integral.  Leave  your  answer  as  an  integral 
(just  as  we  usually  give  answers  as  series.) 


CHAPTER 


14 


Functions  of  a 
Complex  Variable 


1.  INTRODUCTION 

In  Chapter  2  we  discussed  plotting  com¬ 
plex  numbers  z  =  x  +  iy  in  the  complex 
plane  (see  Figure  1.1)  and  finding  values 
of  the  elementary  functions  of  z  such  as 
roots,  trigonometric  functions,  logarithms, 
etc.  Now  we  want  to  discuss  the  calculus  of 
functions  of  z,  differentiation,  integration, 
power  series,  etc.  As  you  know  from  such 
topics  as  differential  equations,  Fourier  se¬ 
ries  and  integrals,  mechanics,  electricity, 
etc.,  it  is  often  very  convenient  to  use  complex  expressions.  The  basic  facts  and 
theorems  about  functions  of  a  complex  variable  not  only  simplify  many  calculations 
but  often  lead  to  a  better  understanding  of  a  problem  and  consequently  to  a  more 
efficient  method  of  solution.  We  are  going  to  state  some  of  the  basic  definitions  and 
theorems  of  the  subject  (omitting  the  longer  proofs),  and  show  some  of  their  uses. 

As  in  Chapter  2,  the  value  of  a  function  of  z  for  a  given  z  is  a  complex  number. 

Example.  Consider  a  simple  function  of  z,  namely  f(z)  =  z2.  We  may  write 
f(z )  =  z2  =  {x  +  iy)2  =  x2  -  y2  +  2 ixy  =  u(x,  y)  +  iv(x,  y), 
where  u(x,  y)  =  x2  —  y2  and  v(x,  y)  =  2 xy. 

In  Chapter  2,  we  observed  that  a  complex  number  z  =  x  +  iy  is  equivalent  to  a  pair 
of  real  numbers  x,  y.  Here  we  see  that  a  function  of  z  is  equivalent  to  a  pair  of  real 
functions,  u{x,y)  and  v(x,y),  of  the  real  variables  x  and  y.  In  general,  we  write 


(1.1)  f(z)  =  f(x  +  iy)  =  u{x ,  y)  +  iv(x,  y), 
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where  it  is  understood  that  u  and  v  are  real  functions  of  the  real  variables  x  and  y. 

Recall  that  functions  are  customarily  single-valued ,  that  is,  /(z)  has  just  one 
(complex)  value  for  each  z.  Does  this  mean  that  we  cannot  define  a  function  by  a 
formula  such  as  In  z  or  arc  tan  zl  By  Chapter  2,  we  have 

lnz  =  In  \z\  +  i(9  +  2nn ), 

where  tan  9  =  y/x.  For  each  z,  In  z  has  an  infinite  set  of  values.  But  if  9  is  allowed  a 
range  of  only  2-7T  ,  then  In  z  has  one  value  for  each  z  and  this  single- valued  function  is 
called  a  branch  of  lnz.  Thus  in  using  formulas  such  as  a/z.  In  z,  arc  tan  z,  to  define 
functions,  we  always  discuss  a  single  branch  at  a  time  so  that  we  have  a  single¬ 
valued  function.  (As  a  matter  of  terminology,  however,  you  should  know  that  the 
whole  collection  of  branches  is  sometimes  called  a  “multiple- valued  function.”) 


PROBLEMS,  SECTION  1 

Find  the  real  and  imaginary  parts  u(x,  y )  and  v(x,  y )  of  the  following  functions. 


1. 

z3 

2. 

z 

3. 

z 

4. 

1*1 

5. 

Re  z 

6. 

z 

e 

7. 

cosh  z 

8. 

sin  z 

9. 

1 

z 

10. 

2z  +  3 

11. 

2  z  —  i 

12. 

z 

z  -\-  2 

iz  +  2 

z2  +  1 

13. 

hi  \z\ 

14. 

2- 
Z  Z 

15. 

ez 

16. 

2  -2 
z  —  Z 

17. 

COS  z 

18. 

V* 

19. 

lnz  (Use  0  <  6  <  2-7T.) 

20.  (1  +  2  i)z2  +  (i 

-  1  )z  + 

21. 

eIZ  ( Careful:  cos  z  and  sin  z  are  not 

u  and  v.) 

2.  ANALYTIC  FUNCTIONS 


Definition  The  derivative  of  f(z)  is  defined  (just  as  it  is  for  a  function  of  a  real 
variable)  by  the  equation 


(2.1) 


fl(  a  df  A f 

J  (z)  =  —  =  lim  — — . 
dz  Az— >o  Az 


where  A /  =  f(z  +  Az)  —  /(z)  and  Az  =  Ax  +  iAy. 


Definition:  A  function  /(z)  is  analytic  (or  regular  or  holomorphic  or  mono¬ 
genic)  in  a  region*  of  the  complex  plane  if  it  has  a  (unique)  derivative  at  every 
point  of  the  region.  The  statement  “/(z)  is  analytic  at  a  point  z  =  a”  means 
that  /(z)  has  a  derivative  at  every  point  inside  some  small  circle  about  z  =  a. 


Isolated  points  and  curves  are  not  regions;  a  region  must  be  two-dimensional. 
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Let  us  consider  what  it  means  for  /(z)  to  have  a  derivative.  First  think  about 
a  function  /(x)  of  a  real  variable  x;  it  is  possible  for  the  limit  of  A//Ax  to  have 
two  values  at  a  point  Xo,  as  shown  in  Figure  2.1 — one  value  when  we  approach  xo 
from  the  left  and  a  different  value  when  we  approach  xo  from  the  right.  When  we 
say  that  /(x)  has  a  derivative  at  x  =  Xo,  we  mean  that  these  two  values  are  equal. 
However,  for  a  function  /(z)  of  a  complex  variable  z,  there  are  an  infinite  number 
of  ways  we  can  approach  a  point  zq]  a  few  ways  are  shown  in  Figure  2.2.  When  we 
say  that  /(z)  has  a  derivative  at  z  —  z o,  we  mean  that  f'{z )  [as  defined  by  (2.1)] 
has  the  same  value  no  matter  how  we  approach  zq.  This  is  an  amazingly  stringent 
requirement  and  we  might  well  wonder  whether  there  are  any  analytic  functions. 
On  the  other  hand,  it  is  hard  to  imagine  making  any  progress  in  calculus  unless  we 
can  find  derivatives! 


Let  us  immediately  reassure  ourselves  that  there  are  analytic  functions  by  using 
the  definition  (2.1)  to  find  the  derivatives  of  some  simple  functions. 

Example  1.  Show  that  (d/dz)(z2)  =  2 z.  By  (2.1)  we  have 

do  (z  +  Az)2  —  z2  z2  T  2zAz  +  (Az)2  —  z2 

—  z2  =  lim  1  / - —  =  lim  - AA - L - _ 

dz  Az— >0  Az  Az— >o  Az 

=  lim  (2z  +  Az)  =  2z. 

Az— >(J 

We  see  that  the  result  is  independent  of  how  Az  tends  to  zero;  thus  z2  is  an  analytic 
function.  By  the  same  method  it  follows  that  ( d/dz)(zn )  =  nzn~ 1  if  n  is  a  positive 
integer  (Problem  30). 

Observe  that  the  definition  (2.1)  of  a  derivative  is  of  exactly  the  same  form  as  the 
corresponding  definition  for  a  function  of  a  real  variable.  Because  of  this  similarity, 
many  familiar  formulas  can  be  proved  by  the  same  methods  used  in  the  real  case,  as 
we  have  just  discovered  in  differentiating  z2.  You  can  easily  show  (Problems  25  to 
28)  that  derivatives  of  sums,  products,  and  quotients  follow  the  familiar  rules  and 
that  the  chain  rule  holds  [if  /  =  f(g)  and  g  =  g(z ),  then  df  /dz  =  (df  / dg)(dg / dz)]. 
Then  derivatives  of  rational  functions  of  z  follow  the  familiar  real- variable  formulas. 
If  we  assume  the  definitions  and  theorems  of  Chapters  1  and  2,  we  can  see  that  the 
derivatives  of  the  other  elementary  functions  also  follow  the  familiar  formulas;  for 
example,  (d/dz)(smz)  =  cos z,  etc.  (Problems  29  to  33). 

Now  you  may  be  wondering  what  is  new  here  since  all  our  results  so  far  seem  to 
be  just  the  same  as  for  functions  of  a  real  variable.  The  reason  for  this  is  that  we 
have  been  discussing  only  functions  /(z)  that  have  derivatives.  Comparing  Figures 
2.1  and  2.2,  we  pointed  out  the  essential  difference  between  finding  ( d/dx)f(x ) 
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and  finding  ( d/dz)f{z),  namely  that  there  are  an  infinite  number  of  ways  we  can 
approach  Zq  in  Figure  2.2. 


Example 2.  Find  (d/dz)(\z\2).  Note  that  |x| 2  =  x2,  and  its  derivative  is  2x.  If  \z\2  has  a 
derivative,  it  is  given  by  (2.1),  that  is,  by 


lim 

Az— >0 


=  lim 
Az— >0 


|z  +  Az|2 
Az 


The  numerator  of  this  fraction  is  always  real  (because  absolute  values  are  real — 
recall  \z\  =  \j  x2  +  y2  =  r.)  Consider  the  denominator  Az  =  Ax  +  iAy.  As  we 
approach  zo  in  Figure  2.2  (that  is,  let  Az  — >  0),  Az  has  different  values  depending 
on  our  method  of  approach.  For  example,  if  we  come  in  along  a  horizontal  line, 
then  Ay  =  0  and  Az  =  Ax;  along  a  vertical  line  Ax  =  0  so  Az  =  iAy,  and  along 
other  directions  Az  is  some  complex  number;  in  general,  Az  is  neither  real  nor  pure 
imaginary.  Since  the  numerator  of  Af  /Az  is  real  and  the  denominator  may  be  real 
or  imaginary  (in  general,  complex),  we  see  that  liniAz^o  Af  / Az  has  different  values 
for  different  directions  of  approach  to  Zo,  that  is,  | z | 2  is  not  analytic. 

Now  we  have  seen  examples  of  both  analytic  and  nonanalytic  functions,  but  we 
still  do  not  know  how  to  tell  whether  a  function  has  a  derivative  [except  to  appeal 
to  (2.1)].  The  following  theorems  answer  this  question. 


Theorem  I  (which  we  shall  prove).  If  /(z)  =  u(x,  y)  +  iv(x,  y)  is  analytic  in  a 
region,  then  in  that  region 

.  .  du  dv  dv  du 

(  ’  fa  =  dy'  fa  =  ~!hj' 

These  equations  are  called  the  Cauchy-Riemann  conditions. 


Proof.  Remembering  that  /  =  /(z),  where  z  =  x  +  iy,  we  find  by  the  rules  of 
partial  differentiation  (see  Problem  28  and  also  Chapter  4) 


(2.3) 


Since  / 
(2.4) 


t(x,y ) 


d£  = 

df_ 

dz 

=  £.  i. 

dx 

dz 

dx 

dz 

d£  = 

df_ 

dz 

_<y_. 

dy 

dz 

dy 

dz 

(1-1), 

we 

also  have 

d £ 

dx 


du 

dx 


dv 

dx 


and 


d£ 

dy 


du 

dy 


dv 

dy' 


Notice  that  if  /  has  a  derivative  with  respect  to  z,  then  it  also  has  partial  derivatives 
with  respect  to  x  and  y  by  (2.3).  Since  a  complex  function  has  a  derivative  with 
respect  to  a  real  variable  if  and  only  if  its  real  and  imaginary  parts  do  [see  (1.1)] ,  then 
by  (2.4)  u  and  v  also  have  partial  derivatives  with  respect  to  x  and  y.  Combining 
(2.3)  and  (2.4)  we  have 


df__df__du  .dv  df_  _  ldf_  _  1  f  du  ,dv\_dv_.du 

dz  dx  dx+  1  dx  &n  dz  i  dy  i  \  dy  *  dy  J  dy  *  dy’ 
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Since  we  assumed  that  df/dz  exists  and  is  unique  (this  is  what  analytic  means), 
these  two  expressions  for  df/dz  must  be  equal.  Taking  real  and  imaginary  parts, 
we  get  the  Cauchy-Riemann  equations  (2.2). 


Theorem  II  (which  we  state  without  proof).  If  u(x,y)  and  v(x,y)  and  their 
partial  derivatives  with  respect  to  x  and  y  are  continuous  and  satisfy  the  Cauchy- 
Riemann  conditions  in  a  region,  then  /(z)  is  analytic  at  all  points  inside  the  region 
(not  necessarily  on  the  boundary). 


Although  we  shall  not  prove  this  (see  texts  on  complex  variables) ,  we  can  make 
it  plausible  by  showing  that  it  is  true  when  we  approach  z o  along  any  straight  line. 


Example  3.  Find  df/dz  assuming  that  we  approach  zq  along  a  straight  line  of  slope  m, 
and  show  that  df/dz  does  not  depend  on  m  if  u  and  v  satisfy  (2.2).  The  equation 
of  the  straight  line  of  slope  m  through  the  point  zq  =  xq  +  iyo  is 


y  -  Vo  =  m(x  -  xQ) 


and  along  this  line  we  have  dy/dx  =  m.  Then  we  find 


df_ 

dz 


du 


du 


du+  idv  dx 
dx  +  idy 
du  du 
dx  dy 


— —  dx  +  — —  dy  +  i  I  — —  dx  +  —  dy 


dy 


dv 


dv 


dx 


dy 


dx  +  idy 


m  - 


dv 

dx 


dv 

TT' 

dy 


1  +  im 


Using  the  Cauchy-Riemann  equations  (2.2),  we  get 


dz 


du  dv 
dx  dx 


1  +  im 

du  .  .  .  dv  .  . 


du 


1  +  im 


dx 


dv 

dx' 


Thus  df/dz  has  the  same  value  for  approach  along  any  straight  line.  The  theorem 
states  that  it  also  has  the  same  value  for  approach  along  any  curve. 


Some  definitions: 

A  regular  point  of  f(z)  is  a  point  at  which  f(z)  is  analytic. 

A  singular  point  or  singularity  of  f(z)  is  a  point  at  which  f(z)  is  not  analytic. 
It  is  called  an  isolated  singular  point  if  f(z)  is  analytic  everywhere  else  inside 
some  small  circle  about  the  singular  point. 
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Theorem  III  (which  we  state  without  proof).  If  f(z)  is  analytic  in  a  region 
( R  in  Figure  2.3),  then  it  has  derivatives  of  all  orders  at  points  inside  the  region 
and  can  be  expanded  in  a  Taylor  series  about  any  point  zo  inside  the  region. 
The  power  series  converges  inside  the  circle  about  zq  that  extends  to  the  nearest 
singular  point  (C  in  Figure  2.3). 


Singular 

point 


Figure  2.3 

Notice  again  what  a  strong  condition  it  is  on  f(z)  to  say  that  it  has  a  derivative. 
It  is  quite  possible  for  a  function  of  a  real  variable  f{x)  to  have  a  first  derivative 
but  not  higher  derivatives.  But  if  f(z)  has  a  first  derivative  with  respect  to  z ,  then 
it  has  derivatives  of  all  orders,  and  all  these  derivatives  are  analytic  functions. 

This  theorem  also  explains  a  fact  about  power  series  which  may  have  puzzled 
you.  The  function  f(x)  =  1/(1  +  x2)  does  not  have  anything  peculiar  about  its 
behavior  at  x  =  ±1.  Yet  if  we  expand  it  in  a  power  series 

(2.5)  1  „  =  1  -  x2  +  x4  -  x6  +  •  •  • 

v  '  1  +  x2 

we  see  that  the  series  converges  only  for  |cc|  <  1.  We  can  see  why  this  happens  if 
we  consider  instead 

(2.6)  f(z)  =  =  l-z2  +  z4-z6  +  ---  . 

1  +  zz 

When  z  =  ±i,  f(z)  and  its  derivatives  become  infinite;  that 
is,  f(z)  is  not  analytic  in  any  region  containing  z  =  ±i.  The 
point  Zq  of  the  theorem  is  the  origin  and  the  circle  C  (bound¬ 
ing  the  disk  of  convergence  of  the  series)  passes  through  the 
nearest  singular  points  ±*  (Figure  2.4).  Since  a  power  series 
in  2  always  converges  inside  its  disk  of  convergence  and  di¬ 
verges  outside  (Chapter  2,  Problem  6.14),  we  see  that  (2.5) 

[which  is  (2.6)  for  y  =  0]  converges  for  |x|  <  1  and  diverges  for 
|  a;  |  >  1.  This  simple  example  shows  an  important  reason  for 
studying  functions  of  a  complex  variable;  our  study  of  f(z)  gives  us  insights  about 
the  corresponding  f(x).  Formulas  involving  not  only  the  elementary  functions  but 
also  r  functions,  Bessel  functions,  and  many  others  are  more  easily  derived  and 
understood  by  considering  them  as  functions  of  z. 

A  function  </>(x,  y)  which  satisfies  Laplace’s  equation  in  two  dimensions,  namely, 
V2(j)  =  d2<fi/dx2  +  d2cj)/dy2,  is  called  a  harmonic  function.  A  great  many  physical 
problems  lead  to  Laplace’s  equation,  and  consequently  we  are  very  much  interested 
in  finding  solutions  of  it.  (See  Section  10  and  Chapter  13.)  The  following  theorem 
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should  then  give  you  a  clue  as  to  one  reason  why  the  theory  of  functions  of  a  complex 
variable  is  important  in  applications. 


Theorem  IV.  Part  1  (to  be  proved  in  Problem  44).  If  f(z)  =  u  +  iv  is  analytic 
in  a  region,  then  u  and  v  satisfy  Laplace’s  equation  in  the  region  (that  is,  u  and  v 
are  harmonic  functions). 

Part  2  (which  we  state  without  proof).  Any  function  u  (or  v)  satisfying 
Laplace’s  equation  in  a  simply-connected  region,  is  the  real  or  imaginary  part  of 
an  analytic  function  f(z). 


Thus  we  can  find  solutions  of  Laplace’s  equation  simply  by  taking  the  real  or 
imaginary  parts  of  an  analytic  function  of  z.  It  is  also  often  possible,  starting  with 
a  simple  function  which  satisfies  Laplace’s  equation,  to  find  the  explicit  function 
f(z)  of  which  it  is,  say,  the  real  part. 


Example  4.  Consider  the  function  u(x,y)  =  x2  —  y2.  We  find  that 


—  =  2 
dy 2 


2  =  0, 


that  is,  u  satisfies  Laplace’s  equation  (or  u  is  a  harmonic  function).  Let  us  find  the 
function  v(x ,  y)  such  that  u+iv  is  an  analytic  function  of  z.  By  the  Cauchy-Riemann 
equations 

dv  du 
dy  dx  ^ X ' 

Integrating  partially  with  respect  to  y ,  we  get 


v(x,y )  =  2  xy  +  g(x), 


where  g(x)  is  a  function  of  x  to  be  found.  Differentiating  partially  with  respect  to 
x  and  again  using  the  Cauchy-Riemann  equations,  we  have 


dv 

dx 


2  y  +  g'(x) 


du 

dy 


=  2  y. 


Thus  we  find 


g'{x)  =  0,  or  g  =  const. 


Then 

f(z)  =  u  +  iv  =  x2  —  y2  +  2  ixy  +  const.  =  z2  +  const. 

The  pair  of  functions  u,  v  are  called  conjugate  harmonic  functions.  (Also  see  Prob¬ 
lem  64.) 


PROBLEMS,  SECTION  2 


1  to  21.  Use  the  Cauchy-Riemann  conditions  to  find  out  whether  the  functions  in  Problems 
1.1  to  1.21  are  analytic.  Similarly,  find  out  whether  the  following  functions  are  analytic. 


x  —  ly 


y  —  ix 


22.  y  +  ix 


23. 


x2  +  y2 


24. 


x2  +  y2 
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Using  the  definition  (2.1)  of  ( d/dz)f(z ),  show  that  the  following  familiar  formulas  hold. 
Hint:  Use  the  same  methods  as  for  functions  of  a  real  variable. 


25.  iWW  +  B,(,)].4  +  B|. 


26. 


d_ 

dz 


lf(z)g(z)]  =  f(z)^  +  g(z) 


dz 


27  —  ( -  gf  ~  fg' 

dz  \g(z) 


g{z)£  0.  28. 


~rf[9(z)\  =  ~T~~7 -■  (See  hint  below.) 
dz  L  v  n  dg  dz  v  ' 


Problem  28  is  the  chain  rule  for  the  derivative  of  a  function  of  a  function.  Hint:  Assume 
that  df  /dg  and  dg/dz  exist,  and  write  equations  like  (3.5)  of  Chapter  4  for  A /  and  A g; 
substitute  A g  into  A/,  divide  by  A z,  and  take  limits. 


29. 

31. 

32. 

33. 


d_ 

dz 


(z3)  =  3z2. 


30. 


d_ 

dz 


/  n\  n 

(z  )  =  nz 


-7-  In  z  =  —,  2^  0.  Hint:  Expand  In  ( 1  + 

dz  z  \  z 

Using  the  definition  of  ez  by  its  power  series  [(8.1)  of  Chapter  2],  and  the  theorem 
(Chapters  1  and  2)  that  power  series  may  be  differentiated  term  by  term  (within 
the  disk  of  convergence),  and  the  result  of  Problem  30,  show  that  ( d/dz){ez )  =  ez . 

Using  the  definitions  of  sin  z  and  cos  z  [Chapter  2,  equation  (11.4)],  find  their  deriva¬ 
tives.  Then  using  Problem  27,  find  (d/dz)  (cot  z),  z  ^  mr. 

Using  series  you  know  from  Chapter  1,  write  the  power  series  (about  the  origin)  of  the 
following  functions.  Use  Theorem  III  to  find  the  disk  of  convergence  of  each  series.  What 
you  are  looking  for  is  the  point  (anywhere  in  the  complex  plane)  nearest  the  origin,  at 
which  the  function  does  not  have  a  derivative.  Then  the  disk  of  convergence  has  center  at 
the  origin  and  extends  to  that  point.  The  series  converges  inside  the  disk. 


ln(l  -  z) 

35. 

cos  2: 

36. 

a/1  +  - 

tanh  z 

38. 

1 

39. 

Z 

2i  +  z 

z2  +  9 

(1-z)-1 

41. 

iz 

e 

42. 

sinh  z 

43. 


44. 


45. 


46. 


47. 


In  Chapter  12,  equations  (5.1)  and  (5.2),  we  expanded  the  function  4>{x,  h)  in  a  series 
of  powers  of  h.  Use  Theorem  III  (see  instructions  for  Problems  34  to  42  above)  to 
show  that  the  series  for  ij>{x,h )  converges  for  |A|  <  1  and  —  1  <  x  <  1.  Here  h  is 
the  variable  and  a;  is  a  parameter;  you  should  find  the  (complex)  value  of  h  which 
makes  4?  infinite,  and  show  that  the  absolute  value  of  this  complex  number  is  1 
(independent  of  x  when  x2  <  1).  This  proves  that  the  series  for  real  h  converges  for 
\h\  <  1. 

Prove  Theorem  IV,  Part  1.  Hint:  Recall  the  equality  of  the  second  cross  partial 
derivatives;  see  Chapter  4,  end  of  Sectionl. 

Let  /(z)  =  u  +  iv  be  an  analytic  function,  and  let  F  be  the  vector  F  =  ui  +  uj.  Show 
that  the  equations  div  F  =  0  and  curl  F  =  0  are  equivalent  to  the  Cauchy- Riemann 
equations. 

Find  the  Cauchy-Riemann  equations  in  polar  coordinates.  Hint :  Write  z  =  retB  and 
/(z)  =  u(r,6)  +  iv(r,9).  Follow  the  method  of  equations  (2.3)  and  (2.4). 

Using  your  results  in  Problem  46  and  the  method  of  Problem  44,  show  that  u 
and  v  satisfy  Laplace’s  equation  in  polar  coordinates  (see  Chapter  10,  Section  9)  if 
/(z)  =  u  +  iv  is  analytic. 
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Using  polar  coordinates  (Problem  46),  find  out  whether  the  following  functions  satisfy  the 
Cauchy- Riemann  equations. 


48.  VT 

49.  |z| 

50. 

51.  zn 

52.  |z|2 

53. 

Show  that  the  following  functions  are  harmonic,  that  is,  that  they  satisfy  Laplace’s  equa¬ 
tion,  and  find  for  each  a  function  /(z)  of  which  the  given  function  is  the  real  part.  Show 
that  the  function  v(x,y)  (which  you  find)  also  satisfies  Laplace’s  equation. 


54. 

y 

55. 

o  2  3 

3 x  y-y 

56. 

xy 

57.  x  +  y 

58. 

cosh  y cos  x 

59. 

ex  cos  y 

60. 

ln(a:2 

2\ 

+  y  ) 

61. 

X 

62. 

63. 

y 

x2  +  y2 

e  v  sin  x 

(1- 

x)2  +  y2 

64.  It  can  be  shown  that,  if  u(x,  y)  is  a  harmonic  function  which  is  defined  at  zo  = 
x0  +  iyo,  then  an  analytic  function  of  which  u(x,  y )  is  the  real  part  is  given  by 

/(z)  =  2u  (£±£2j  *  2^°)  +  const. 

[See  Struble,  Quart.  Appl.  Math.,  37  (1979),  79-81.]  Use  this  formula  to  find  /(z) 
in  Problems  54  to  63.  Hint:  If  u(0,  0)  is  defined,  take  zo  =  0. 

3.  CONTOUR  INTEGRALS 


Theorem  V  Cauchy’s  theorem  (see  discussion  below).  Let  C  be  a  simple^ 
closed  curve  with  a  continuously  turning  tangent  except  possibly  at  a  finite  num¬ 
ber  of  points  (that  is,  we  allow  a  finite  number  of  corners,  but  otherwise  the  curve 
must  be  “smooth”).  If  /(z)  is  analytic  on  and  inside  C,  then 

(3.1)  j>  f(z)dz  =  0. 

around  C 


(This  is  a  line  integral  as  in  vector  analysis;  it  is  called  a  contour  integral  in  the 
theory  of  complex  variables.) 

Proof.  We  shall  prove  Cauchy’s  theorem  assuming  that  f'(z)  is  continuous.  (With 
more  effort  it  is  possible  to  prove  it  without  this  assumption,  and  then  show  that  if 
f'{z )  exists  in  a  region,  it  is,  in  fact,  continuous  there.  See  also  Theorem  III  which 
we  stated  without  proof;  it  is  usually  proved  using  the  results  of  Cauchy’s  theorem.) 

(3.2)  ®  f(z)dz  =  (j>  (u  +  iv)(dx  +  i  dy) 

Jc  Jc 

=  ®  (udx  —  v  dy)  +  i  ®  (v  dx  +  u  dy) . 

Jc  Jc 


f  A  simple  curve  is  one  which  does  not  cross  itself. 
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Green’s  theorem  in  the  plane  (Chapter  6,  Section  9)  says  that  if  P(x,y ),  Q(x,y), 
and  their  partial  derivatives  are  continuous  in  a  simply-connected  region  R,  then 

(3.3)  f  pdx  +  Qdy-  JJ  dxd„, 

area 
inside  C 

where  C  is  a  simple  closed  curve  lying  entirely  in  R.  The  curve  C  is  traversed  in  a 
direction  so  that  the  area  inclosed  is  always  to  the  left;  the  area  integral  is  over  the 
area  inside  C.  Applying  (3.3)  to  the  first  integral  in  (3.2),  we  get 

(3.4)  £(udx-vdy)=  JJ  dx dy. 

area 
inside  C 

Since  we  are  assuming  that  f'(z)  is  continuous,  then  u  and  v  and  their  derivatives 
are  continuous;  by  the  Cauchy- Riemann  equations  the  integrand  on  the  right  of 

(3.4)  is  zero  at  every  point  of  the  area  of  integration,  so  the  integral  is  equal  to  zero. 
In  the  same  way  the  second  integral  in  (3.2)  is  zero;  thus  (3.1)  is  proved. 


Theorem  VI  Cauchy’s  integral  formula  (which  we  shall  prove).  If  f(z)  is 
analytic  on  and  inside  a  simple  closed  curve  C,  the  value  of  f(z)  at  a  point  z  =  a 
inside  C  is  given  by  the  following  contour  integral  along  C : 


/(“>  =  2S 


/(*) 


dz. 


z  —  a 


Proof.  Let  a  be  a  fixed  point  inside  the  simple  closed 
curve  C  and  consider  the  function 

(3.5)  0(z)  = 

z  —  a 

where  f(z)  is  analytic  on  and  inside  C.  Let  C1  be  a 
small  circle  (inside  C)  with  center  at  a  and  radius  p. 

Make  a  cut  between  C  and  C'  along  AB  (Figure  3.1); 
two  cuts  are  shown  to  make  the  picture  clear,  but  later 
we  shall  make  them  coincide.  We  are  now  going  to 
integrate  along  the  path  shown  in  Figure  3.1  (in  the  Figure  3.1 

direction  shown  by  the  arrows)  from  A,  around  C ,  to 

B,  around  C' ,  and  back  to  A.  Notice  that  the  area  between  the  curves  C  and  C’ 
is  always  to  the  left  of  the  path  of  integration  and  is  inclosed  by  it.  In  this  area 
between  C  and  C' ,  the  function  (f>(z)  is  analytic;  we  have  cut  out  a  small  disk  about 
the  point  z  =  a  at  which  cf>(z )  is  not  analytic.  Cauchy’s  theorem  then  applies  to  the 
integral  along  the  combined  path  consisting  of  C  counterclockwise,  C'  clockwise, 
and  the  two  cuts.  The  two  integrals,  in  opposite  directions  along  the  cuts,  cancel 
when  the  cuts  are  made  to  coincide.  Thus  we  have 


(3.6) 


j)  4>(z)  dz  ■ 


4>(z)  dz  =  0  or 


C  counter-  C'  clockwise 

clockwise 


j)  4>{z)  dz 


4>{z)  dz  where  both  are  counterclockwise. 
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Along  the  circle  C' ,  z  =  a  +  pel6 ,  dz  =  pie1 9  d6 ,  and  (3.6)  becomes 


(3.7) 


j)  <j>(z)  dz 


4>{z)  dz 


<f  I^dz 

Jc  z~a 


Since  our  calculation  is  valid  for  any  (sufficiently  small)  value  of  p,  we  shall  let 
p  — >  0  (that  is,  z  — >  a)  to  simplify  the  formula.  Because  f(z)  is  continuous  at  z  =  a 
(it  is  analytic  inside  C),  limz^a  f(z)  =  /(a).  Then  (3.7)  becomes 


(3.8) 

or 


j)  <t>(z )  dz 


Jc  z-a 


f(z)i  d6 


f(a)i  dO  =  2nif(a) 


(3.9)  /(a)  =  — —  (f  J-izL  dz,  a  inside  C. 

27rz  Jc  z  —  a 

This  is  Cauchy’s  integral  formula.  Note  carefully  that  the  point  a  is  inside 
C;  if  a  were  outside  C,  then  cf>(z )  would  be  analytic  everywhere  inside  C  and  the 
integral  would  be  zero  by  Cauchy’s  theorem.  A  useful  way  to  look  at  (3.9)  is  this: 
If  the  values  of  f(z)  are  given  on  the  boundary  of  a  region  (curve  C),  then  (3.9) 
gives  the  value  of  f(z)  at  any  point  a  inside  C .  With  this  interpretation  you  will 
find  Cauchy’s  integral  formula  written  with  a  replaced  by  z,  and  z  replaced  by 
some  different  dummy  integration  variable,  say  w: 

(3.10)  f(z)  =  — (f  dw,  z  inside  C. 

2  iri  Jc  w  —  z 


For  some  important  uses  of  this  theorem,  see  Problems  11.3  and  11.36  to  11.38. 


PROBLEMS,  SECTION  3 

Evaluate  the  following  line  integrals  in  the  complex  plane  by  direct  integration,  that  is,  as 
in  Chapter  6,  Section  8,  not  using  theorems  from  this  chapter.  (If  you  see  that  a  theorem 
applies,  use  it  to  check  your  result.) 


1.  zdz  along  a  straight  line  parallel  to  the  x  axis. 

2-  Jo+\z2~z)dz 

(a)  along  the  line  y  =  x; 

(b)  along  the  indicated  broken  line. 

3.  §c  z2  dz  along  the  indicated  paths: 


1  +  i 


- 1  +  i 


-i 


1  +  i 


0 

(a) 


-1 


0 

(b) 


1 
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4.  f  dz/(l  —  z2)  along  the  whole  positive  imaginary  axis,  that  is,  the  y  axis;  this  is 
frequently  written  as  fg°°  dz/(l  —  z2). 

5.  f  e~z  along  the  positive  part  of  the  line  y  =  n;  this  is  frequently  written  as 

roo+iTr  —z  i 

.  e  dz. 

J  n r 


6.  f‘  zdz  along  the  indicated  paths: 


7. 


8. 


h 
L 


dz 


(a) 


(b) 


:  +  z2 

r27r-\-ioo 


along  the  line  y  =  x  from  0  to  oo. 

dz 


2 iz  j 

e  dz 


»■  / 


oo+2  i 


1+2  i  ix  2  i)2 


10 •  L 


2+ioo 


IZ  7 

ze  dz 


11.  Evaluate  §c(z  —  3)  dz  where  C  is  the  indicated  closed  curve  along  the  2 i 
first  quadrant  part  of  the  circle  \z\  =2,  and  the  indicated  parts  of  the 
x  and  y  axes.  Hint:  Don’t  try  to  use  Cauchy’s  theorem!  (Why  not? 
Further  hint:  See  Problem  2.3.)  — 


12-  Jo1 


\z\2  dz  along  the  indicated  paths: 


1  +  2  i 


2i 


2 

*1  +  2  i 


(b) 

13.  In  Chapter  6,  Section  11,  we  showed  that  a  necessary  condition  for  fbF  ■  dr  to  be 
independent  of  the  path  of  integration,  that  is,  for  j>c  F  ■  dr  around  a  simple  closed 
curve  C  to  be  zero,  was  curlF  =  0,  or  in  two  dimensions,  dFy/dx  =  dFx/dy.  By 
considering  (3.2),  show  that  the  corresponding  condition  for  +,  f(z)  dz  to  be  zero  is 
that  the  Cauchy-Riemann  conditions  hold. 

14.  In  finding  complex  Fourier  series  in  Chapter  7,  we  showed  that 

n2ir 

I  inx  —imx  in  / 

/  e  e  dx  =  (J,  n  ^  m. 

Jo 

Show  this  by  applying  Cauchy’s  theorem  to 


n  —  m  —  1  7  . 

z  dz,  n  >  m, 


where  C  is  the  circle  \z\  =  1.  (Note  that  although  we  take  n  >  m  to  make  2n-m~1 
analytic  at  z  =  0,  an  identical  proof  using  zm~n with  n  <  m  completes  the  proof 
for  all  n  ^  m.) 

If  f(z)  is  analytic  on  and  inside  the  circle  \z\  =  1,  show  that  J2n  elG  f{el9)  dd  =  0. 

If  f(z)  is  analytic  in  the  disk  \z\  <  2,  evaluate  /Q2,r  e2ie/(el9)  dd. 

Use  Cauchy’s  theorem  or  integral  formula  to  evaluate  the  integrals  in  Problems  17  to  20. 

17.  /  smzdz  wh  c  is  the  circle  (a)  N  =  1. 

Jc2z-n  (b)M=2. 


15. 

16. 


18. 


sin  2  z  dz 
c  6z  —  n 


where  C  is  the  circle  I2I  =3. 
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19. 


20. 


e3zdz  . 


z  —  In  2 


if  C  is  the  square  with  vertices  ±1  ±  i. 


cosh  zdz  .  .  (a)  \z\  1. 

— — -  if  C  is  the  circle  )  ' 

c  2ln2  —  z  (b)  \z\  =  2. 


21.  Differentiate  Cauchy’s  formula  (3.9)  or  (3.10)  to  get 

f{w)  dw 


nz)  =  hi 


7c  (w  -  z)2 
By  differentiating  n  times,  obtain 


OT  /,(a)  =  i/c 


f{z)  dz 
c  ( z-a. )2' 


/<”>(*)  =  ^  f  fMdw  or  ln)  =  n!  T  M  dz 
2m  Jc  ( w  —  z)n+1  2m  Jc  (z  —  a)nJrl 


Use  Problem  21  to  evaluate  the  following  integrals, 
sin  2z  dz 


22. 

23. 

24. 


c  “  t1-)3 
e3z  dz 
c  (z  —  In  2)4 
cosh  s  dz 


fc  (2  In  2  —  z)5 

4.  LAURENT  SERIES 


where  C  is  the  circle  \z\  =  3. 
where  C  is  the  square  in  Problem  19. 
where  C  is  the  circle  I z\  =2. 


Theorem  VII  Laurent’s  theorem  [equation  (4.1)]  (which  we  shall  state  with¬ 
out  proof).  Let  C\  and  C2  be  two  circles  with  center  at  z0.  Let  f(z)  be  analytic 
in  the  region  R  between  the  circles.  Then  f{z )  can  be  expanded  in  a  series  of  the 
form 

(4-1)  f{z)  =  do  +  0,\{z  —  Zq)  +  <22(2  —  Zq )2  +  •  •  ■  4 - h  - rw  +  ■  ■  • 

Z~  Z0  ( Z  -  Z0) 

convergent  in  R.  Such  a  series  is  called  a  Laurent  series.  The  “6”  series  in  (4.1) 
is  called  the  principal  part  of  the  Laurent  series. 


Example  1.  Consider  the  Laurent  series 

„2)  /w.1+|44+...+(|)“+... 


Let  us  see  where  this  series  converges.  First  consider  the  series  of  positive  powers; 
by  the  ratio  test  (see  Chapters  1  and  2),  this  series  converges  for  |^y/2|  <  1,  that 
is,  for  |z|  <  2.  Similarly,  the  series  of  negative  powers  converges  for  |l/z|  <  1,  that 
is,  | z\  >  1.  Then  both  series  converge  (and  so  the  Laurent  series  converges)  for  |z| 
between  1  and  2,  that  is,  in  a  ring  between  two  circles  of  radii  1  and  2. 

We  expect  this  result  in  general.  The  “a”  series  is  a  power  series,  and  a  power 
series  converges  inside  some  circle  (say  C2  in  Figure  4.1).  The  “6”  series  is  a  series 
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of  inverse  powers  of  z,  and  so  converges  for  \l/z\  <  some  constant;  thus  the  “6” 
series  converges  outside  some  circle  (say  C\  in  Figure  4.1).  Then  a  Laurent  series 
converges  between  two  circles  (if  it  converges  at  all).  (Note  that  the  inner  circle 
may  be  a  point  and  the  outer  circle  may  have  infinite  radius) . 


Figure  4.1 


Figure  4.2 

The  formulas  for  the  coefficients  in  (4.1)  are  (Problem  5.2) 


(4.3) 


=  —  I  /(*) dz 

2ni  Jc  (jz  -  z0)n+1 ’ 


1  /  /(z)  dz 

2iti  Jc  (z  -  z0)~n+1 ’ 


where  C  is  any  simple  closed  curve  surrounding  zq  and  lying  in  R.  However,  this  is 
not  usually  the  easiest  way  to  find  a  Laurent  series.  Like  power  series  about  a  point, 
the  Laurent  series  (about  Zq)  for  a  function  in  a  given  annular  ring  (about  zq)  where 
the  function  is  analytic,  is  unique,  and  we  can  find  it  by  any  method  we  choose. 
(See  examples  below.)  Warning:  If  f(z)  has  several  isolated  singularities  (Figure 
4.2),  there  are  several  annular  rings,  R\,  R2,  in  which  f(z)  is  analytic;  then 
there  are  several  different  Laurent  series  for  f(z),  one  for  each  ring.  The  Laurent 
series  which  we  usually  want  is  the  one  that  converges  near  zq.  If  you  have  any 
doubt  about  the  ring  of  convergence  of  a  Laurent  series,  you  can  find  out  by  testing 
the  “a”  series  and  the  “6”  series  separately. 


Example  2.  The  function  from  which  we  obtained  (4.2)  was 


(4.4) 


m 


12 

z(2  -  z)(  1  +  z) 


This  function  has  three  singular  points,  at  z  =  0,  z  =  2,  and  z  —  —  1.  Thus  there 
are  two  circles  C\  and  C2  about  zq  =  0  in  Figure  4.2,  and  three  Laurent  series  about 
zq  =  0,  one  series  valid  in  each  of  the  three  regions  Ri  (0  <  \z\  <  1) ,  7?2 (1  <  \z\  <  2), 
and  J?3(|z|  >2).  To  find  these  series  we  first  write  f(z)  in  the  following  form  using 
partial  fractions  (Problem  2) : 


/(*)  = 


4 


1 


(4.5) 


z 


1  +  z  2  —  z 
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Now,  for  0  <  1 2 1  <  1,  we  expand  each  of  the  fractions  in  the  parenthesis  in  (4.5)  in 
powers  of  z.  This  gives  (Problem  2): 

(4.6)  f{z)  =  -3  +  9z/2  -  15z2/4  +  33xr3/8  +  •  •  •  +  6  jz. 


This  is  the  Laurent  series  for  f(z)  which  is  valid  in  the  region  0  <  \z\  <  1.  To 
obtain  the  series  valid  in  the  region  \z\  >  2,  we  write  the  fractions  in  (4.5)  as 


1  +  z  zl  +  l/z’  2  —  z  z\  —  2/z 

and  expand  each  fraction  in  powers  of  1  / z.  This  gives  the  Laurent  series  valid  for 
\z\>  2  (problem  2): 

(4.8)  f(z)  =  -{12/z3){1  +  1/z  +  3/z2  +  5/z3  +  11/z4  +  •••)• 

Finally,  to  obtain  (4.2),  we  expand  the  fraction  1/(2  —  z)  in  powers  of  z,  and  the 
fraction  1/(1  +  z)  in  powers  of  1  / z\  this  gives  a  Laurent  series  which  converges  for 
1  <  \z  |  <  2.  Thus  the  Laurent  series  (4.6),  (4.2)  and  (4.8)  all  represent  f(z)  in 
(4.4),  but  in  three  different  regions. 

Let  zq  in  Figure  4.2  be  either  a  regular  point  or  an  isolated  singular  point  and 
assume  that  there  are  no  other  singular  points  inside  G\.  Let  f{z)  be  expanded 
in  the  Laurent  series  about  zq  which  converges  inside  C\  (except  possibly  at  zq); 
we  say  that  we  have  expanded  f(z)  in  the  Laurent  series  which  converges  near  zq. 
Then  we  have  the  following  definitions. 


Definitions: 

If  all  the  b's  are  zero,  f(z)  is  analytic  at  z  =  zo,  and  we  call  zq  a  regular  point. 
(See  Problem  4.1) 

If  bn  ^  0,  but  all  the  b's  after  bn  are  zero,  f(z)  is  said  to  have  a  pole  of  order  n 
at  z  =  zo-  If  n  =  1,  we  say  that  f(z)  has  a  simple  pole. 

If  there  are  an  infinite  number  of  b's  different  from  zero,  f(z)  has  an  essential 
singularity  at  z  =  zq. 

The  coefficient  b\  of  1  /{z  —  zq)  is  called  the  residue  of  f(z )  at  z  =  zq. 


Example  3. 

(a) 


ez  =  1  +  z  + 


z2  z3 


2!  3! 

is  analytic  at  z  =  0;  the  residue  of  ez  at  z  =  0  is  0. 


(b) 


ez  _  1  1  1  1 

^~^+z^  +  2h  +  3! 


ez  1 

has  a  pole  of  order  3  at  z  =  0;  the  residue  of  at  z  =  0  is  — . 

z6  2! 
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(c) 


31/z  =  1  + 


1 

2!^ 


has  an  essential  singularity  at  z  =  0;  the  residue  of  el^z  at  z  =  0  is  1. 

Most  of  the  functions  we  shall  consider  will  be  analytic  except  for  poles — such 
functions  are  called  meromorphic  functions.  If  f(z)  has  a  pole  at  z  =  Zq,  then 
|/(jz)|  — >  oo  as  z  — >  Zq.  A  three-dimensional  graph  with  \f(z)\  plotted  vertically 
over  a  horizontal  complex  plane  would  look  like  a  tapered  pole  near  z  =  zq.  We  can 
often  see  that  a  function  has  a  pole  and  find  the  order  of  the  pole  without  finding 
the  Laurent  series. 


Example  4. 


w  z2{z-l)3(z+l) 

has  a  pole  of  order  2  at  2  =  0,  a  pole  of  order  3  at  2  =  1,  and  a  simple  pole  at 
0=  -1. 


(b) 


sin2  z 


has  a  simple  pole  at  z  =  0. 


To  see  that  these  results  are  correct,  consider  finding  the  Laurent  series  for  f{z)  = 
g{z)/{z  —  Zo)n.  We  write  g(z)  =  a®  +  a\(z  —  zq)  +  ■■  ■  ;  then  the  Laurent  series  for 
f(z )  starts  with  the  term  (z  —  Zo)~n  unless  clq  =  0,  that  is  unless  g(zo)  =  0.  Then 
the  order  of  the  pole  of  f(z)  is  n  unless  some  factors  cancel.  In  Example  4b,  the 
sin  z  series  starts  with  z,  so  sin2  z  has  a  factor  z2;  thus  (sin2  z)/z3  has  a  simple  pole 
at  2  =  0. 


PROBLEMS,  SECTION  4 


1.  Show  that  the  sum  of  a  power  series  which  converges  inside  a  circle  C  is  an  analytic 
function  inside  C.  Hint:  See  Chapter  2,  Section  7,  and  Chapter  1,  Section  11,  and 
the  definition  of  an  analytic  function. 

2.  Show  that  equation  (4.4)  can  be  written  as  (4.5).  Then  expand  each  of  the  fractions 
in  the  parenthesis  in  (4.5)  in  powers  of  2  and  in  powers  of  1/z  [see  equation  (4.7)] 
and  combine  the  series  to  obtain  (4.6),  (4.8),  and  (4.2). 

For  each  of  the  following  functions  find  the  first  few  terms  of  each  of  the  Laurent  series 
about  the  origin,  that  is,  one  series  for  each  annular  ring  between  singular  points.  Find 
the  residue  of  each  function  at  the  origin.  ( Warning:  To  find  the  residue,  you  must  use 
the  Laurent  series  which  converges  near  the  origin.)  Hints:  See  Problem  2.  Use  partial 
fractions  as  in  equations  (4.5)  and  (4.7).  Expand  a  term  1  /(z  —  a)  in  powers  of  0  to  get  a 
series  convergent  for  |z|  <  a,  and  in  powers  of  1/z  to  get  a  series  convergent  for  |z|  >  a. 


z(z  —  1  )(z  —  2) 


z(z-  l)(z  —  2)2 


5. 


z  —  1 
z3(z-  2) 


6. 


1 

z2(l  +  z)2 


7. 


2- z 
1  -  z2 


8. 


_ 30 _ 

(1  +  z)(z  —  2)  (3  +  z ) 
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For  each  of  the  following  functions,  say  whether  the  indicated  point  is  regular,  an  essential 
singularity,  or  a  pole,  and  if  a  pole  of  what  order  it  is. 


10. 


11. 


12. 


(a) 

sin  z 

- ,  z  = 

z 

:  0 

(b) 

COS  £ 

— z  =  0 
-s'3 

(c) 

z3-l 

2  =  1 

(d) 

eZ  ~  i 

(z-1)3’ 

z-l’  ~ 

(a) 

ez  -  1 

=  2  i 

(b) 

tan2  z,  z  =  7r 

z2  +  4  ’ 

(c) 

1  —  COS  3 

z  =  0 

(d) 

(  n  \ 

24  ’ 

COa[z-7rJ  ’ 

(a) 

e2  -  1  -  2 

O 

II 

(b) 

sin~  -  n 

Z3  ’  ~  ° 

(c) 

22  -  1 

(d) 

COS  z 

(z-1)2’ 

Z  —  I 

(z  —  7t/2)4  ’  " 

(a) 

sin  z  —  z 

z  =  0 

(b) 

z2-l 

z' 0  ’ 

(z2  + 1)2’  2 

(c) 

II 

N 

=  0 

(d) 

o 

II 

5.  THERESIDUETHEOREM 

Let  z o  be  an  isolated  singular  point  of  f(z).  We  are  going  to  find  the  value  of 
§c  f(z)  dz  around  a  simple  closed  curve  C  surrounding  zq  but  inclosing  no  other 
singularities.  Let  f(z)  be  expanded  in  the  Laurent  series  (4.1)  about  z  =  Zq  that 
converges  near  z  =  zq.  By  Cauchy’s  theorem  (V),  the  integral  of  the  “a”  series  is 
zero  since  this  part  is  analytic.  To  evaluate  the  integrals  of  the  terms  in  the  “6” 
series  in  (4.1),  we  replace  the  integrals  around  C  by  integrals  around  a  circle  C'  with 
center  at  zq  and  radius  p  as  in  (3.6),  (3.7),  and  Figure  3.1.  Along  C",  z  =  zq  +  pel9; 
calculating  the  integral  of  the  b\  term  in  (4.1),  we  find 


(5.1) 


bidz 
c  (z  -  zo) 


=  h 


Jo 


pie19  d6 


pe 


iS 


2mb\. 


It  is  straightforward  to  show  (Problem  1)  that  the  integrals  of  all  the  other  bn  terms 
are  zero.  Then  j>c  f(z)  dz  =  2mb\,  or  since  b\  is  called  the  residue  of  f(z)  at  z  =  zq, 
we  can  say 


j)  f(z )  dz  =  27 xi  ■  residue  of  f(z)  at  the  singular  point  inside  C. 

The  only  term  of  the  Laurent  series  which  has  survived  the  integration  process  is 
the  b\  term;  you  can  see  the  reason  for  the  term  “residue.”  If  there  are  several 
isolated  singularities  inside  C,  say  at  zo,  z\,  Z2,  •  •  • ,  we  draw  small  circles  about 
each  as  shown  in  Figure  5.1  so  that  f(z)  is  analytic  in  the  region  between  C  and  the 
circles.  Then,  introducing  cuts  as  in  Figure  3.1,  we  find  that  the  integral  around 
C  counterclockwise,  plus  the  integrals  around  the  circles  clockwise,  is  zero  (since  the 
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Figure  5.1 

integrals  along  the  cuts  cancel),  or  the  integral  along  C  is  the  sum  of  the  integrals 
around  the  circles  (all  counterclockwise).  But  by  (5.1),  the  integral  around  each 
circle  is  2m  times  the  residue  of  f(z)  at  the  singular  point  inside.  Thus  we  have 
the  residue  theorem: 


(5.2) 


f(z )  dz  =  2ni  ■  sum  of  the  residues  of  f(z )  inside  C, 


where  the  integral  around  C  is  in  the  counterclockwise  direction. 


The  residue  theorem  is  useful  in  evaluating  many  definite  integrals;  we  shall 
consider  this  in  Section  7.  But  first,  in  Section  6,  we  need  to  develop  some  techniques 
for  finding  residues. 

PROBLEMS,  SECTION  5 

1.  If  C  is  a  circle  of  radius  p  about  zq,  show  that 

c £  t — =  2ni  if  n  =  1, 

Jc  {z  -  Zo)n 

but  for  any  other  integral  value  of  n,  positive  or  negative,  the  integral  is  zero.  Hint: 
Use  the  fact  that  z  =  z o  +  pet8  on  C. 

2.  Verify  the  formulas  (4.3)  for  the  coefficients  in  a  Laurent  series.  Hint:  To  get  an, 
divide  equation  (4.1)  by  (z  —  zo)n+1  and  use  the  results  of  Problem  1  to  evaluate 
the  integrals  of  the  terms  of  the  series.  Use  a  similar  method  to  find  bn. 

3.  Obtain  Cauchy’s  integral  formula  (3.9)  from  the  residue  theorem  (5.2). 

6.  METHODS  OF  FINDING  RESIDUES 

A.  Laurent  Series  If  it  is  easy  to  write  down  the  Laurent  series  for  f(z )  about 
z  =  Zq  that  is  valid  near  zq,  then  the  residue  is  just  the  coefficient  b\  of  the  term 
1/(2  —  Zo).  Caution:  Be  sure  you  have  the  expansion  about  z  =  zq;  the  series  you 
have  memorized  for  ez,  sinz,  etc.,  are  expansions  about  z  =  0  and  so  can  be  used 
only  for  finding  residues  at  the  origin  (see  Section  4,  Example  3).  Here  is  another 
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example:  Given  f(z)  =  ez/(z  —  1),  find  the  residue,  R{  1),  of  f(z )  at  z  =  1.  We 
want  to  expand  ez  in  powers  of  z  —  1;  we  write 


ez  e  ■  ez  1 
2-1 =  2-1 


e 

2-1 


1  +  (2  -  1)  + 


(z-1)2  , 
2! 


e 

= - -  +  e  +  •  •  •  . 

2  —  1 


Then  the  residue  is  the  coefficient  of  1/(2  —  1),  that  is,  -R(l)  =  e. 


B.  Simple  Pole  If  f(z)  has  a  simple  pole  at  2  =  2o,  we  find  the  residue  by 
multiplying  f{z)  by  (2  —  20)  and  evaluating  the  result  at  2  =  2q  (Problem  10). 


Example  1.  Find  R{—  i)  and  R{ 5)  for 

~  (22  +  1)(5  —  2)  ’ 


Multiply  f(z)  by  (2  +  ^),  [Caution:  not  by  (2 2  +  1)],  and  evaluate  the  result  at 
2  =  —  We  find 


Similarly, 


(2+  |)/(2) 

i?K) 


+  it _ i _ 

1  +  2j(22+l)(5-2) 


2(5  +i)  22' 


2 

2(5-2)’ 


(*  -  5)/(2) 

i?(5) 


5)(22+l)(5-2) 

5 

_n' 


2 

22  +  1’ 


Example  2.  Find  i?(0)  for  f(z)  =  (cosz)/z.  Since  2/(2)  =  cos  2,  we  have 

i?(0)  =  (cos  2)2=o  =  cosO  —  1. 

To  use  this  method,  we  may  in  some  problems  have  to  evaluate  an  indeterminate 
form,  so  in  general  we  write 


(6.1)  R(zo)  =  lim  (2  —  zo)f(z)  when  20  is  a  simple  pole. 

2— *20 


Example  3.  Find  the  residue  of  cot  2  at  2  =  0.  By  (6.1) 

R{ 0)  =  lim  =  cosO  •  lim  —  =  1-1  =  1. 

2^0  sin  2  2^0  sin  2 

If,  as  often  happens,  f(z)  can  be  written  as  g(z)/h(z ),  where  g(z)  is  analytic 
and  not  zero  at  2 0  and  h(zo)  =  0,  then  (6.1)  becomes 


R(zo)  =  lim 
2— >20 


(2  ~  20)g(2) 

h(z) 


g(zo)  lim 

Z^Zo 


z  -  Zp 

h(z) 


g{z0)  lim 

Z^Zq 


1 

W) 


g(z  0) 

h'(zo) 
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by  L’Hopital’s  rule  or  the  definition  of  h'(z)  (Problem  11). 
Thus  we  have 


(6.2) 


R{z0) 


g{z  o) 
h'(z0) 


f(z)  =  g{z)/h(z),  and 
if  g(zg)  =  finite  const.  ^  0,  and 
Ji(z0)  =  0,  h'(zo)  ±  0. 


Often  (6.2)  gives  the  most  convenient  way  of  finding  the  residue  at  a  simple  pole. 


Example  4.  Find  the  residue  of  (sinz)/(l  —  z4)  at  z  =  i.  By  (6.2)  we  have 


m 


sin  z  sin  i 

—4 z3  z=i  —4 i3 


(2i)(4i) 


—  (e  —  e  1)  =  -  sinh  1. 
8V  ’  4 


Now  you  may  ask  how  you  know,  without  finding  the  Laurent  series,  that  a 
function  has  a  simple  pole.  Perhaps  the  simplest  answer  is  that  if  the  limit  obtained 
using  (6.1)  is  some  constant  (not  0  or  oo),  then  f(z)  does  have  a  simple  pole  and  the 
constant  is  the  residue.  [If  the  limit  =  0,  the  function  is  analytic  and  the  residue 
=  0;  if  the  limit  is  infinite,  the  pole  is  of  higher  order.]  However,  you  can  often 
recognize  the  order  of  a  pole  in  advance.  [See  end  of  Section  4  for  the  simple  case 
in  which  (z  —  zo)n  is  a  factor  of  the  denominator.]  Suppose  f(z )  is  written  in  the 
form  g(z)/h(z),  where  g{z)  and  h(z)  are  analytic.  Then  you  can  think  of  g(z)  and 
h(z)  as  power  series  in  (z  —  zg).  If  the  denominator  has  the  factor  (z  —  zg)  to  one 
higher  power  than  the  numerator,  then  f(z)  has  a  simple  pole  at  zg.  For  example, 


9  z  COS2  z 
ZCOt  2  =  - 2 - 

sin  ^ 


z(l  —  z1 12  +  •  •  •  )2 
(z  -  z3/3!  +  ■  ■  ■  )2 


*(!+•••) 
z2(l  +  ---) 


has  a  simple  pole  at  z  =  0.  By  the  same  method  we  can  see  whether  a  function  has 
a  pole  of  any  order 


C.  Multiple  Poles  When  f(z)  has  a  pole  of  order  n,  we  can  use  the  following 
method  of  finding  residues. 


Multiply  f(z)  by  (z  —  zg)m,  where  m  is  an  integer  greater  than  or  equal  to  the 
order  n  of  the  pole,  differentiate  the  result  m  —  1  times,  divide  by  (m  —  1)!,  and 
evaluate  the  resulting  expression  at  z  =  zg. 


It  is  easy  to  prove  that  this  rule  is  correct  (Problem  12)  by  using  the  Laurent 
series  (4.1)  for  f(z)  and  showing  that  the  result  of  the  outlined  process  is  b\. 
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Example  5.  Find  the  residue  of  f(z)  =  (zsinz)/(z  —  7 r)3  at  z  =  n. 

We  take  m  =  3  to  eliminate  the  denominator  before  differentiating;  this  is  an 
allowed  choice  for  m  because  the  order  of  the  pole  of  /(z)  at  7r  is  not  greater  than 
3  since  zsinz  is  finite  at  7r.  (The  pole  is  actually  of  order  2,  but  we  do  not  need 
this  fact.)  Then  following  the  rule  stated,  we  get 


P/  n  1  d2  . 

flW  =  2i3?(*smz) 


=  -[— zsinz  +  2  cos  z]z=7r  =  — 1. 


(To  compute  the  derivative  quickly,  use  Leibniz’  rule  for  differentiating  a  product; 
see  Chapter  12,  Section  3.) 

Much  of  this  work  can  be  done  by  computer.  However,  remember  that  the  point 
of  doing  these  problems  is  to  gain  skill  in  using  the  ideas  and  techniques  of  complex 
variable  theory.  So  a  good  study  method  is  to  do  the  problems  as  outlined  above 
and  then  check  your  results  by  computer. 


PROBLEMS,  SECTION  6 

Find  the  Laurent  series  for  the  following  functions  about  the  indicated  points;  hence  find 
the  residue  of  the  function  at  the  point.  (Be  sure  you  have  the  Laurent  series  which 
converges  near  the  point.) 


1. 

4. 

7. 

10. 

11. 

12. 

13. 


z(z  +  1) 
cosh  z 


z  =  0 


z * 

sin-7rz 
4z2  —  1 


,  z  =  0 


2. 


5. 


8. 


z(z-  1) 
ez 


z2  -  1 
1  +  cos  z 

(z  —  7r)  2 


Z  =  1 


z  =  1 


3. 


6. 


9. 


z  =  0 


z  =  0 


z2  -  5z  +  6 


z  =  2 


Show  that  rule  B  is  correct  by  applying  it  to  (4.1). 

Derive  (6.2)  by  using  the  limit  definition  of  the  derivative  h'(zo)  instead  of  using 
L’Hopital’s  rule.  Remember  that  h(zo)  =  0  because  we  are  assuming  that  /(z)  has 
a  simple  pole  at  zo- 

Prove  rule  C  for  finding  the  residue  at  a  multiple  pole,  by  applying  it  to  (4.1).  Note 
that  the  rule  is  valid  for  n  =  1  (simple  pole)  although  we  seldom  use  it  for  that  case. 

Prove  rule  C  by  using  (3.9).  Hints:  If  /(z)  has  a  pole  of  order  n  at  z  =  a,  then 
f(z)  =  g(z)/(z  —  a)n  with  g(z)  analytic  at  z  =  a.  By  (3.9) 


L 


g{z) 

c  («  -  a) 


dz  =  27 rig(a) 


with  C  a  contour  inclosing  a  but  no  other  singularities.  Differentiate  this  equation 
(n  —  1)  times  with  respect  to  a.  (Or,  use  Problem  3.21.) 

Find  the  residues  of  the  following  functions  at  the  indicated  points.  Try  to  select  the 
easiest  of  the  methods  outlined  above.  Check  your  results  by  computer. 


14. 


16. 


(3z  +  2)(2  —  z) 

z  —  2 


atz  =  — |;atz  =  2  15. 


(1  -  2z)(5z  —  4) 


at  z  =  | ;  at  z  =  | 


z(l-z) 


at  z  =  0;  at  z  =  1 


17. 


z  +  2  1  1 

-jpj— I  atz  =  -;atz  =  — - 
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18. 

20. 

22. 

24. 

26. 

28. 

30. 

32. 

34. 


z  +  2 


22  +9 

at 

2 

at 

1  -  24 

e22 

at 

l+ez 

1  —  COS  2 2 

z3 

2iriz 

at 

at  z  =  0 

27ri/3 

•y  -  (3  ' 


1  —  z3 

z  +  2 

(. z 2  +  9  )(z2  +  1) 
cosh  2  —  1 


at  2  =  3i 


at  2  =  0 


(z2  +  4)2 
2-2 


at  2  =  2i 


o/-,  n  ,,  at  ^  =  0  and  at  2  =  - 
z2( 1  —  2 z)2  2 


19. 

21. 

23. 

25. 

27. 

29. 

31. 

33. 

35. 


2z  —  TV 


at  2  =  7t/2 


24  +  16 


at  2  =  \/2(l  +  i ) 


e12  2* 

— - -  at  2  =  — 

922  +  4  3 


e22  -  1 
22 

COS  2 

1  —  2  sin  2 
e22 

4  cosh  2  —  5 

e32  -  32  -  1 
24 

1  +  COS  2 


at  2  =  0 


at  2  =  7t/6 


at  2  =  In  2 


at  2  =  0 


(n  —  z)3 
z 

(22  +  l)2 


at  2  =  7T 


at  z  =  i 


14'  to  35'  Use  the  residue  theorem  to  evaluate  the  contour  integrals  of  each  of  the  functions 
in  Problems  14  to  35  around  a  circle  of  radius  |  and  center  at  the  origin.  Check  carefully 
to  see  which  singular  points  are  inside  the  circle.  You  may  use  your  results  in  the  previous 
problems  as  far  as  they  go,  but  you  may  have  to  compute  some  more  residues. 

36.  For  complex  2,  JP(z)  can  be  defined  by  the  series  (12.9)  in  Chapter  12.  Use  this 
definition  to  find  the  Laurent  series  about  2  =  0  for  z~3Jo(z).  Find  the  residue  of 
the  function  at  2  =  0. 

37.  The  gamma  function  F(2)  is  analytic  except  for  poles  at  2  =  x  =  0,  —1,  —2,  —3  •  •  • 
(all  the  negative  integers).  Find  the  residues  at  these  poles.  Hints:  See  Example  1 
above  and  Chapter  11,  Equation  (4.1). 


7.  EVALUATION  OF  DEFINITE  INTEGRALS  BY  USE  OF  THE  RESIDUE  THEOREM 

We  are  going  to  use  (5.2)  and  the  techniques  of  Section  6  to  evaluate  several  different 
types  of  definite  integrals.  The  methods  are  best  shown  by  examples. 


Example  1.  Find  I  = 

If  we  make  the  change  of  variable  z  =  e  ,  then  as 
9  goes  from  0  to  2-7T,  2  traverses  the  unit  circle  |z|  =  1 
(Figure  7.1)  in  the  counterclockwise  direction,  and 
we  have  a  contour  integral.  We  shall  evaluate  this 
integral  by  the  residue  theorem.  If  z  =  e*e,  we  have 


/ 


dd 


5  +  4  cos  9 


dz  =  iel6  d9  =  iz  d9  or  d9  =  —  dz, 

iz 


A9 


i9 


cost/  = 


2 


2 


Figure  7.1 
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Making  these  substitutions  in  /,  we  get 


I  = 


—  dz 
iz 


c  5  +  2 (z  +  1  / z) 


If  dz 
i  Jc  5z  +  2 z2  +  2 


If  dz 

i  Jc  (2-2  +  l)(-2  +  2)  ’ 


where  C  is  the  unit  circle.  The  integrand  has  poles  at  z  =  —  \  and  2  =  —2;  only 
z  =  —  \  is  inside  the  contour  C.  The  residue  of  l/[(2 z  +  1  )(z  +  2)]  at  z  =  —  \  is 


lim  (z  +  i)  • 
2— >  — 1/2  1 


1 

(2  z  +  1)  (z  +  2) 


1 

2(z  +  2) 


l 

3' 


Then  by  the  residue  theorem 

1  i  ,  27T 

/  =  -  2niR(-\)  =  2tt  ■  ±  =  — . 

X  o 

This  method  can  be  used  to  evaluate  the  integral  of  any  rational  function  of 
sin  (9  and  cos0  between  0  and  27r,  provided  the  denominator  is  never  zero  for  any 
value  of  9.  You  can  also  find  an  integral  from  0  to  7r  if  the  integrand  is  even,  since 
the  integral  from  0  to  2n  of  an  even  periodic  function  is  twice  the  integral  from  0 
to  7T  of  the  same  function.  (See  Chapter  7,  Section  9  for  discussion  of  even  and  odd 
functions.) 


Example  2.  Evaluate  /  = 


dx 


'-oo  1  +  X2 


Here  we  could  easily  find  the  indefinite  integral  and  so  evaluate  I  by  elemen¬ 
tary  methods.  However,  we  shall  do  this  simple  problem  by  contour  integration  to 
illustrate  a  method  which  is  useful  for  more  complicated  problems. 

This  time  we  are  not  going  to  make  a  change  of  variable  in  I.  We  are  going  to 
start  with  a  different  integral  and  show  how 
to  find  I  from  it.  We  consider 

dz 


cl  +  z2’ 

where  C  is  the  closed  boundary  of  the  semi¬ 
circle  shown  in  Figure  7.2.  For  any  p  >  1,  the 
semicircle  incloses  the  singular  point  z  =  i 
and  no  others;  the  residue  of  the  integrand  at 
z  =  i  is 


R(i)  =  lim(£  —  i) 


1 


(z-i)(z  +  i) 


1 

~2i 


Figure  7.2 


Then  the  value  of  the  contour  integral  is  27ri(l/2«)  =  -k.  Let  us  write  the  integral 
in  two  parts:  (1)  an  integral  along  the  x  axis  from  —p  to  p;  for  this  part  z  =  x;  (2) 
an  integral  along  the  semicircle,  where  z  =  pel6 .  Then  we  have 


(7.1) 


dz 

TTz 2 


fp  dx  r  pieie  dO 

_p  1  +  x2  +  J0  1  +  p2e2ie ' 
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We  know  that  the  value  of  the  contour  integral  is  7 r  no  matter  how  large  p  becomes 
since  there  are  no  other  singular  points  besides  z  =  i  in  the  upper  half-plane.  Let 
p  — >  00;  then  the  second  integral  on  the  right  in  (7.1)  tends  to  zero  since  the 
numerator  contains  p  and  the  denominator  p2.  Thus  the  first  term  on  the  right 
tends  to  7r  (the  value  of  the  contour  integral)  as  p  — >  00,  and  we  have 


1  = 


dx 

1  +  x2 


=  7T. 


This  method  can  be  used  to  evaluate  any  integral  of  the  form 

rmdx 

J- 00  Q{x) 

if  P(x)  and  Q(x)  are  polynomials  with  the  degree  of  Q  at  least  two  greater  than 
the  degree  of  P,  and  if  Q(z)  has  no  real  zeros  (that  is,  zeros  on  the  x  axis).  If  the 
integrand  P{x) /Q{x)  is  an  even  function,  then  we  can  also  find  the  integral  from  0 
to  00. 


Example  3.  Evaluate  I 


f°°  cos  x  dx 

Jo  l  +  x2  ' 


We  consider  the  contour  integral 


f  eizdz 

fcTT. 

where  C  is  the  same  semicircular  contour  as  in  Example  2.  The  singular  point 
inclosed  is  again  z  =  i,  and  the  residue  there  is 

eiz  e-1  1 

lim(z  —  i)  - - Try - :r  =  -7—-  =  - — . 

z^i  (z  —  i)(z  +  i)  2 1  2  le 

The  value  of  the  contour  integral  is  27rz(l/2ie)  =  7r/e.  As  in  Example  2  we  write 
the  contour  integral  as  a  sum  of  two  integrals: 

eix  dx  f  eiz  dz 

1  +  x2  J  1  +  z2 ' 

along  upper  half 
oi  z  =  pezG 

As  before,  we  want  to  show  that  the  second  integral  on  the  right  of  (7.2)  tends  to 
zero  as  p  — >  00.  This  integral  is  the  same  as  the  corresponding  integral  in  (7.1) 
except  for  the  elz  factor.  Now 

\eiz\  =  |eix-»|  =  \eix\\e-y\  =  e~y  <  1 


since  y  >  0  on  the  contour  we  are  considering.  Since  \elz\  <  1,  this  factor  does  not 
change  the  proof  given  in  Example  2  that  the  integral  along  the  semicircle  tends  to 
zero  as  the  radius  p  — >  00.  We  have  then 

e“  ,  7 r 
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or  taking  the  real  part  of  both  sides  of  this  equation, 


cos  x  dx 
1  +  x2 


7 r 

e 


Since  the  integrand  (cosx)/(l  +  x2)  is  an  even  function,  the  integral  from  0  to  oo 
is  half  the  integral  from  —  oo  to  oo.  Hence  we  have 


1  = 


f°°  cos  x  dx 

Jo  1  +  x2 


7 r 
2e 


Observe  that  the  same  proof  would  work  if  we  replaced  elz  by  emz  (m  >  0)  in 
the  above  integrals.  At  the  point  where  we  said  e~y  <  1  (since  y  >  0)  we  would 
then  want  e~my  <  1  for  y  >  0,  which  is  true  if  m  >  0.  [For  m  <  0,  we  could  use  a 
semicircle  in  the  lower  half-plane  (y  <  0);  then  we  would  have  emy  <  1  for  y  <  0. 
This  is  an  unnecessary  complication,  however,  in  evaluating  integrals  containing 
sinmx  or  cos  mx  since  we  can  then  choose  m  to  be  positive.]  Although  we  have 
assumed  here  that  (as  in  Example  2)  Q(x)  is  of  degree  at  least  2  higher  than  P(x), 
a  more  detailed  proof  (see  books  on  complex  variables)  shows  that  degree  at  least 
one  higher  is  enough  to  make  the  integral 


/ 


P(=) 

Q(z)( 


dz 


around  the  semicircle  tend  to  zero  as  p  — >  oo.  Thus 

f  elmx  dx  =  2iri  ■  sum  of  the  residues  of 

J- oo  <20*0  <2(^) 


in  the  upper  half-plane  if  all  the  following  requirements  are  met: 

1.  P(x)  and  Q(x)  are  polynomials,  and 

2.  Q(x)  has  no  real  zeros,  and 

3.  the  degree  of  Q{x)  is  at  least  1  greater  than  the  degree  of  P(x),  and  m  >  0. 


By  taking  real  and  imaginary  parts,  we  then  find  the  integrals 

p(x)  J  r  pw  ■ 

.  cos  mxdx,  /  .  .  sin  mxdx 

Q{X)  J- oo  Q(x) 


Example  4. 


Evaluate 


smx 

- dx. 

x 


Here  we  remove  the  restriction  of  Examples  2  and  3  that  Q{x )  has  no  real  zeros. 
As  in  Example  3,  we  consider 


To  avoid  the  singular  point  at  z  =  0,  we  integrate  around  the  contour  shown  in 
Figure  7.3.  We  then  let  the  radius  r  shrink  to  zero  so  that  in  effect  we  are  integrat¬ 
ing  straight  through  the  simple  pole  at  the  origin.  We  are  going  to  show  (later  in  this 
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section  and  Problem  21)  that  the  net  result  of  inte¬ 
grating  in  the  counterclockwise  direction  around  a 
closed  contour  which  passes  straight^  through  one 
or  more  simple  poles  is  2ni  ■  (sum  of  the  residues  at 
interior  points  plus  one-half  the  sum  of  the  residues 
at  the  simple  poles  on  the  boundary) .  ( Warning: 
this  rule  does  not  hold  in  general  for  a  multiple  pole 
on  a  boundary.)  You  might  expect  this  result.  If  a 
pole  is  inside  a  contour,  it  contributes  2ni-  residue,  to  the  integral;  if  it  is  outside, 
it  contributes  nothing;  if  it  is  on  the  straight  line  boundary,  its  contribution  is  just 
halfway  between  zero  and  2ni-  residue.  [See  Am.  J.  Phys.  52,  276  (1984).]  Using 
this  fact,  and  observing  that,  as  in  Example  3,  the  integral  along  the  large  semicircle 
tends  to  zero  as  R  tends  to  infinity,  we  have 


/  e*z 

dx  =  2ni  ■  —  residue  of  —  at  z  =  0  )  =  27r*  •  —  •  1  =  in. 


1 

J- oo  x  2  V  ^ 

Taking  the  imaginary  parts  of  both  sides,  we  get 

sin  x 


■  dx  =  7T. 


x 


To  show  more  carefully  that  our  result  is  correct,  let  us  return  to  the  contour  of 
Figure  7.3.  Since  elz / '  z  is  analytic  inside  this  contour,  the  integral  around  the  whole 
contour  is  zero.  As  we  have  said,  the  integral  along  C  tends  to  zero  as  R  — >  oo  by 
the  theorem  at  the  end  of  Example  3.  Along  the  small  semicircle  C' ,  we  have 


A8 


dz  =  re 
r  eiz  dz 

Jc  z 


d-> 

ieid6 ,  —  =  idO , 


=  /  elzidO. 

Jc 


As  r  ->  0,  2  ->  0,  eiz 
Figure  7.3)  tends  to 


1,  and  the  integral  (along  C  in  the  direction  indicated  in 
rO 


idd  =  —in. 


Then  we  have  as  R  — *  oo,  and  r  — >  0, 


or 


■  dx  —  in  =  0 


■  dx  =  *7 


as  before.  Taking  real  and  imaginary  parts  of  this  equation  (and  using  Euler’s 
formula  e“  =  cos  x  +  i  sin  x) ,  we  get 


cos  x 


dx  =  0, 


sin  x 


dx  =  n. 


fBy  “straight”  we  mean  that  the  contour  curve  has  a  tangent  at  the  pole,  that  is,  it  does  not 
turn  a  corner  there. 
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Since  (sin  a;) /a:  is  an  even  function,  we  have 


sin  x  1 

- dx  =  - 

x  2 


sm  x 


dx  = 


[For  another  way  of  evaluating  this  integral,  see  Chapter 


7T 

2' 

7,  equation  (12.19).] 


Principal  value  Now  consider  the  cosine  integral. 

dx 

is  a  divergent  integral  since  the  integrand  (cos  x)  /  x  is  approximately  1  jx  near  x  =  0 . 
The  value  zero  which  we  found  for  I  =  [_  (cos  x)/x  dx  is  called  the  principal  value 
(or  Cauchy  principal  value)  of  I.  To  see  what  this  means,  consider  a  simpler  integral, 
namely 

r5  dx 

Jo  x  ~  3 

The  integrand  becomes  infinite  at  x  =  3,  and  both  /Q3  dx/(x  —  3)  and  J ^  dx/ (x  —  3) 
are  divergent.  Suppose  we  cut  out  a  small  symmetric  interval  about  x  =  3,  and 
integrate  from  0  to  3  —  r  and  from  3  +  r  to  5.  We  find 


dx 

x  —  3 


f5  dx 

J 3+r  X  ~  3 


3 -r 


In  \x  —  3| 


o 


=  In  r  —  In  3, 


In  2  —  In  r. 


The  sum  of  these  two  integrals  is 


In  2  —  In  3  =  In  ■ 


this  sum  is  independent  of  r.  Thus,  if  we  let  r  — >  0,  we  get  the  result  In  |  which  is 
called  the  principal  value  of 


dx 

x  —  3 


^often  written  PV 


dx 

x  —  3 


The  terms  In  r  and  —  In  r  have  been  allowed  to  cancel  each  other;  graphically  an 
infinite  area  above  the  x  axis  and  a  corresponding  infinite  area  below  the  x  axis  have 
been  canceled.  In  computing  the  contour  integral  we  integrated  along  the  x  axis 
from  —oo  up  to  — r,  and  from  +r  to  +oo,  and  then  let  r  — >  0;  this  is  just  the  process 
we  described  for  finding  principal  values,  so  the  result  we  found  for  the  improper 
integral  J//°  (cos  x) /x  dx ,  namely  zero,  was  the  principal  value  of  this  integral. 


Exa  m  p  I  e  5.  Evaluate 

rp- 1 

——dr,  0<p<l, 

1  +  r 

and  use  the  result  to  prove  (5.4)  of  Chapter  11. 
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We  first  find 


(7.3) 


/ 


zp~ 1 
1  +  2 


dz, 


0  <  p  <  1, 


around  C  in  Figure  7.4. 


Before  we  can  evaluate  this  integral,  we  must  ask  what  zp_1  means,  since  for  each  2 
there  may  be  more  than  one  value  of  zp_1.  (See  discussion  of  branches  at  the  end  of 
Section  1.)  For  example,  consider  the  case  p  =  1;  then  zp_1  =  z-1/2.  Recall  from 
Chapter  2,  Section  10,  that  there  are  two  square  roots  of  any  complex  number.  At 
a  point  where  6  =  7t/4,  say,  we  have 

2  =  r^'\  z -1/2  =  r-1/2e-i7r/8. 


But  if  9  increases  by  27r  (we  think  of  following  a  circle  around  the  origin  and  back 
to  our  starting  point),  we  have 

2  =  rei0+4+2+;  „-l/2  =  r-l/2e-i(w/8+7T)  =  _r- l/2e— wr/8. 


Similarly,  for  any  starting  point  (with  r  ^  0),  we  find  that  z-1/2  or  zp_1  comes 
back  to  a  different  value  (different  branch)  when  9  increases  by  27t  and  we  return 
to  our  starting  point.  If  we  want  to  use  the  formula  zp_1  to  define  a  (single- valued) 
function,  we  must  decide  on  some  interval  of  length  27t  for  9  (that  is,  we  must 
select  one  branch  of  zp_1).  Let  us  agree  to  restrict  9  to  the  values  of  0  to  2-7T  in 
evaluating  the  contour  integral  (7.3).  We  may  imagine  an  artificial  barrier  or  cut 
(which  we  agree  not  to  cross)  along  the  positive  x  axis;  this  is  called  a  branch  cut. 
(See  Example  3,  Section  9.)  A  point  which  we  cannot  encircle  (on  an  arbitrarily 
small  circle)  without  crossing  a  branch  cut  (thus  changing  to  another  branch)  is 
called  a  branch  point ;  the  origin  is  a  branch  point  here. 

In  Figure  7.4,  then,  0  =  0  along  AB  (upper  side 
of  the  positive  x  axis);  when  we  follow  C  around  to 
DE,  9  increases  by  27r,  so  9  =  2n  on  the  lower  side  of 
the  positive  x  axis.  Note  that  the  contour  in  Figure 
7.4  never  takes  us  outside  the  0  to  27t  interval,  so  the 
factor  zp— 1  in  (7.3)  is  a  single-valued  function.  The 
integrand  in  (7.3),  namely  zp_1/(  1  +  2),  is  now  an 
analytic  function  inside  the  closed  curve  C  in  Figure 
7.4  except  for  the  pole  at  2  =  —  1  =  em .  The  residue 
there  is  (el7I’)p~1  =  —  el^p.  Then  we  have 

(7.4)  <j)  j - dz  =  —27 rfe”rp,  0  <  p  <  1. 

Along  either  of  the  two  circles  in  Figure  7.4  we  have  z  =  relS  and  the  integral  is 


rp-l  Pi{p-1)8 


1  +  re 


id 


rie lB  d9  =  i 


rPeip8 


1  +  re 


i0 


d9. 


This  integral  tends  to  zero  if  r  — >  0  or  if  r  — >■  00.  (Verify  this;  note  that  the 
denominator  is  approximately  1  for  small  r,  and  approximately  reie  for  large  r.) 
Thus  the  integrals  along  the  circular  parts  of  the  contour  tend  to  zero  as  the  little 
circle  shrinks  to  a  point  and  the  large  circle  expands  indefinitely.  We  are  left  with 
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the  two  integrals  along  the  positive  x  axis  with  AB  now  extending  from  0  to  oo  and 
DE  from  oo  to  0.  Along  AB  we  agreed  to  have  9  =  0,  so  z  =  re1'0  =  r,  and  this 
integral  is 


/ 

J  r= 


■  dr. 


/r=0  1  +  r 

Along  DE,  we  have  9  =  2n,  so  z  =  re2’1’1  and  this  integral  is 

fO  (re27Ti)p-l 


Jl'ni 


dr  =  — 


/  1  I  |’p27ri 

J  r=oo  x  ° 

Adding  the  AB  and  DE  integrals,  we  get 

r°°  rp~  i 


/  o 


1  +  r 


dr. 


(1  -  e27rip 
by  (7.4).  Then  the  desired  integral  is 
(7.5) 


dr  =  —27 rieI7rp 


/■°°  rp_1  ,  -2t riei7rp  n  ■  2i 

I  - dr  =  - =  — - : — 

o  1  +  r  1  -  e2™P  el7rP  -  e~™P 


sin  irp 

Let  us  use  (7.5)  to  obtain  (5.4)  of  Chapter  11.  Putting  q  =  1  —  p  in  (6.5)  and 
(7.1)  of  Chapter  11,  we  have 


(7.6) 


B(p,l  -p)=  f 
Jo 


y- 


.p-i 


l  +  y 


dy 


and 


B(p,l-p)=T(p)T(l-p)  r(i)  =  1. 
Combining  (7.5)  and  (7.6)  gives  (5.4)  of  Chapter  11,  namely 

f°°  yP-1 

rb)r(1  ~P)  =  B(p,  1  -p)=  /  — —  dy  = 

Jo  1  +  y 


sm  7T p 


Argument  Principle  Since  w  =  f(z)  is  a  complex  number  for  each  z,  we  can 
write  w  =  Rex&  (just  as  we  write  z  =  re1 6)  where  R  =  |w|  and  0  is  the  angle  of  w 
[or  we  could  call  it  the  angle  of  f(z)}.  As  z  changes,  w  =  f(z)  also  changes  and  so 
R  and  0  vary  as  we  go  from  point  to  point  in  the  complex  {x,  y)  plane.  We  want 
to  show  that 

(a)  if  f(z)  is  analytic  on  and  inside  a  simple  closed  curve  C  and  f(z)  ^  0  on  C, 
then  the  number  of  zeros  of  f(z)  inside  C  is  equal  to  (1/2tt)-  (change  in  the  angle 
of  f(z)  as  we  traverse  the  curve  C); 

(b)  if  f(z)  has  a  finite  number  of  poles  inside  C,  but  otherwise  meets  the  re¬ 
quirements  stated, §  then  the  change  in  the  angle  of  f(z)  around  C  is  equal  to  {2tt)- 
(the  number  of  zeros  minus  the  number  of  poles). 

(Just  as  we  say  that  a  quadratic  equation  with  equal  roots  has  two  equal  roots, 
so  here  we  mean  that  a  zero  of  order  n  counts  as  n  zeros  and  a  pole  of  order  n 
counts  as  n  poles.) 

To  show  (a)  and  (b)  we  consider 


m 

/(*) 


dz. 


§A  function  which  is  analytic  except  for  poles  is  called  meromorphic. 
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By  the  residue  theorem,  the  integral  is  equal  to  27r*-(sum  of  the  residues  at  singu¬ 
larities  inside  C).  It  is  straightforward  to  show  (Problem  42)  that  the  residue  of 
F(z)  =  f'(z)/f(z)  at  a  zero  of  f(z)  of  order  n  is  n,  and  the  residue  of  F(z)  at  a 
pole  of  f(z)  of  order  p  is  —p.  Then  if  N  is  the  number  of  zeros  and  P  the  number 
of  poles  of  f(z)  inside  C ,  the  integral  is  27 ri(N  —  P).  Now  by  direct  integration,  we 
have 


(7.7) 


£  /£! dz = in/(z)i  c = inRei&\ c 


=  Ln  R\ 


iQ 


c> 


where  R  =  \f(z)\  and  0  is  the  angle  of  f(z).  Recall  from  Chapter  2,  Section  13,  that 
Ln  R  means  the  ordinary  real  logarithm  (to  the  base  e)  of  the  positive  number  R, 
and  is  single- valued;  ln  f(z)  is  multiple-valued  because  0  is  multiple- valued.  Then 
if  we  integrate  from  a  point  A  on  C  all  the  way  around  the  curve  and  back  to  A, 
Ln  R  has  the  same  value  at  A  both  at  the  beginning  and  at  the  end,  so  the  term 
Lni?|c  is  Lni?  at  A  minus  Lni?  at  A ;  this  is  zero.  The  same  result  may  not  be 
true  for  0;  that  is,  the  angle  may  have  changed  as  we  go  from  point  A  all  the  way 
around  C  and  back  to  A.  (Think,  for  example,  of  the  angle  of  z  as  we  go  from  z  =  1 
around  the  unit  circle  and  back  to  z  =  1;  the  angle  of  z  has  increased  from  0  to 
27t.)  Collecting  our  results,  we  have 


<7-8'  N-p=hiMd-h^ 
1 


27T 


(change  in  the  angle  of  f(z)  around  C), 


where  N  is  the  number  of  zeros  and  P  the  number  of  poles  of  f(z)  inside  (7,  with 
poles  of  order  n  counted  as  n  poles  and  similarly  for  zeros  of  order  n.  Equation 
(7.8)  is  known  as  the  argument  principle  (recall  from  Chapter  2  that  argument 
means  angle). 


This  principle  is  often  used  to  find  out  how  many  zeros  (or  poles)  a  given  func¬ 
tion  has  in  a  given  region.  (Locating  the  zeros  of  a  function  has  important  appli¬ 
cations  to  determining  the  stability  of  linear  systems  such  as  electric  circuits  and 
servomechanisms . 

Example  6.  Let  us  show  that  f(z)  =  z3  +  4z  +  1  has  exactly  one  zero  in  the  first 
quadrant.  The  closed  curve  C  in  (7.8)  is,  for  this  problem,  the  contour  OPQ 
in  Figure  7.5,  where  PQ  is  a  large  quarter  circle.  We  first  observe  that  on  the  x 
axis,  x3  +  4a;  +  1  >  0  for  x  >  0,  and  on  the  y  axis,  (iy)3  +  4 iy  +  1  ^  0  for  any 
y  (since  its  real  part,  namely  1,  ^  0).  Then  f(z)  ^  0  on 
OP  or  OQ.  Also  f(z)^0  on  PQ  if  we  choose  a  circle  large 
enough  to  inclose  all  zeros.  We  now  want  to  find  the  change 
in  the  angle  0  of  f(z)  =  Rel&  as  we  go  around  (7.  Along 
OP,  z  =  x;  then  f(z)  =  f(x)  is  real  and  so  0  =  0.  Along 
PQ,  z  =  re™ ,  with  r  constant  and  very  large.  For  very  large 
r,  the  z3  term  in  f(z)  far  outweighs  the  other  terms,  and  we 
have  f(z)  =  z3  =  r3e3t6 .  As  9  goes  from  0  to  7r/2  along  PQ, 


Figure  7.5 
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0  =  3 9  goes  from  0  to  3n/2.  On  QO ,  z  =  iy,  f(z)  =  — iy 3  +  4 iy  +  1;  then 

imaginary  part  of  f(z)  4 y  —  y 3 

LeUl  \~)  ==  - - — - -  =  - . 

real  part  of  f(z)  1 

For  very  large  y  (that  is,  at  Q),  we  had  0  =  3n/2  (for  y  =  oo,  we  would  have 
tan#  =  — oo,  and  0  would  be  exactly  3n/2).  Now  as  y  decreases  along  QO ,  the 
value  of  tan  0  =  4 y  —  y3  decreases  in  magnitude  but  remains  negative  until  it 
becomes  0  at  y  =  2.  This  means  that  0  changes  from  3ir/2  to  2n.  Between  y  =  2 
and  y  =  0,  the  tangent  becomes  positive,  but  then  decreases  to  zero  again  without 
becoming  infinite.  This  means  that  the  angle  O  increases  beyond  27t  but  not  as  far 
as  27r+7r/2,  and  then  decreases  again  to  27t.  Thus  the  total  change  in  0  around  C  is 
27t,  and  by  (7.8),  the  number  of  zeros  of  f(z)  in  the  first  quadrant  is  (l/2ir)  -2tt  =  1. 
If  we  realize  that  (for  a  polynomial  with  real  coefficients)  the  zeros  off  the  real  axis 
always  occur  in  conjugate  pairs,  we  see  that  there  must  also  be  one  zero  for  z  in 
the  fourth  quadrant,  and  the  third  zero  must  be  on  the  negative  x  axis. 


Bromwich  integral  (Inverse  Laplace  Transform)  In  Chapter  8  (Section  8ff), 
we  found  inverse  Laplace  transforms  from  a  table  (pages  469-471),  or  by  computer, 
but  we  had  no  general  formula  for  the  inverse  transform.  By  analogy  with  Fourier 
transforms  (Chapter  7,  Section  12),  where  we  have  similar  integrals  for  the  direct 
and  inverse  transforms,  we  might  reasonably  wonder  whether  an  inverse  Laplace 
transform  could  be  given  by  an  integral.  To  discuss  this,  we  repeat  here  for  conve¬ 
nience  the  definitions  of  Laplace  and  Fourier  transforms. 


(7.9) 


pOO 

L(f)=  /  ,f(t)e~ptdt  =  F(p) 
Jo 


(7.10) 


/OO 

g(a)eiaxda 

-OO 

#(«)  =  J  f(x)e~lOLXdx 


If  we  compare  the  Laplace  transform  (7.9)  with  the  Fourier  transform  [ g(a )  in 
(7.10)],  we  observe  that  if  p  were  imaginary,  the  integrals  would  be  almost  the 
same.  This  suggests  that  we  should  consider  complex  p ,  and  that  the  integral  we 
want  for  the  inverse  Laplace  transform  might  be  an  integral  in  the  complex  p  plane 
(that  is,  a  contour  integral).  Let’s  investigate  this  idea. 

In  the  definition  (7.9)  of  the  Laplace  transform  of  /(f),  let  p  be  complex,  say 
p=  z  =  x+iy.  (Note  that  this  possibility  has  already  been  considered  in  Chapter  8.) 
Then  (7.9)  becomes 


(7.11) 


F(p)  =  F{z)  =  F(x  +  iy)=  / 

Jo 


o~Pt 


f(t)  dt 


poo  poo 

/  e~{x+iy)tf(t)dt=  /  e~xtf(t)e~iyt  dt,  x  =  Rep>k. 
Jo  Jo 


[Recall  (Chapter  8,  Section  8)  that  we  must  have  some  restriction  on  Rep  to  make 
the  integral  converge  at  infinity.  The  restriction  depends  on  what  the  function 
/(f)  is,  but  is  always  of  the  form  Rep  >  k,  for  some  real  k,  as  you  can  see  in 
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the  table  of  Laplace  transforms,  pages  469-471.]  Now  (7.11)  is  of  the  form  of  a 
Fourier  transform.  To  see  this,  compare  (7.11)  with  (7.10)  making  the  following 
correspondences:  e~xyt  dt  corresponds  to  e~lax  dx ,  that  is,  y  corresponds  to  a  and 
t  to  x  [the  x  in  (7.11)  is  just  a  constant  parameter  in  this  discussion];  the  function 


(7.12) 


e~xtf(t),  t>  0, 
0,  t<  0, 


corresponds  to  f(x)  in  (7.10)  and  F(p)  =  F(x  +  iy )  corresponds  to  g{a)\  and  finally, 
we  recall  that  the  1/(27t)  factor  may  be  in  either  integral  in  (7.10).  Then  assuming 
that  <j>(t)  satisfies  the  required  conditions  for  a  function  to  have  a  Fourier  transform 
[see  Chapter  7,  Section  12:  Dirichlet  conditions,  and  /_  \(f>(t)\dt  finite],  we  can 

write  the  inverse  transform  to  get 

1  r°° 

(7.13)  (j)(t)  =  —  J  F(x  +  iy)elyt  dy. 

Using  the  definition  (7.12)  of  </>(£),  we  find 

i  r°°  i  f,QG 

(7.14)  f(t)  =  ext  ■  —  J  F(x  +  iy)eiytdy  =  —  J  F(x  +  iy)e^‘^t  dy 

for  t  >  0.  Since  x  is  constant,  say  x  =  c,  we  have  dz  =  d{x  +  iy)  =  i  dy,  and  we  can 
write  (7.14)  as 


(7.15) 


-|  /*C+200 

f(t)=—  F(z)ezt  dz, 

ZTTl  J q — 200 


t  >  0, 


where  the  notation  means  (see  Problem  3.4)  that  we  integrate  along  a  vertical  line 
x  =  c  in  the  2  plane.  [This  can  be  any  vertical  line  on  which  x  =  c  >  k  as  required 
by  the  restriction  on  Rep  in  (7.11).]  The  integral  (7.15)  for  the  inverse  Laplace 
transform  is  known  as  the  Bromwich  integral. 

We  would  like  to  use  contour  integration  and  the  residue  theorem  to  evaluate 
f(t)  in  (7.15)  for  a  given  F(p)  [which  we  call  F{z)  since  we  consider  complex  p\. 
In  Examples  2  and  3,  we  have  evaluated  integrals  along  a  straight  line  (the  x  axis) 
by  considering  the  contour  made  up  of  the  x  axis  and  a  large  semicircle  inclosing 
the  upper  half  plane.  If  we  rotate  this  contour  90°,  we  have  a  contour  consisting  of 
a  vertical  straight  line  and  a  semicircle  inclosing  a  left  half-plane  (that  is,  the  area 
to  the  left  of  x  =  c ).  Let’s  use  this  contour  to  evaluate  (7.15).  We  restrict  F(z)  to 
be  of  the  form  P(z)/Q(z)  with  P(z)  and  Q(z)  polynomials,  and  Q(z)  of  degree  at 
least  one  higher  than  P{z)  (compare  the  conditions  in  Example  3).  Then  it  can  be 
shown  that,  as  in  Examples  2  and  3,  the  integral  along  the  semicircle  tends  to  zero 
as  the  radius  tends  to  infinity.  Thus  the  integral  along  the  straight  line  is  equal  to 
27t*  times  the  sum  of  the  residues  of  F(z)ezt  at  its  poles,  or,  cancelling  the  factor 
2ni  in  (7.15), 

(7.16)  f(t)  =  sum  of  residues  of  F{z)ezt  at  all  poles. 

We  must  include  all  poles  in  (7.16);  to  see  this  we  can  argue  as  follows.  We  know 
that  (7.15)  is  true  for  any  value  of  c  >  k.  Suppose  we  use  a  value  of  c  which  is 
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large  enough  so  that  all  poles  lie  to  the  left  of  x  =  c;  then  we  know  that  our  answer 
is  correct.  Turning  the  argument  around,  we  can  say  that  since  we  would  get  a 
different  answer  if  we  did  not  take  x  =  c  to  the  right  of  all  poles,  we  must  integrate 
along  a  vertical  line  such  that  all  poles  of  F(z)ezt  are  included  in  the  contour  to 
the  left  of  the  line. 


Example  7.  Find  the  inverse  transform  of  F(p)  = 


(p+2)(p2  +  1)' 


We  first  find  the  poles  of  F{z)ezt  and  factor  the  denominator  to  get 


F(z)ezt  = 


5e2 


(z  +  2)(z  +  i)(z  —  i ) 

Evaluating  the  residues  at  the  three  simple  poles  (Section  6,  method  B),  we  find 

5e-24 


residue  at  2  =  —2 
residue  at  z  =  i 
residue  at  z  =  —  i 
Then  by  (7.16)  we  have 

m  =  e~2t  + 


is 


is 


is 


=  e 


-2 1 


be1 


(2  +  i){2i) 
5e-it 

(2-i)(— 2i) 


_2t  ,  beit(2  —  i)  —  5e_i4(2  +  i) 


=  e  24  +  2 sin t  —  cost. 


(2  +  i)(2-i)(2i) 


Dispersion  relations  Consider 


/(*) 


dz  around  the  upper  half  plane  as  in 


J  z  —  a 

Problem  21.  Let  a  be  real.  Let  f(z)  be  analytic  for  y  >  0,  and  — >  0  rapidly  enough 
at  oo  so  that  the  integral  around  the  semicircle  in  the  upper  half  plane  — >  0  as  the 
radius  of  the  semicircle  — >  oo.  Then  by  Example  4  and  Problem  (21b)  we  get 


(7.17) 


^  ^  dx  =  inf  (a), 
x  —  a 


Now  we  write  f{x)  =  u{x)  +  iv(x),  and  take  real  and  imaginary  parts  of  (7.17): 

(7.18)  PV  [  dx  =  -nv(a ),  Pvf  ^-dx  =  nu(a). 

J-oo  x  —  a  J -oo  x  —  a 

These  (and  similar  integrals  relating  the  real  and  imaginary  parts  of  a  function 
satisfying  the  given  conditions)  are  called  dispersion  relations.  From  them,  you  can 
find  the  Kramers- Kronig  relations  (see  Problem  66)  named  for  the  two  people  who 
developed  similar  relations  involving  the  complex  index  of  refraction  for  light.  (Light 
traveling  through  a  material  medium  is  both  refracted  and  absorbed.  The  real  part 
of  the  complex  index  of  refraction  is  related  to  refraction  and  the  imaginary  part 
to  absorption.)  These  formulas  have  widespread  applications,  to  optics,  electricity, 
solid  state,  elementary  particle  theory,  quantum  mechanics,  etc. 

The  integrals  in  (7.18)  are  called  Hilbert  transforms ,  and  (7.18)  may  be  stated 
in  the  form:  u(x)  and  v(x )  are  Hilbert  transforms  of  each  other.  Compare  Fourier 
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transforms  (Chapter  7,  Section  12),  or  a  Laplace  transform  and  the  corresponding 
Bromwich  integral.  In  each  case  two  functions  have  the  property  that  each  is  given 
by  an  integral  involving  the  other.  This  is  what  an  integral  transform  means,  and 
there  are  other  integral  transforms  which  you  may  discover  in  tables  or  computer. 


PROBLEMS,  SECTION  7 


The  values  of  the  following  integrals  are  known  and  can  be  found  in  integral  tables  or  by 
computer.  Your  goal  in  evaluating  them  is  to  learn  about  contour  integration  by  applying 
the  methods  discussed  in  the  examples  above.  Then  check  your  answers  by  computer. 


1. 


3. 

5. 

7. 

9. 


11. 


13. 

15. 

17. 

19. 


dd 

13  +  5  sin  9 
d9 

5  —  4  sin  9 
d9 

1  —  2r  cos  9  +  r2 

cos  2 9  d9 
5  +  4  cos  9 

d9 

1  +  sin  9  cos  a 
dx 

(4a:2  +  l)3 
x 2  dx 

(x2  +  4)(a;2  +  9) 

cos  2a:  dx 
9a;2  +  4 

x  sin  x  dx 
x2  +  4a:  +  5 

cos  2a:  dx 
(4a:2  +  9)2 


(0  <  r  <  1) 


(a  =  const.) 


2. 

p2tt 

d9 

Jo 

5  —  3  cos  9 

4. 

p2tt 

sin2  9  d9 

Jo 

5  +  3  cos  9 

6. 

PIT 

d9 

Jo 

(2  +  cos  9)2 

8. 

P7T 

sin2  9  d9 

Jo 

13-  12  cos  9 

10. 

POO 

dx 

J  —  oo 

x2  +  4a:  +  5 

12. 

POO 

x2  dx 

Jo 

a:4 +  16 

14. 

POO 

sin  x  dx 

J  —oo 

x2  +  4a:  +  5 

16. 

POO 

x  sin  x  dx 

Jo 

9x 2  +  4 

18. 

POO 

cos  7 rx  dx 

Jo 

1  +  X2  +  X4 

20. 

POO 

cos  x  dx 

Jo 

(1  +  9a;2)2 

21.  In  Example  4  we  stated  a  rule  for  evaluating  a  contour  integral  when  the  contour 
passes  through  simple  poles.  We  proved  that  the  result  was  correct  for 


f  €?Z 

PV  — dz 
Jr  2 


around  the  contour  T  shown  here. 

(a)  By  following  the  same  method  (integrating  around  C'  of  Figure  7.3  and  letting 
r  — >  0)  show  that  the  result  is  correct  if  we  replace  elz  by  any  f(z)  which  is  analytic 
at  2  =  0. 

(b)  Repeat  the  proof  in  (a)  for 

PV  [  dz.  a  real 

Jr  V  ~  al 

(that  is,  a  pole  on  the  x  axis),  with  f(z)  analytic  at  z  =  a. 
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Using  the  rule  of  Example  4  (also  see  problem  21),  evaluate  the  following  integrals.  Find 
principal  values  if  necessary. 


22. 

24. 

26. 

28. 

30. 


/ 


dx 


-oo  {x-l){x2  +  1) 


/OO 

x  sin  7vx  . 


/ 


xdx 


L 


—  oo  (*-  !)4  -  1 

dx 


1  —  x4 


23. 


25. 


27. 


29. 


-oo  (x2+4)(2-x) 


dx 


(a)  By  the  method  of  Example  2  evaluate 


(b) 


/ 

I 

,oo  c 

Jo  1 

r°°  si 

Jo 

F 

Jo 


dx 


9x2  —  7T2 


—  4a:2 


da; 


■  dx 


dx 

1  +  x4 


Evaluate  the  same  integral  by  using  tables  or  computer  to  get  the  indefinite 
integral;  unless  you  are  very  careful  you  may  get  zero.  Explain  why. 

(c)  Make  the  change  of  variables  u  =  x4  in  the  integral  in  (a)  and  evaluate  the  u 
integral  using  (7.5). 


31. 


Use  the  method  of  Problem  30(c)  to  evaluate 


dx 

1  +  x6 


32.  Use  the  method  of  Problem  30(c)  and  the  contour  and  method  of  Example  5  to 
dx 

(1  +  a:4)2 ' 

Evaluate  the  following  integrals  by  the  method  of  Example  5. 


evaluate 


/ 


33. 

35. 


\fxdx 
1  +  a:2 

a;1/3  dx 
(1  +  x)(2  +  x) 


34. 

36. 


r°°  yfxdx 

J 0  (1 TW2 
f°  — 

Jo  x*/4{l  +  x)ax 


37. 

y 

2j7T 

iir 

-/ 

1  0 

A 

(a)  Show  that 

epx  ,  7T 

- dx  = - 

1  +  ex  sm  np 

for  0  <  p  <  1.  Hint:  Find  f  epz  dz/{l  +  ez )  around  the  rectangular  contour 
shown.  Show  that  the  integrals  along  the  vertical  sides  tend  to  zero  as  A  — >  oo. 
Note  that  the  integral  along  the  upper  side  is  a  multiple  of  the  integral  along 
the  a:  axis. 


(b)  Make  the  change  of  variable  y  =  ex  in  the  x  integral  of  part  (a),  and  using 
(6.5)  of  Chapter  11,  show  that  this  integral  is  the  beta  function,  B(p,  1  —  p). 
Then  using  (7.1)  of  Chapter  11,  show  that  r(p)r(l  —  p)  =  7r/sin7rp. 


Section  7 


Evaluation  of  Definite  Integrals  by  Use  of  the  Residue  Theorem  701 


38.  Using  the  same  contour  and  method  as  in  Problem  37a  evaluate 


0  <  p  <  1. 


Hint:  The  only  difference  between  this  problem  and  Problem  37a  is  that  you  now 
have  two  simple  poles  on  the  contour  instead  of  a  pole  inside.  Use  the  rule  of 
Example  4. 


39. 


Evaluate 


2-kx/Z 

- dx. 

cosh  TYX 


Hint:  Use  a  rectangle  as  in  Problem  37a  but  of  height  1  instead  of  27t.  Note  that 
there  is  a  pole  at  i/2. 


40.  Evaluate 

r  xdx 
J  Q  sinh  x 

Hint:  First  find  the  — oo  to  oo  integral.  Use  a  rectangle  of  height  n  and  note  the 
simple  pole  at  in  on  the  contour. 

41.  The  Fresnel  integrals,  sin  u2  du  and  cos  u2  du,  are  important  in  optics.  For 

the  case  of  infinite  upper  limits,  evaluate  these  integrals  as  follows:  Make  the 
change  of  variable  x  =  u2;  to  evaluate  the  resulting 
integrals,  find  z~1^2elz  dz  around  the  contour  shown. 

Let  r  — >  0  and  R  — >  oo  and  show  that  the  integrals 
along  these  quarter-circles  tend  to  zero.  Recognize  the 
integral  along  the  y  axis  as  a  P  function  and  so  eval¬ 
uate  it.  Hence  evaluate  the  integral  along  the  x  axis; 
the  real  and  imaginary  parts  of  this  integral  are  the 
integrals  you  are  trying  to  find. 

42.  If  F(z)  =  f'(z)/f(z), 

(a)  show  that  the  residue  of  F(z)  at  an  nth  order  zero  of  f(z),  is  n.  Hint:  If  f(z) 
has  a  zero  of  order  n  at  z  =  a,  then 

f(z)  =  an(z  -  a)"  +  an+i(z  -  a)n+1  -\ - . 


(b)  Also  show  that  the  residue  of  F(z)  at  a  pole  of  order  p  of  f(z),  is  —  p.  Hint: 

See  the  definition  of  a  pole  of  order  p  at  the  end  of  Section  4. 

43.  By  using  theorem  (7.8),  show  that  z3  +  z2  +  9  =  0  has  exactly  one  root  in  the  first 
quadrant.  Hence  show  that  it  has  one  root  in  the  fourth  quadrant  and  one  on  the 
negative  real  axis.  Hint :  See  Example  6. 

44.  The  fundamental  theorem  of  algebra  says  that  every  equation  of  the  form  f(z)  = 
anzn  +  an-iz71-1  +  •  •  •  +  ao  =  0,  a„  ^  0,  n  >  1,  has  at  least  one  root.  From  this  it 
follows  that  an  nth  degree  equation  has  n  roots.  Prove  this  by  using  the  argument 
principle.  Hint:  Follow  the  increase  in  the  angle  of  f(z)  around  a  very  large  circle 
2  =  re*8;  for  sufficiently  large  r,  all  roots  are  inclosed,  and  f(z)  is  approximately 
a„zn. 

As  in  Problem  43  find  out  in  which  quadrants  the  roots  of  the  following  equations  lie: 


45.  z3  +  z2  +  z  +  4  =  0  46.  z3  +  3z2  +  4z  +  2  =  0 

47.  z3  +  4z2  +  12  =  0  48.  z4  -  z3  +  6z2  -  3z  +  5  =  0 
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49.  z4  -  4z3  +  llz2  -  14z  +  10  =  0  50.  z4  +  z3  +  4z2  +  2z  +  3  =  0 

51.  Use  (7.8)  to  evaluate 


/'(*) 


dz,  where  f(z)  — 


c  f(z) 

around  the  circle  \z\  =  2;  around  \z\  =  1. 

z3  dz 


z3(z  +  l)2  sin  z 
(z2  +  l)2(z  —  3)  ’ 


1  +  2z4 


52.  Use  (7.8)  to  evaluate 

53.  Use  (7.8)  to  evaluate  ® 

v  ’  J  z4  +  8z2  +  16 

54.  Use  (7.8)  to  evaluate 


around  Izl  =  1. 


z3  +4z 


dz  around  the  circle  |z  —  2i|  =  2. 


sec2(z/4)  dz 
c  1  —  tan(z/4)  ’ 

where  C  is  the  rectangle  formed  by  the  lines  y  =  ±1,  x  =  ±|7r. 

Find  the  inverse  Laplace  transform  of  the  following  functions  by  using  (7.16). 


55. 

57. 

60. 

63. 

66. 


Hint:  Use  (6.2). 

56. 

1 

p4  +  4 

p4  -  1 

P+1 

58. 

P3 

59. 

3 P2 

p(p2  +  1) 

p4  —  16 

p3  +  8 

1 

61. 

P5 

62. 

(P-1)2 

P2{P+  1) 

p6  —  64 

p(p+  l)2 

P 

64. 

P2 

65. 

P 

p4  —  1 

(P2  -  1)(P2  - 

4) 

(p+  l)(p2  +4) 

In  equation  (7.18),  let  u(x) 

be  an  even  function  and 

v(x )  be  an  odd  function. 

(a)  If  f(x)  =  u(x)+iv(x),  show  that  these  conditions  are  equivalent  to  the  equation 
/*(*)  =  /(-*)■ 

(b)  Show  that 


nu(a)  =  PV 


r 


2xv(x) 


dx, 


nv(a)  =  —PV 


f 


2au(x) 


dx. 


These  are  the  Kramers-Kronig  relations.  Hint:  To  find  u(a ),  write  the  integral  for  u(a) 
in  (7.18)  as  an  integral  from  — oo  to  0  plus  an  integral  from  0  to  oo.  Then  in  the  — oo  to 
0  integral,  replace  x  by  —x  to  get  an  integral  from  0  to  oo,  and  use  v(—x)  =  —v{x).  Add 
the  two  0  to  oo  integrals  and  simplify.  Similarly  find  v(a). 


8.  THE  POINT  AT  INFINITY;  RESIDUES  AT  INFINITY 

It  is  often  useful  to  think  of  the  complex  plane  as  corresponding  to  the  surface  of 
a  sphere  in  the  following  way.  In  Figure  8.1,  the  sphere  is  tangent  to  the  plane  at 
the  origin  O.  Let  O  be  the  south  pole  of  the  sphere,  and  N  be  the  north  pole  of 
the  sphere.  If  a  line  through  N  intersects  the  sphere  at  P  and  the  plane  at  Q ,  we 
say  that  the  point  P  on  the  sphere  and  the  point  Q  on  the  plane  are  correspond¬ 
ing  points.  Then  we  have  a  one-to-one  correspondence  between  points  on  the  sphere 
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(except  N)  and  points  of  the  plane  (at 
finite  distances  from  O ) .  Imagine  point 
Q  moving  farther  and  farther  out  away 
from  O'  then  P  moves  nearer  and  nearer 
to  AT.  If  z  =  x  +  iy  is  the  complex 
coordinate  of  Q ,  then  as  Q  moves  out 
farther  and  farther  from  O,  we  would 
say  z  — »  oo.  It  is  customary  to  say  that 
the  point  N  corresponds  to  the  point  at 
infinity  in  the  complex  plane.  Observe 
that  straight  lines  through  the  origin  in 


Figure  8.1 


the  plane  correspond  to  meridians  of  the  sphere.  The  meridians  all  pass  through 
both  the  north  pole  and  the  south  pole.  Corresponding  to  this,  straight  lines  through 
the  origin  in  the  complex  plane  pass  through  the  point  at  infinity.  Circles  in  the 
complex  plane  with  center  at  O  correspond  to  parallels  of  latitude  on  the  sphere. 
This  mapping  of  the  complex  plane  onto  a  sphere  (or  the  mapping  of  the  sphere 
onto  a  tangent  plane)  is  called  a  stereographic  projection. 

To  investigate  the  behavior  of  a  function  at  infinity,  we  replace  z  by  1/z  and 
consider  how  the  new  function  behaves  at  the  origin.  We  then  say  that  infinity  is 
a  regular  point,  a  pole,  etc.,  of  the  original  function,  depending  on  what  the  new 
function  does  at  the  origin.  For  example,  consider  z 2  at  infinity;  1/z2  has  a  pole  of 
order  2  at  the  origin,  so  z2  has  a  pole  of  order  2  at  infinity.  Or  consider  e1/2;  since 
ez  is  analytic  at  z  =  0,  e1/2  is  analytic  at  oo. 

Next  we  want  to  see  how  to  find  the  residue  of  a  function  at  oo.  To  do  this,  we 
want  to  replace  z  by  1/z  and  work  around  the  origin.  In  order  to  keep  our  notation 
straight,  let  us  use  two  variables,  namely  Z  which  takes  on  values  near  oo,  and 
z  =  1/Z  which  takes  on  values  near  0.  The  residue  of  a  function  at  oo  is  defined  so 
that  the  residue  theorem  holds,  that  is 


(8.1)  j)  f(Z)  dZ  =  2iri  ■  (residue  of  f(Z)  at  Z  =  oo) 

if  C  is  a  closed  path  around  the  point  at  oo  but  inclosing  no  other  singular  points. 
Now  what  does  it  mean  to  integrate  “around  oo”?  Recall  that  we  have  agreed  to 
traverse  contours  so  that  the  area  inclosed  always  lies  to  our  left.  The  area  we  wish 
to  “inclose”  is  the  area  “around  oo”;  if  C  is  a  circle,  this  area  would  lie  outside 
the  circle  in  our  usual  terminology.  Figure  8.1  may  clarify  this.  Imagine  a  small 
circle  about  the  north  pole;  the  area  inside  this  circle  (that  is,  the  area  including 
N)  corresponds  to  points  in  the  plane  which  are  outside  a  large  circle  C.  We  must 
go  around  C  in  the  clockwise  direction  in  order  to  have  the  area  “around  oo”  to 
our  left.  Note  that  if  Z  =  Rel& ,  then  in  going  clockwise  around  C,  we  are  going  in 
the  direction  of  decreasing  0.  Let  us  make  the  following  change  of  variable  in  the 
integral  (8.1): 

Z  =  — ,  dZ  = - -  dz. 

z  zz 

If  Z  =  Rez®  traverses  a  circle  C  of  radius  R  in  the  direction  of  decreasing  0, 
then  z  =  1/Z  =  (1  / R)e~l&  =  re 16  traverses  a  circle  C'  of  radius  r  =  1/R  in  the 
counterclockwise  direction  (that  is,  9  =  —0  increases  as  0  decreases).  Thus  (8.1) 
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becomes 

(8.2)  j)  —  dz  =  2ni  •  residue  of  f(Z)  at  Z  =  oo. 

The  integral  in  (8.2)  is  an  integral  about  the  origin  and  so  can  be  evaluated  by 
calculating  the  residue  of  (—  l/z2)f(l/z)  at  the  origin.  (There  are  no  other  singular 
points  of  f(l/z)  inside  C'  because  we  assumed  that  there  were  no  singular  points 
of  f(Z)  outside  C  except  perhaps  oo.)  Thus  we  have 


(8.3)  (residue  of  f(Z)  at  Z  =  oo) 


(  residue  of  —  / 


and  we  can  use  the  methods  we  already  know  for  computing  residues  at  the  origin. 
Note  that  a  function  may  be  analytic  at  oo  and  still  have  a  residue  there. 


Example.  f(Z)  =  1/Z  is  analytic  at  oo  because  z  is  analytic  at  the  origin.  But  the 
residue  of  f(Z)  =  1/Z  at  Z  =  oo  is 


-( 


residue  of 


z  at  z  =  0  =  —  1. 


PROBLEMS,  SECTION  8 


1.  Let  f(z)  be  expanded  in  the  Laurent  series  that  is  valid  for  all  z  outside  some  circle, 
that  is,  \z\  >  M  (see  Section  4).  This  series  is  called  the  Laurent  series  “about 
infinity.”  Show  that  the  result  of  integrating  the  Laurent  series  term  by  term  around 
a  very  large  circle  (of  radius  >  M)  in  the  positive  direction,  is  2nibi  (just  as  in  the 
original  proof  of  the  residue  theorem  in  Section  5).  Remember  that  the  integral 
“around  oo  ”  is  taken  in  the  negative  direction,  and  is  equal  to  2ni- (residue  at  oo). 
Conclude  that  R( oo)  =  —bi .  Caution:  In  using  this  method  of  computing  R( oo), 
be  sure  you  have  the  Laurent  series  that  converges  for  all  sufficiently  large  2. 

2.  (a)  Show  that  if  f(z)  tends  to  a  finite  limit  as  z  tends  to  infinity,  then  the  residue 

of  f(z)  at  infinity  is  limz_,oo  z2  f'{z). 

(b)  Also  show  that  if  f(z)  tends  to  zero  as  2  tends  to  infinity,  then  the  residue  of 
f(z)  at  infinity  is  —  lim^oo  2/(2). 

Find  out  whether  infinity  is  a  regular  point,  an  essential  singularity,  or  a  pole  (and  if  a 
pole,  of  what  order)  for  each  of  the  following  functions.  Using  Problem  1,  or  Problem  2, 
or  (8.3),  find  the  residue  of  each  function  at  infinity.  Check  your  results  by  computer. 


3. 

z 

4. 

22  +  3 

5. 

.  1 

6. 

22  +  5 

z2  +  1 

(2  +  2)2 

z 

z 

7. 

423  +  22  +  3 

8. 

22  +2 

9. 

z2  -  1 

10. 

1  +  Z 

22 

322 

22  +  1 

1-2 

11. 

1 

tan  — 

2 

12. 

1  Z  +  1 

ln  1 

z  —  1 

13.  Give  another  proof  of  the  fundamental  theorem  of  algebra  (see  Problem  7.44)  as 
follows.  Let  /  =  f  (£)/ f(z)  dz  about  infinity,  that  is,  in  the  negative  direction 
around  a  very  large  circle  C.  Use  the  argument  principle  (7.8),  and  also  evaluate 
I  by  finding  the  residue  of  f{z)/f(z)  at  infinity;  thus  show  that  f(z)  has  n  zeros 
inside  C. 
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Evaluate  the  following  integrals  by  computing  residues  at  infinity.  Check  your  answers  by 
computing  residues  at  all  the  finite  poles.  (It  is  understood  that  f  means  in  the  positive 
direction.) 


14. 


1  —  z2  dz 
1  +  z2  z 


around  \z\  =  2. 


15. 


(2z  +  l)(z2  +  9) 


around  \z\  =  5. 


16.  Observe  that  in  Problems  14  and  15  the  sum  of  the  residues  at  finite  points  plus  the 
residue  at  infinity  is  zero.  Prove  that  this  is  always  true  for  a  function  which  has  a 
finite  number  of  singularities. 


9.  MAPPING 

We  often  find  it  useful  to  sketch  a  graph  of  a  given  function  y  =  f{x)  of  a  real 
variable  x.  Imagine  trying  to  make  a  similar  sketch  for  a  function  w  =  /(z)  of  a 
complex  variable  z.  We  need  a  plane  to  plot  values  of  z  and  another  plane  to  plot 
values  of  w  =  /(z),  that  is,  we  need  a  four-dimensional  space.  Lacking  this,  we 
must  resort  to  a  different  method.  Imagine  trying  to  “graph”  y  =  f(x)  using  only 
two  straight  lines,  but  not  a  plane.  A  “graph”  of  y  =  x2  might  look  like  Figure  9.1. 
Given  a  point  on  the  x  axis,  we  can  locate  a  corresponding  point  y  =  f(x )  on  the  y 
axis  and  label  the  two  points  with  the  same  letter  to  indicate  this  correspondence. 
(Note  that  to  finish  our  “graph,”  we  really  need  a  second  positive  y  axis  to  hold  the 
y  points  corresponding  to  negative  values  of  £.) 

ABODE  F 

- 1  I  !  L  1 _ i _ I _ I _ x 

0  1  1  V?  2  3  4  5 

A  B  C  D  E  F 

I  i  i  i  I  I  I  I  I  I  I  I  i  I 

0^123456789  10 

Figure  9.1 

Now  consider  a  similar  method  of  representing  a  function  of  a  complex  variable 
w  =  /(z).  We  use  a  z  plane  and  a  w  plane;  a  given  point  in  the  z  plane  (that  is,  a 
value  of  z)  determines  a  corresponding  value  of  w,  that  is,  a  point  in  the  w  plane. 
The  pair  of  points,  one  z  and  one  w,  are  called  images  of  each  other.  Although 
we  could  label  pairs  of  corresponding  z  and  w  points  (as  we  did  corresponding  x 
and  y  points  in  Figure  9.1),  it  is  usually  more  interesting  to  sketch  corresponding 
curves  or  regions  in  the  two  planes.  The  correspondence  between  a  point  (or  curve 
or  region)  in  the  z  plane,  and  the  image  point  (or  curve  or  region)  in  the  w  plane, 
is  called  a  mapping  or  a  transformation. 


Example  1.  Consider  the  function  w  =  i  +  zeI7r/4,  and  let  us  map  the  grid  of  coordinate 
lines  x  =  const.,  y  =  const,  (z  plane  in  Figure  9.2)  into  the  w  plane.  You  may 
be  able  to  see  at  once  that  this  transformation  amounts  to  a  rotation  of  the  grid 
through  an  angle  of  7r/4  (since  ze771’/4  =  7~eHe+'7r/4) )  plus  a  translation  i  (the  image 
of  z  =  0  is  ui  =  i),  giving  the  result  shown  in  the  w  plane  in  Figure  9.2.  Alternatively, 
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w  =  i  +  ze l7r/4  =  i  +  (x  +  iy)  ^cos  ^  +  i  sin  ^ 
1  +  i\  x  —  y 


=  i  +  (x  +  iy) 
Since  w  =  u  +  iv,  we  have 
(9.1) 


V2  )  V2 


+  i  1  + 


x  +  y 

V2 


x-y 


v  =  1  + 


x  +  y 


V2  ’  V2  ' 

Then  (eliminating  x  and  y  in  turn),  we  have 

(9.2)  u  —  v  =  —  1  —  yV%,  u  +  v  =  1  +  xy/2. 


The  image  of  the  x  axis  (y  =  0)  is,  from  the  first  equation  in  (9.2),  u  —  v=  —1;  the 
image  of  the  y  axis  (x  =  0)  is,  from  the  second  equation  in  (9.2),  u  +  v  =  1.  Plotting 
these  lines  in  the  w  plane,  and  also  plotting  the  images  of  x  =  ±\/2,  x  =  ±2y/2, 
y  =  ±\/2,  y  =  ±2\/2  [using  the  equations  (9.2)],  we  get  Figure  9.2.  (Verify  that 
the  shaded  squares  are  images  of  each  other.) 

If  the  elimination  [to  get  (9.2)]  is  not  easy,  we  can  use  equations  (9.1)  directly. 
Suppose  that  we  want  the  image  of  y  =  0.  With  y  =  0,  equations  (9.1)  become 
u  =  x/y/2,  v  =  1  +  x/\/2;  these  are  a  pair  of  parametric  equations  for  a  curve  in  the 
(u,v)  plane,  with  x  as  the  parameter.  Similarly,  to  find  the  image  of  x  =  const., 
we  substitute  the  value  of  x  into  (9.1);  we  then  have  a  pair  of  parametric  equations 
with  y  as  the  parameter. 

Note  that  we  could  just  as  easily  have  found  the  images  in  the  z  plane  of  the  lines 
u  =  const.,  and  v  =  const.  For  example,  letting  u  =  0  in  (9.1),  we  get  x  —  y  =  0; 
the  image  of  the  v  axis  (u  =  0)  is  the  45°  line  in  the  (x,  y)  plane.  (We  might  have 
guessed  that  going  back  to  the  z  plane  would  involve  a  rotation  through  —  45°. )  In 
any  given  problem,  we  may  start  with  simple  curves  (or  regions)  in  either  the  z 
plane  or  the  w  plane,  and  find  their  images  in  the  other  plane. 
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Example  2.  Let  us  map  the  coordinate  grid  u  =  const.,  v  =  const.,  into  the  z  plane  by 
the  function  w  =  z2.  We  have 

w  =  z2  =  {x  +  iy)2  =  x2  -  y2  +  2ixy , 

2  2  o 

u  =  x  —  y  ,  v  =  Zxy. 

Then  the  images  of  u  =  const,  are  hyperbolas  x2  —  y2  =  const.,  and  the  images  of 
v  =  const,  are  also  hyperbolas  xy  =  const.  (Figure  9.3).  Alternatively,  we  could 
map  the  lines  x  =  const.,  y  =  const,  into  the  w  plane  (Problem  1);  this  gives  two 
sets  of  parabolas  in  the  (u,  v )  plane.  Accurate  graphs  can  be  obtained  by  computer. 


Example  3.  Let  us  consider  still  another  useful  way  of  discussing  the  mapping  by  w  =  z2. 
Using  polar  coordinates,  we  have 

(9.4)  z  =  reie,  w  =  z2  =  r2e2i e. 

Consider  the  region  inside  the  circle  r  =  1  in  the  (x,y)  plane.  If  r  =  1  in  (9.4),  we 
have  2  =  e*e,  w  =  e2lB .  The  angle  of  w  is  twice  the  angle  of  z;  thus  the  first-quadrant 
part  of  the  area  inside  the  circle  r  =  1  in  the  z  plane  maps  into  a  semicircular  area 
in  the  w  plane  as  indicated  by  the  shading  in  Figure  9.4.  The  second  quadrant  of 
the  z  plane  disk  ( 9  between  tt/2  and  n)  maps  into  the  lower  half  of  the  disk  in  the 
w  plane  (angle  of  w  between  7r  and  2n)  as  indicated.  We  have  now  used  up  the 
whole  area  of  the  disk  in  the  w  plane  and  only  half  of  the  z  plane  disk.  (Compare 
Figure  9.1  and  the  comment  about  a  second  y  axis.)  In  order  to  have  a  one-to-one 
correspondence  between  points  in  the  z  plane  and  their  images  in  the  w  plane,  we 
draw  a  second  w  plane  (w  plane  II  in  Figure  9.4)  to  contain  the  images  of  points  in 
the  lower  half  of  the  z  plane.  (Convince  yourself  that  the  two  lower  quarter-disks  in 
the  z  plane  and  their  images  in  w  plane  II  are  correctly  indicated  by  the  shading.) 
We  agree  that  as  we  reach  the  angle  2ir  in  w  plane  I,  we  go  over  to  w  plane  II, 
and  as  we  reach  the  angle  47r  in  w  plane  II,  we  go  back  to  w  plane  I.  The  two  w 
planes  joined  in  this  way  are  called  a  Riemann  surface ;  each  plane  is  called  a  sheet 
of  the  Riemann  surface.  Note  that  the  line  along  which  the  sheets  of  the  Riemann 
surface  are  joined  (positive  real  axis  here)  is  a  branch  cut,  and  the  origin  is  a  branch 
point  (see  Example  5,  Section  7).  Here  the  branch  cut  and  Riemann  surface  are  in 
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the  w  plane  because  0  =  y/w  has  two  branches;  for  w  =  yfz,  the  Riemann  surface 
would  be  in  the  2  plane  (as  in  Section  7).  It  is  not  necessary  to  take  the  branch 
cut  along  the  positive  x  axis;  we  can  select  any  27r  interval  for  one  branch  of  y/w , 
for  example  from  —  7r  to  7 r  instead  of  0  to  27r.  A  Riemann  surface  may  have  many 
sheets;  for  example  for  w  =  z5,  there  are  5  sheets,  and  for  w  =  In  2,  there  are  an 
infinite  number.  For  more  detail,  see  complex  variables  texts. 


Conformal  Mapping  We  have  been  discussing  mappings  or  transformations.  We 
used  the  term  transformation  in  Chapters  3  and  10,  meaning  a  change  of  variables  or 
a  change  of  coordinate  system  or  a  change  of  basis;  let  us  see  the  connection  between 
the  two  discussions.  In  Chapter  10  we  used  only  one  plane  [the  (x,y)  plane];  we 
located  a  point  in  the  (x,y)  plane  by  giving  its  rectangular  coordinates  (x,y),  or 
its  polar  coordinates  (r,  9),  or  some  other  coordinates  (u,v).  In  polar  coordinates, 
the  circles  r  =  const.,  and  the  rays  9  =  const.,  were  sketched  in  the  (x,y)  plane. 
Similarly,  for  any  coordinate  system  (u,v)  (in  Chapter  10,  see  Section  8  and  the 
Section  8  problems),  we  sketched  the  curves  u  =  const.,  v  =  const.,  in  the  (x,y) 
plane.  In  the  complex  variable  language  we  are  now  using,  this  amounts  to  mapping 
the  w  plane  lines  u  =  const.,  v  =  const.,  into  the  2  plane.  In  Chapter  10  we  were 
particularly  interested  in  transformations  to  orthogonal  curvilinear  coordinates.  Let 
us  see  that  any  analytic  function  w  =  f(z)  =  u  +  iv  gives  us  a  transformation  to  an 
orthogonal  coordinate  system  (u,v).  We  have 


(9.5) 


dz  =  dx  +  idy,  dw  =  du  +  idv, 

\dz\2  =  dx2  +  dy2,  \dw\2  =  du 2  +  dv2 . 

Then  the  square  of  the  arc  length  element  in  the  (x,  y)  plane  is 
(9.6)  ds2  =  dx2  +  dy2  =  \dz\2  =  \dw\2  =  (du2  +  dv2). 


Since  there  is  no  dudv  term  in  ds2,  the  ( u,v )  coordinate  system  is  orthogonal 
(Chapter  10,  Section  8).  By  this  we  mean  that  if  we  obtain  u{ x,  y)  and  v(x,  y)  from 
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f{z)  =  u  +  iv  and  plot  the  curves  u(x,y)  =  const.,  v(x,y)  =  const,  in  the  (x,y) 
plane,  we  have  two  sets  of  mutually  orthogonal  curves.  These  are  the  coordinate 
curves  for  the  (it,  t;)  coordinate  systems  as  in  Chapter  10.  If  we  solve  the  equations 
u  =  u(x,  y),  v  =  v(x,  y)  for  x  and  y  in  terms  of  it  and  v,  we  have  the  transformation 
equations  from  the  variables  x,  y  to  the  variables  it,  v  as  in  Problems  8.6  to  8.9  of 
Chapter  10,  and  by  (9.6)  we  know  that  the  coordinate  system  (it,  v)  is  an  orthogonal 
system  [if  f(z)  is  analytic].  We  see  an  example  of  this  in  Figure  9.3  (two  orthogonal 
sets  of  hyperbolas) .  Note  from  (9.6)  that  the  two  scale  factors  in  a  (u,v)  coordinate 
system  obtained  this  way  are  equal. 

Although  we  used  only  one  plane  in  Chapter  10,  for  complex  variables  we  find 
it  useful  to  consider  both  the  z  plane  [that  is  the  (x,  y)  plane]  and  the  w  plane 
[that  is,  the  (it, i;)  plane].  In  the  (x,i/)  plane,  the  arc  length  element  ds  is  given 
by  ds2  =  dx2  +  dy2 .  Similarly,  in  the  (u,v)  plane,  the  arc  length  element  (which 
we  shall  call  dS )  is  given  by  dS2  =  du 2  +  dv2.  From  (9.5)  we  see  that  ds  = 
\dz\  and  dS  =  \dw\.  Then  the  ratio  of 
dS  to  ds  is  \dw/dz\.  Consider  a  point  z 
(and  its  image  w)  at  which  w(z)  is  analytic 
and  dw/dz  is  not  zero.  If  we  stay  near  z, 
the  value  of  dw/dz  is  almost  constant,  and 
the  ratio  dS/ds  is  nearly  constant.  This 
says  that  if  we  consider  a  small  area  in 
the  z  plane  ( ABCD  in  Figure  9.5)  and  its 
image  ( A'B'C'D '  in  Figure  9.5)  in  the  w 
plane,  then 


w  plane 


C' 


O'. 


B' 


A 


plane 


B 


Figure  9.5 


(9.7) 


MB'  B'C'  C'D'  D'M 


AB 


BC 


CD 


DA 


dS_ 

ds 


dw 


dz 


that  is,  the  two  small  areas  are  similar  figures  (since  corresponding  sides  are  propor¬ 
tional).  Because  of  this  property  of  any  mapping  by  an  analytic  function,  we  call  the 
mapping  or  transformation  conformal  (same  form  or  shape) .  Corresponding  angles 
are  equal  (A  =  A' ,  etc.)  and  the  net  result  of  the  transformation  is  to  magnify  (or 
minify)  and  rotate  each  infinitesimal  area.  Note  that  the  conformal  property  is  a 
local  one;  since  the  value  of  dw/dz  changes  from  point  to  point,  each  tiny  bit  of  a 
figure  is  magnified  and  rotated  by  a  different  amount,  and  so  a  large  figure  will  not 
have  the  same  shape  after  mapping.  Also  note  that  we  do  not  have  conformality 
in  the  neighborhood  of  a  point  where  dw/dz  =  0;  for  example,  in  Figure  9.4  a  tiny 
quarter-circle  about  the  origin  in  the  z  plane  maps  into  a  tiny  semicircle  in  the  w 
plane. 


PROBLEMS,  SECTION  9 


In  these  problems  you  should  be  able  to  make  rough  sketches  by  hand,  but  for  accurate 
graphs  use  a  computer. 

1.  Solve  equations  (9.3)  for  x  and  y  in  terms  of  u  and  v.  Use  your  equations  to  sketch 
the  images  in  the  w  plane  of  the  z  plane  lines  x  =  const,  (for  several  values  of  x) 
and  similarly  of  y  —  const. 
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For  each  of  the  following  functions  w  =  f(z)  =  u  +  iv,  find  u  and  v  as  functions  of  x 
and  y.  Sketch  the  graphs  in  the  (x,y)  plane  of  the  images  of  u  =  const,  and  v  =  const, 
for  several  values  of  u  and  several  values  of  v  as  was  done  for  w  =  z2  in  Figure  9.3.  The 
curves  u  =  const,  should  be  orthogonal  to  the  curves  v  =  const. 


2. 


w  = 


z  +  1 
2  i 


4.  w  =  ez 


5. 


z  —  i 
w  = - 

Z  +  1 


6.  w  —  y/z.  Hint:  This  is  equivalent  to  w 2  =  z;  find  x  and  y  in  terms  of  u  and  v  and 
then  solve  the  pair  of  equations  for  u  and  v  in  terms  of  x  and  y.  Note  that  this  is 
really  the  same  problem  as  Problem  1  with  the  z  and  w  planes  interchanged. 


7.  w  =  sin  z  8.  w  =  cosh* 

Describe  the  Riemann  surface  for 


9.  w  =  z3  10-  w  =  yfz  11.  u:  =.ln  z 

12.  If  w  —  f(z)  =  u(x,y)  +  iv(x,y),  f(z)  analytic,  defines  a  transformation  from  the 
variables  x,  y  to  the  variables  u,  v,  show  that  the  Jacobian  of  the  transformation 
(Chapter  5,  Section  4)  is  d(u,v)/d(x,y)  =  \  f\z)\2.  Hint:  To  simplify  the  deter¬ 
minant,  use  the  Cauchy- Riemann  equations  and  the  equations  (Section  2)  used  in 
obtaining  them. 

13.  Verify  the  matrix  equation 

( _  T  ( dx\ 

[dvj  -  J  \dy)  ■' 

where  J  is  a  matrix  whose  determinant  is  the  Jacobian  in  Problem  12.  Multiply  the 
matrix  equation  by  its  transpose  and  use  Problem  12  to  obtain  dS/ds  =  \dw/dz\  as 
in  (9.7). 

14.  We  have  discussed  the  fact  that  a  conformal  transformation  magnifies  and  rotates 
an  infinitesimal  geometrical  figure.  We  showed  that  \dw/dz\  is  the  magnification 
factor.  Show  that  the  angle  of  dw/dz  is  the  rotation  angle.  Hint:  Consider  the 
rotation  and  magnification  of  an  arc  dz  =  dx  +  idy  (of  length  ds  and  angle  arc  tan 
dy/dx )  which  is  required  to  obtain  the  image  of  dz ,  namely  dw. 

15.  Compare  the  directional  derivative  d<p/ds  (Chapter  6,  Section  6)  at  a  point  and 
in  the  direction  given  by  dz  in  the  2  plane,  and  the  directional  derivative  d(j>/dS 
in  the  direction  in  the  w  plane  given  by  the  image  dw  of  dz.  Hence  show  that 
the  rate  of  change  of  T  in  a  given  direction  in  the  z  plane  is  proportional  to  the 
corresponding  rate  of  change  of  T  in  the  image  direction  in  the  w  plane.  (See 
Section  10,  Example  2.)  Show  that  the  proportionality  constant  is  \dw/dz\.  Hint: 
See  equations  (9.6)  and  (9.7). 


10.  SOME  APPLICATIONS  OF  CONFORMAL  MAPPING 

Many  different  physical  problems  require  solution  of  Laplace’s  equation.  We  are 
going  to  show  how  to  solve  a  few  such  problems  by  conformal  mapping.  Much  of 
this  work  can  be  done  by  computer.  But  before  you  can  use  the  computer,  you 
need  to  know  the  basic  theory  behind  the  use  of  conformal  mapping.  Our  purpose 
in  this  section  is  to  learn  this  background.  First  we  consider  a  very  simple  problem 
for  which  we  know  the  answer  from  elementary  physics. 
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Example  1.  In  Figure  10.1,  the  shaded  area  in  the  (u,v)  plane  represents  a  rectangular 
plate.  The  ends  and  faces  of  the  plate  are  insulated,  the  bottom  edge  is  held  at 
temperature  T  =  0°,  and  the  top  edge  at  T  =  100°.  Then  we  know  from  elementary 
physics  that  the  temperature  increases  linearly  from  the  bottom  edge  (v  =  0)  to  the 
top  edge  (v  =  7r),  that  is,  T  =  (100/7r)u  at  any  point  of  the  plate.  Let  us  also  derive 
this  answer  by  a  more  advanced  method.  It  is  known  from  the  theory  of  heat  that 
the  temperature  T  of  a  body  satisfies  Laplace’s  equation  in  regions  where  there  is  no 
source  of  heat.  (See  Chapter  13,  Section  2.)  In  our  problem  we  want  a  solution  of 
Laplace’s  equation  which  satisfies  the  boundary  conditions,  that  is,  T  =  100°  when 
v  =  7r,  T  =  0°  when  v  =  0,  and  dT/du  =  0  on  the  ends  (see  Chapter  13,  Problem 
2.14).  You  should  verify  that  T  =  100v/7r  satisfies  d2T/du 2  +  d2T/dv2  =  0,  and 
satisfies  all  the  boundary  conditions.  Also  note  that  an  easy  way  to  know  that  v 
satisfies  Laplace’s  equation  is  to  observe  that  it  is  the  imaginary  part  of  w  =  u  +  iv, 
and  use  Theorem  IV  of  Section  2  which  says  that  the  real  and  imaginary  parts  of 
an  analytic  function  of  a  complex  variable  satisfy  Laplace’s  equation. 

Now  let  us  use  our  results  to  solve  a  harder  problem. 


Example  2.  Consider  the  mapping  of  the  rectangle  in  the  w  plane  into  the  z  plane  by  the 
function  w  =  In z  (Figure  10.1,  z  plane).  We  have 


(10.1) 


w  =  In  z  =  In  {re18)  =  In  r  +  id  =  u  +  iv, 
u  =  lnr,  v  =  6. 


Then  v  =  0  maps  into  9  =  0,  that  is,  the  positive  x  axis;  v  =  n  maps  into  9  =  n,  that 
is,  the  negative  x  axis  (z  plane,  Figure  10.1).  The  insulated  end  of  the  rectangle 
at  u  =  0  maps  into  lnr  =  0  or  r  =  1;  the  left-hand  end  of  the  rectangle  maps  into 
a  small  semicircle  about  the  origin  which  we  can  think  of  as  a  bit  of  insulation  at 
the  origin  separating  the  0°  and  100°  parts  of  the  x  axis.  (If  the  left-hand  end  of 
the  rectangle  is  at  u  =  —  oo,  we  have  lnr  =  —  oo,  r  =  0,  and  the  image  is  just  the 
origin;  for  finite  negative  u,  the  image  is  a  semicircle  with  r  <  1.)  We  can  now  solve 
the  problem  indicated  by  the  picture  in  the  z  plane  of  Figure  10.1.  A  semicircular 
plate  has  its  faces  and  its  curved  boundary  insulated,  and  has  half  its  flat  boundary 
at  0°  and  the  other  half  at  100°  (with  a  bit  of  insulation  at  the  center).  Find 
the  temperature  T  at  any  point  of  the  plate.  To  solve  this  problem  we  need  only 
transform  our  solution  in  the  (it,  v)  plane  to  the  variables  x,  y  by  using  (10.1).  Thus 
we  find 


T=lOOy=lOO0=lOOarctan^ 

7T  7T  7T  X 


(10.2) 


0  <  9  <  ir. 
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It  is  not  hard  to  justify  our  method;  we  need  to  show  that  our  solution  satisfies 
Laplace’s  equation  and  that  it  satisfies  the  boundary  conditions.  It  is  straightfor¬ 
ward  to  show  (Problem  1)  that  if  a  function  </>(m,  v)  satisfies  Laplace’s  equation 
d2cj)ldu2  +  d2(j)/dv2  =  0,  then  the  function  of  x  and  y  obtained  by  substituting 
u  =  u(x,y),  v  =  v(x,y)  in  <j>  satisfies  Laplace’s  equation  in  x  and  y,  where  u  and 
v  are  the  real  and  imaginary  parts  of  an  analytic  function  w  =  /(-)■  Thus  we 
know  that  (10.2)  satisfies  Laplace’s  equation  (or  in  this  case  you  can  easily  verify 
the  fact  directly).  We  must  also  know  that  the  transformed  T  satisfies  the  bound¬ 
ary  conditions;  this  is  where  conformal  mapping  is  so  useful.  Observe  in  Figure 
10.1  that  we  had  a  transformation  which  took  the  boundaries  of  a  simple  region 
(a  rectangle)  for  which  we  knew  the  solution  of  the  temperature  problem,  into  the 
boundaries  of  a  more  complicated  region  for  which  we  wanted  the  solution.  This  is 
the  basic  method  of  conformal  mapping — to  transform  from  a  simple  region  where 
you  know  the  answer  to  a  given  problem,  to  the  region  in  which  you  want  the  so¬ 
lution.  The  temperature  at  any  (x,  y)  point  is  the  same  as  the  temperature  at  the 
(u,v)  image  point,  since  we  obtain  the  temperature  as  a  function  of  x  and  y  by 
the  same  substitution  u  =  u(x,y),  v  =  v(x,y)  that  we  use  to  obtain  image  points. 
Thus  the  temperatures  on  the  boundaries  of  the  transformed  region  are  the  same 
as  the  temperatures  on  the  corresponding  boundaries  of  the  simpler  (u,  v)  region. 
Similarly,  isothermals  (curves  of  constant  temperature)  transform  into  isothermals; 
in  this  problem  the  (u,v)  isothermals  are  the  lines  v  =  const.,  and  so  the  (x,y) 
isothermals  are  9  =  const.  You  can  show  that  the  rate  of  change  of  T  in  a  direction 
perpendicular  to  a  boundary  in  the  (u,  v )  plane  is  proportional  to  the  corresponding 
rate  of  change  of  T  in  a  direction  perpendicular  to  the  image  boundary  in  the  (x,  y) 
plane  (Problem  9.15).  Thus  insulated  boundaries  (across  which  the  rate  of  change 
of  T  is  zero)  map  into  insulated  boundaries.  The  lines  (or  curves)  perpendicular  to 
the  isothermals  give  the  direction  of  flow  of  heat;  in  Figure  10.1  heat  flows  along 
the  lines  u  =  const,  in  the  w  plane,  and  along  the  circles  r  =  const,  (which  are  the 
images  of  u  =  const.)  in  the  z  plane. 

Using  the  same  mapping  function  w  =  In  z,  we  can  solve  a  number  of  other 
physics  problems.  Observe  first  that  if  we  think  of  Figure  10.1  as  representing  a  cross 
section  of  a  three-dimensional  problem  (with  all  parallel  cross  sections  identical), 
then  (10.2)  gives  the  solution  of  the  three-dimensional  problem  also.  In  Figure  10.1 
the  (u,  v)  diagram  would  be  the  cross  section  of  a  slab  with  faces  at  T  =  100°  and 
T  =  0°  and  all  other  surfaces  insulated  (or  extending  to  infinity);  the  (x,  y)  diagram 
would  similarly  represent  half  a  cylinder.  Now  let  us  do  a  three-dimensional  problem 
in  electrostatics. 


V 

y 

7T 

V  =  100  volts 

0 

V  =  100  volts 

V  =  0  volts 

V  =  0  volts 

Insulation 

Figure  10.2 

Example  3.  In  Figure  10.2  the  (u,  v)  diagram  represents  (the  cross  section  of)  two  infinite 
parallel  plates,  one  at  potential  V  =  0  volts  and  one  at  potential  V  =  100  volts.  The 
(x,  y)  diagram  represents  (the  cross  section  of)  one  plane  with  its  right-hand  half 
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at  potential  V  =  0  volts  and  its  left-hand  half  at  V  =  100  volts.  From  electricity 
we  know  that  the  electrostatic  potential  V  satisfies  Laplace’s  equation  in  regions 
where  there  is  no  free  charge  (see  Chapter  13,  Section  1).  You  should  convince 
yourself  that  the  mapping  by  (10.1)  gives  the  result  shown  in  Figure  10.2,  and  that 
the  potential  is  given  by 


Tr  100  100  „  100  y 

V  =  - v  =  - 9  = - arc  tan  — , 

7T  7T  7T  X 


0  <  9  <  7T 


as  in  (10.2).  The  equipotentials  {V  =  const.)  in  the  (x,y)  plane  are  the  lines  9  = 
const.  Recall  that  the  electric  field  is  given  by  E  =  -VF,  and  that  the  gradient  of 
V  is  perpendicular  to  V  =  const.  (Chapter  6,  Section  6).  Then  the  direction  of  the 
electric  field  at  any  point  is  perpendicular  to  the  equipotential  through  that  point. 
Thus  if  we  sketch  the  curves  r  =  const,  which  are  perpendicular  to  the  equipotentials 
9  =  const.,  then  the  tangent  to  a  circle  at  a  point  gives  the  direction  of  the  electric 
field  E  at  that  point.  Note  the  correspondence  between  the  isothermals  of  the 
temperature  problem  and  the  equipotentials  here,  and  between  the  lines  of  electric 
flux  (curves  tangent  to  E)  and  the  lines  or  curves  along  which  heat  flows. 

We  can  also  solve  problems  in  hydrodynamics  (see  Chapter  6,  Section  10)  by 
conformal  mapping.  We  consider  a  two-dimensional  flow  of  water  by  which  we 
mean  either  that  we  think  of  the  water  as  flowing  in  a  thin  sheet  over  the  (x,  y) 
[or  (u,  u)]  plane,  or  if  it  has  depth,  the  flow  is  the  same  in  all  planes  parallel  to 
the  (x,  y)  [or  (it,  x)]  plane.  Although  it  is  convenient  to  talk  about  water,  what  we 
actually  require  is  an  irrotational  flow  (see  Chapter  6,  Section  11)  of  a  nonviscous 
incompressible  fluid.  For  then  (see  Problem  2)  the  velocity  V  of  the  liquid  is  given 
by  V  =  V4>,  where  $  (called  the  velocity  potential)  satisfies  Laplace’s  equation. 
Water  approximately  meets  these  requirements. 


Figure  10.3 

Example  4.  Figure  10.3  shows  two  flow  patterns  related  by  the  same  transformation  we 
have  used  in  the  heat  problem  and  the  electrostatics  problem,  namely  w  =  In  z.  In 
the  w  plane  of  Figure  10.3,  we  picture  water  flowing  in  the  u  direction  at  constant 
speed  Vo  down  a  channel  between  v  =  0  and  v  =  2-7T.  (Note  that  v  is  the  imaginary 
part  of  w  =  u  +  iv  and  has  nothing  to  do  with  velocity.)  The  velocity  potential 
is  <E>  =  Vo u;  for  then  the  velocity,  V  =  V4>,  has  components  d$/du  =  Vo  in  the 
u  direction  and  d$/dv  =  0  in  the  v  direction  as  we  have  assumed.  The  function 
<F-M’F  =  Vo w  =  Vo (u+iv)  is  called  the  complex  potential ;  the  function  T  (conjugate 
to  4>;  see  Section  2)  is  called  the  stream  function.  The  lines  'V  =  const,  (that  is, 
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v  =  const,  in  the  w  plane)  are  the  lines  along  which  the  water  flows  and  are  called 
streamlines.  Observe  that  the  lines  $  =  const,  and  the  lines  dr  =  const,  are  mutually 
perpendicular  sets  of  lines.  The  water  flows  across  lines  of  constant  <f>  and  along 
streamlines  (constant  T ) ;  boundaries  of  the  channel  (v  =  0  and  v  =  2ir)  must  then 
be  streamlines.  The  water  comes  from  the  left  (Figure  10.3,  w  plane)  and  goes  off 
to  the  right;  we  say  that  there  is  a  source  at  the  left  and  a  sink  at  the  right. 

Now  consider  the  mapping  of  the  w  plane  flow  of  Figure  10.3  into  the  z  plane 
by  the  function  w  =  In  z.  The  complex  potential  is 


<f>  +  i T  =  VqW  =  Vq  In  z  =  Vq  (In  r  +  iO) . 


The  streamlines  are  T  =  const.,  or  9  =  const.,  that  is,  radial  lines;  the  curves  d>  = 
const,  are  circles  r  =  const,  and  are  perpendicular  to  the  streamlines.  The  velocity 
is  given  by 


What  we  are  describing,  then,  is  the  flow  of  water  from  a  source  at  the  origin  out 
along  radial  lines.  Since  the  same  amount  of  water  crosses  a  small  circle  (about  the 
origin)  or  a  large  one,  the  velocity  of  the  water  decreases  with  r  as  we  have  found 


(|V|  =  V0/r). 


We  can  obtain  another  flow  pattern  from  any  given  one  by  interchanging  the 
equipotentials  and  the  streamlines.  In  Figure  10.3,  z  plane,  this  new  flow  would  have 
the  circles  r  =  const,  as  streamlines  and  would  correspond  to  a  whirlpool  motion 
of  the  water  about  the  origin  (called  a  vortex).  There  are  still  other  applications 
of  this  diagram.  The  circles  r  =  const,  give  the  direction  of  the  magnetic  field 
about  a  long  current-carrying  wire  perpendicular  to  the  (x,  y)  plane  and  passing 
through  the  origin.  The  radial  lines  9  =  const,  give  the  direction  of  the  electric 
field  about  a  similar  long  wire  with  a  static  charge  on  it.  The  radial  lines  give  the 
direction  of  heat  flow  from  a  small  hot  object  at  the  origin,  and  the  circles  r  =  const, 
are  then  the  isothermals.  By  starting  with  problems  like  these  to  which  we  know 
the  answers  and  using  various  conformal  transformations,  we  can  solve  many  other 
physics  problems  involving  fluid  flow,  electricity,  heat,  and  so  on.  Some  examples 
are  outlined  in  the  problems  and  you  will  find  many  more  in  books  on  complex 
variables. 

Example  5.  Let  us  consider  one  somewhat  more  complicated  example  of  the  use  of  con¬ 
formal  mapping.  We  shall  be  able  to  solve  two  interesting  physics  problems  in  this 
example:  (1)  to  find  the  flow  pattern  for  water  flowing  out  of  the  end  of  a  straight 
channel  into  the  open,  and  (2)  to  find  the  edge  effect  (fringing)  at  the  ends  of  a 
parallel-plate  capacitor. 


We  consider  the  mapping  function 


(10.3) 


z  =  w  +  e 


W 


=  u  +  iv  +  e  e 


u  +  iv  +  eu  (cos  v  +  i  sin  v ) , 


x  =  u  +  eu  cos  v,  y  —  v  +  eu  sin  v. 


In  Figure  10.4,  w  plane,  we  picture  a  parallel  flow  of  water  at  constant  velocity  in 
the  region  between  the  lines  DEF  and  GHI\  this  is  just  like  the  flow  of  Figure  10.3, 


Section  10 


Some  Applications  of  Conformal  Mapping  715 


D  E 

w  plane 

V 

F  F 

7T 

B' 

r  jy  L 

A  B 

-  C  „  A'  | 

O'"  -1 

0  1 

—  7T 

G  H 

.  1  G'  ,1V 

-r  r  •" 

Figure  10.4 

u>  plane.  Now  let  us  map  the  w  plane  streamlines  into  the  z  plane  using  (10.3).  On 
the  u  axis,  v  =  0;  putting  v  =  0  in  (10.3),  we  find  y  =  0,  and  x  =  u  +  eu.  Thus 
the  u  axis  maps  into  the  x  axis  ( y  =  0)  with  u  =  — oo  corresponding  to  x  =  — oo, 
u  =  0  corresponding  to  x  =  1,  and  u  =  +oo  corresponding  to  x  =  +oo  as  shown 
in  Figure  10.4  (line  ABC  maps  into  A'B'C').  Now  on  DEF ,  v  =  ir;  substituting 
v  =  7r  into  (10.3),  we  find  y  =  n,  x  =  u  +  e“cos7r  =  u  —  eu.  However,  the  image 
of  v  =  7r  is  not  the  entire  line  y  =  n.  To  see  this  consider  x  =  u  —  eu .  We  find  the 
maximum  value  of  x  for  dx/du  =  1  —  eu  =  0,  d2x/du2  =  — e“  <  0.  These  equations 
are  satisfied  for  u  =  0,  x  =  —  1.  The  point  E  (u  =  0,  v  =  tt)  maps  into  the  point 
E'  (x  =  —  1,  y  =  7r).  Thus  DE  in  the  w  plane  maps  into  the  part  of  the  line  y  =  ir 
in  the  z  plane  up  to  x  =  —1  with  u  =  —  oo  corresponding  to  x  =  —  oo  and  u  =  0 
corresponding  to  x  =  —  1.  To  see  how  to  map  EF ,  we  realize  that  x  has  its  largest 
value  at  u  =  0  and  so  decreases  as  u  increases;  for  very  large  positive  u,  x  =  u  —  eu 
is  negative  and  of  large  absolute  value  since  eu  u.  Thus  the  positive  part  of 
v  =  7r  (EF)  maps  into  the  same  line  segment  (y  =  n,  x  <  —1)  that  we  obtained 
for  the  mapping  of  the  negative  part  ( DE ),  but  this  time  the  line  segment  (E'F1 , 
z  plane)  is  traversed  backward.  It  is  as  if  the  line  y  =  ir  were  broken  at  x  =  — 1 
and  bent  back  upon  itself  through  an  angle  of  180°.  By  a  parallel  discussion  of 
the  line  GHI,  we  find  that  it  maps  as  shown  in  Figure  10.4  into  G'H'V .  Other 
streamlines  in  the  w  plane  are  given  by  v  =  const,  for  any  v  between  —  tt  and  tt.  If 
we  substitute  v  =  const,  into  the  x  and  y  equations  in  (10.3),  we  have  parametric 
equations  (with  u  as  the  parameter)  for  the  streamlines  in  the  z  plane.  For  any  value 
of  v,  these  streamlines  can  be  plotted  in  the  z  plane:  some  of  them  are  shown  by 
the  solid  curves  in  Figure  10.5.  Think  of  D'E'  and  G' H'  as  boundaries  of  a  channel 
(in  the  z  plane)  down  which  water  flows  coming  from  x  =  —  oo.  The  boundaries 
stop  at  x  =  —1  and  the  water  flows  out  of  the  channel  spreading  over  the  whole 
plane,  including  spreading  back  along  the  outsides  (E'F'  and  H' V)  of  the  channel 
boundaries.  This  is  correct  according  to  our  mapping,  for  the  boundary  streamline 
DEF  mapped  into  the  broken-and- folded-back  line  D'E'F' ,  and  similarly  for  GHI 
to  G'H’V . 

For  the  electrical  application,  let  DEF  and  GHI  represent  (the  cross  section 
of)  a  large  parallel  plate  capacitor.  Then  the  lines  v  =  const,  are  the  equipotentials 
and  the  lines  u  =  const,  give  the  direction  of  the  electric  field  E.  The  image  in  the 
^  plane  represents  (a  cross  section  of)  the  end  of  a  parallel  plate  capacitor.  The 
images  of  the  equipotentials  v  =  const,  are  the  equipotentials  in  the  z  plane  (same 
as  the  streamlines,  shown  as  solid  curves  in  Figure  10.5).  The  images  of  the  lines 
u  =  const,  (shown  as  dotted  curves  in  Figure  10.5)  give  the  direction  of  the  electric 
field  at  the  end  of  a  parallel-plate  capacitor.  Well  inside  the  plates  the  E  lines  are 
vertical,  but  at  the  end  they  bulge  out;  this  effect  is  known  as  fringing. 
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Figure  10.5 


PROBLEMS,  SECTION  10 

1.  Prove  the  theorem  stated  just  after  (10.2)  as  follows.  Let  ij>(u,v )  be  a  harmonic 
function  (that  is,  <j>  satisfies  d2(j)/du2  +  d2(j>/dv2  =  0).  Show  that  there  is  then  an 
analytic  function  g(w)  =  4>{u,v)  +  itp(u,v)  (see  Section  2).  Let  w  =  f(z)  =  u  +  iv 
be  another  analytic  function  (this  is  the  mapping  function).  Show  that  the  function 
h(z)  =  g(f(z))  is  analytic.  Hint:  Show  that  h(z)  has  a  derivative.  (How  do  you  find 
the  derivative  of  a  function  of  a  function,  for  example,  In  sin  z?)  Then  (by  Section  2) 
the  real  part  of  h(z)  is  harmonic.  Show  that  this  real  part  is  (p(u(x,y),v(x,y)). 

2.  A  fluid  flow  is  called  irrotational  if  V  X  V  =  0  where  V  =  velocity  of  fluid  (Chapter  6, 
Section  11);  then  V  =  V4>.  Use  Problem  10.15  of  Chapter  6  to  show  that  if  the 
fluid  is  incompressible,  the  $  satisfies  Laplace’s  equation.  ( Caution :  In  Chapter  6, 
we  used  V  =  vp,  with  v  =  velocity;  here  V  =  velocity.) 

3.  Assuming  from  electricity  the  equations  V-D  =  p,  E  =  -W,  D  =  eE  (e  =  const.), 
show  that  in  regions  where  the  free  charge  density  p 
equation. 

4.  Let  a  flat  plate  in  the  shape  of  a  quarter-circle,  as 
shown,  have  its  faces  and  curved  boundary  insulated, 
and  its  two  straight  edges  held  at  0°  and  100°.  Find 
the  temperature  distribution  T(x,y)  in  the  plate,  and 
the  equations  of  the  isothermals.  Hint:  Use  the  map¬ 
ping  function  w  =  In  a  as  in  Figure  10.1;  what  w  plane 
line  maps  into  the  y  axis? 

5.  Consider  a  capacitor  made  of  two  very  large  perpendicular  plates.  (Let  the  positive 
x  and  y  axes  in  the  diagram  of  Problem  4  represent  a  cross  section  of  the  capacitor.) 
Let  one  plate  (x  axis)  be  held  at  potential  V  =  0,  and  the  other  plate  (y  axis)  be 
held  at  potential  V  =  100  volts.  Find  the  potential  V(x,y)  for  x  >  0,  y  >  0,  and 
the  equations  of  the  equipotentials.  Hint:  This  problem  is  mathematically  the  same 
as  Problem  4. 
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6.  Let  the  figure  represent  (the  cross  section  of)  a  hot  cylinder  (say  T  =  100°)  lying 
on  a  cold  plane  (say  T  =  0°).  (Separate  the  two  by  a  bit  of  insulation.)  Find 
the  temperature  in  the  shaded  region.  Alternatively,  let  the  cylinder  and  the  plane 
be  held  at  two  different  electric  potentials  (with  insulation 
between),  and  find  the  electric  potential  in  the  shaded  region. 

Find  and  sketch  some  of  the  isothermals  (equipotentials)  and 
some  of  the  curves  (perpendicular  to  the  isothermals)  along 
which  heat  flows  (lines  of  flux  for  the  electric  case).  Hint: 

Use  the  mapping  function  w  =  1/z,  and  consider  the  image 
of  the  w  plane  region  between  v  =  0  and  v  =  —  1. 

7.  Use  the  mapping  function  w  =  z2  to  find  the  streamlines  for  the  flow  of  water 
around  the  inside  of  a  right-angle  boundary.  Find  the  velocity  potential  <5>,  the 
stream  function  d',  and  the  velocity  V  =  V$. 

8.  Observe  that  the  magnitude  of  the  velocity  in  Problem  7  can  be  obtained  from 
V  =  Vo\dw/dz\.  Show  that  this  result  holds  in  general  as  follows.  Let  w  =  f(z)  be 
an  analytic  mapping  function  such  that  the  lines  v  =  const,  map  into  the  streamlines 
of  the  flow  you  want  to  consider  in  the  z  plane.  Then 

Vow  =  Vo{u  +  iv)  =  <l>(a;,  y )  +  i^[x,y). 


Show  that 


dw  _  <9£  .9$ 

0  dz  dx  dy 


-  Vx  iVy 


(this  expression  is  called  the  complex  velocity).  Hence  show  that  V  =  Vo\dw / dz\ . 


9.  Find  and  sketch  the  streamlines  for  the  flow  of  water  over  a  semicircular  hump  (say 
a  half-buried  log  at  the  bottom  of  a  stream)  as  shown.  Hint:  Use  the  mapping 
function  w  =  z  +  z^1.  Show  that  the  u  axis  maps  into  the  contour  ABODE  with 
the  correspondence  shown. 


10.  Find  and  sketch  the  streamlines  for  the  indicated  flow  of  water  inside  a 
rectangular  boundary.  Hint:  Consider  w  =  sin  z;  map  the  u  axis  into  the 
boundary  of  the  rectangle. 


11.  For  w  =  ln[(z  +  l)/(z  —  1)],  show  that  the  images  of  u  =  const,  and  v  =  const,  are 
two  orthogonal  sets  of  circles.  Find  centers  and  radii  of  five  or  six  circles  of  each  set 
and  sketch  them.  Include  the  circle  with  center  at  the  origin. 


Use  the  results  of  Problem  11  to  solve  the  following 
physics  problems. 

12.  The  figure  represents  the  cross  section  of  a  long 
cylinder  (assume  it  infinitely  long)  cut  in  half, 
with  the  top  half  and  the  bottom  half  insulated 
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from  each  other.  Let  the  surface  of  the  top  half  be  held  at  temperature  T  =  30°  and 
the  surface  of  the  bottom  half  at  T  —  10°.  Find  the  temperature  T(x,y)  inside  the 
cylinder.  Hint:  Show  that  the  line  v  =  7t/2  maps  into  the  lower  half  of  the  circle 
\z\  =  1,  and  the  line  v  =  3n/2  maps  into  the  upper  half  of  the  circle. 

13.  Let  the  figure  in  Problem  12  represent  (the  cross  section  of)  a  capacitor  with  the 
lower  half  at  potential  Vi  and  the  upper  half  at  potential  V2.  Find  the  potential 
V{x,y)  between  the  plates  (that  is,  inside  the  circle).  Hint:  This  is  almost  like 
Problem  12.  Observe  that  in  the  text  and  in  Problem  12,  the  wr-plane  temperature 
is  of  the  form  Av,  with  A  constant;  here  you  need  the  potential  of  the  form  Av  +  B, 
A  and  B  constants. 

14.  In  the  figure  in  Problem  12,  let  z  =  —1  be  a  source  and  z  =  +1  a  sink,  and  let  the 
water  flow  inside  the  circular  boundary.  Find  <t>,  U',  and  V.  Sketch  the  streamlines. 

15.  In  Problem  14,  the  streamlines  were  the  images  of  v  =  const.  Consider  the  flow 
(over  the  whole  plane,  that  is,  with  no  boundaries)  with  streamlines  u  =  const. 
This  flow  may  be  described  as  two  vortices  rotating  in  opposite  directions.  Sketch 
a  number  of  streamlines  indicating  the  direction  of  the  velocity  with  arrows.  Since 
a  boundary  is  a  streamline,  a  flow  is  not  disturbed  by  inserting  a  boundary  along 
a  streamline.  Insert  two  circular  boundaries  corresponding  to  u  =  a  and  u  =  —a. 
Show  that  the  velocity  through  the  narrow  neck  (say  at  z  =  0)  is  greater  than  the 
velocity  elsewhere  (say  at  z  =  i).  You  can  simplify  your  calculation  of  the  velocity 
by  showing  that  the  result  in  Problem  8  holds  here  also. 

16.  Two  long  parallel  cylinders  form  a  capacitor.  (Let  their  cross  sections  be  the  images 
of  u  =  a  and  u  =  —a.)  If  they  are  held  at  potentials  Vo  and  —Vo,  find  the  potential 
V  (x,  y)  at  points  between  them,  given  that  the  charge  (per  unit  length)  on  a  cylinder 
is  q  =  Vo/ (2 a),  show  that  the  capacitance  (per  unit  length),  that  is,  q/( 2Vo),  is 
given  by  l/(4cosh_1  d/(2r)),  where  d  is  the  distance  between  the  centers  of  the  two 
cylinders,  and  their  radii  are  r. 

17.  Other  problems  to  consider  using  the  mapping  function  of  Problem  11:  (a)  a  capac¬ 
itor  consisting  of  two  long  cylinders  one  inside  the  other,  but  not  concentric;  (b)  the 
magnetic  field  in  a  plane  perpendicular  to  two  long  parallel  wires  carrying  equal  but 
opposite  currents;  (c)  the  electric  field  in  a  plane  perpendicular  to  two  long  parallel 
wires,  one  charged  positive  and  the  other  negative;  (d)  other  flow  problems  obtained 
by  inserting  boundaries  along  streamlines. 

11.  MISCELLANEOUS  PROBLEMS 

In  Problems  1  and  2,  verify  that  the  given  function  is  harmonic,  and  find  a  function  f(z)  of 

which  it  is  the  real  part.  Hint:  Use  Problem  2.64.  For  Problem  2,  see  Chapter  2,  Section 

17,  Problem  19. 

1.  lnV(l  +  x)2  +  y2  2-  arctan^Y 

3.  Liouville’s  theorem:  Suppose  f(z)  is  analytic  for  all  z  (except  oo),  and  bounded 
[that  is,  |/(2)|  <  M  for  all  z  and  some  M],  Prove  that  f(z)  is  a  constant.  Hints:  If 
f'(z)  =  0,  then  f(z)  =  const.  To  show  this,  write  f'(z)  as  in  Problem  3.21  where  C 
is  a  circle  of  radius  R  and  center  z,  that  is,  w  =  z  +  Rel6 .  Show  that  |/'(z)|  <  M/R , 
and  let  R  — ►  00. 

4.  Use  Liouville’s  theorem  (Problem  3)  to  prove  the  fundamental  theorem  of  algebra 
(see  Problem  7.44).  Hint:  Let  P(z)  be  a  polynomial  of  degree  >  1;  then  f(z)  = 
1  /P(z)  is  a  bounded  analytic  function  in  a  region  not  containing  any  zeros  of  P(z). 
Disprove  the  assumption  that  P(z)  has  no  zeros  anywhere. 
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In  Problems  5  to  8,  find  the  residues  of  the  given  function  at  all  poles.  Take  z  =  re'6 , 
0  <  9  <  2tt. 


5. 


zl/3 

1+Z2 


6. 


y/j 

1  +8z3 


7. 


In  z 
1  +  z2 


In  2 

(2z  -  l)2 


In  Problems  9  to  10,  use  Laurent  series  to  find  the  residues  of  the  given  functions  at  the 
origin. 


9. 


10. 


ln(l  —  z) 
sin2  z 


11.  Find  the  Laurent  series  of  f(z )  =  e2/(l  —  z )  for  \z\  <  1  and  \z\  >  1.  Hints: 

For  \z\  <  1,  multiply  two  power  series;  you  should  find  f(z)  =  ’Y^=0o,nZn  with 
an  =  1/^-  For  \z\  >  1,  use  (4.3)  where  C  is  a  circle  \z\  =  a  with  a  >  1. 

Evaluate  the  integrals  by  finding  residues  at  1  and  0.  You  should  find  f(z)  = 
Y,™=o  bnz~n  where  all  bn  =  -e  and  an  =  -e  +  J2k=o 

12.  Let  f(z)  be  the  branch  of  \J z2  —  1  which  is  positive  for  large  positive  real  values 
of  z.  Expand  the  square  root  in  powers  of  1/z  to  obtain  the  Laurent  series  of  /(z) 
about  oo.  Thus  by  Problem  8.1  find  the  residue  of  /(z)  at  oo.  Check  your  result  by 
using  equation  (8.2). 


In  Problems  13  and  14,  find  the  residues  at  the  given  points. 


13. 


14. 


(a) 

(c) 

(a) 

(c) 


cos  2  7 r 

"7 - TT  clt  — 

(2  z  —  7r)4  2 


1  +  32z5 
ln(l  +  2z) 


4z4  +  1 


at 


at 


at  1 


0 

(1  +  0 


(b) 

(d) 

(b) 

(d) 


2z2  +  3  z 

- - —  at  oo 

z  —  1 

csc(2z  —  3)  at  z  =  | 


—  sin(2z  +  5) 
z  sin  2  z 

at 


(z  +  7r)2 


at  oo 

—  71 


In  Problem  15  to  20,  evaluate  the  integrals  by  contour  integration. 


15. 


17. 


I 

l 


cos  6  dd 
5  —  4  cos  9 

cos  x  dx 

(4x2  +  1)(*2  +  9) 


16. 

18. 


f27T  sin  9  d6 
J0  5  +  3  sin  9 

[°°x smW2)rfx 
Jo  +  4 


19. 


r°°  sin  x  dx 

J- oo  (3a;  -  7i)(a;2  +  712) 


20. 


DT/  f°°  cos  xdx 

7-00  x(l-x)(x2  +  l) 


Verify  the  formulas  in  Problem  21  to  27  by  contour  integration  or  as  indicated.  Assume 
a  >  0,  m  >  0. 


21. 


22. 


23. 


L 

l 


d9 


a  +  b  sin  9 
d9 


i: 


d9 


2tt 


a  +  b  cos  9  y'a2  -  b2  ’ 

2?r  d9  2na 


|fe|  < 


|6|  <  a 


(a  +  fa  sin#)2  J  0  (a  +  fecos#)2  (a2  —  62)3/2  ’ 

Hint:  You  can  do  this  directly  by  contour  integration,  but  it  is  easier  to  differentiate 
Problem  21  with  respect  to  a. 

a  o—  /  „  \ 

|6|  <  a 


r2n  sin  9d9 

f2n  cos  9  d9  2tv  - 

Jo  a  +  6  sin  9  J 

q  a  +  b  cos  6  b  1 

Va2  -b2J 
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24. 

26. 


cos  mx  dx 

7T  —ma 

25. 

PV 

f°°  cosmx  dx 

x2  +  a2 

2  a 

l0  x2  —  a2 

x  sin  mx  dx 

7T  —ma 

27. 

PV 

x  sin  mxdx 

x2  +  a2 

26 

l0  x2  —  a2 

7T  . 

- sin  ma 

2  a 


7T 

—  cos  ma 


Hint  for  Problems  26  and  27:  Differentiate  Problems  24  and  25  with  respect  to  m. 
fx  In  x  dx 


28.  Evaluate 


f 


by  using  the  contour  of  Figure  7.4. 


(1  +  x)2 

Hint:  Along  DE,  z  =  re2nt  so  In  a  =  In  r  +  2ivi. 
(Inx)2 


29.  Evaluate 


/  i  ’  2  d x  ^  using  the  contour  of  Figure  7.3.  Comment:  Note  that 

Jo  1  +  x 

Inx 


your  work  also  shows  that 
30.  Show  that 


POO 

Jo  1 


+  X2 


■  dx  =  0. 


r°°  costing  _ tt _ 

Jo  x2  +  l  2cosh(7r/2) 

by  integrating  elXnz /(z2  —  1)  around  a  contour  like  Figure  7.3  but  rotated  90°  clock¬ 
wise  so  the  straight  side  is  along  the  y  axis. 

As  in  Section  7,  find  out  how  many  roots  the  equations  in  Problem  31  to  34  have  in  each 
quadrant. 


31. 

24  +  32  T  5  =  0 

32.  z3  +  2z2  +  5z  +  6  =  0 

33. 

z6  +  z3  +  9z  +  64  =  0 

34.  z8  +  5z3  +  3z  +  4  =  0 

(no  real  roots) 

(2  negative  real  roots) 

35. 

Show  that  the  Cauchy-Riemann  equations  [see  (2.2)  and  Problem  2.46]  in  a  general 
orthogonal  curvilinear  coordinate  system  [see  Chapter  10,  Sections  8  and  9]  are 

1  du  1  dv  1  dv  1  du 

hi  dx\  /i2  8x2  ’  hi  dxi  /12  dx2 


where,  as  in  Chapter  10,  the  variables  are  xi,  X2  and  the  scale  factors  are  hi,  /i2- 
Hint:  Consider  the  directional  derivatives  (Chapter  6,  Section  6)  in  two  perpendic¬ 
ular  directions.  (Compare  Problem  2.46.)  Also  show  that  u  and  v  satisfy  Laplace’s 
equation,  Chapter  10,  equation  (9.10)  (drop  the  013  term  and  set  /13  =  1). 

36.  Show  that  a  harmonic  function  u(x,  y)  is  equal  at  every  point  a  to  its  average  value 
on  any  circle  centered  at  a  [and  lying  in  the  region  where  f(z)  =  u(x ,  y)  +  iv(x,  y) 
is  analytic].  Hint:  In  (3.9),  let  z  =  a  +  re*8  (that  is,  C  is  a  circle  with  center  at 
a),  and  show  that  the  average  value  of  f(z)  on  the  circle  is  f(a)  (see  Chapter  7, 
Section  4  for  discussion  of  the  average  of  a  function).  Take  real  and  imaginary  parts 
of  f(a)  =  [u( x,  y)  +  iv(x,  y)]z=a- 

37.  A  (nonconstant)  harmonic  function  takes  its  maximum  value  and  its  minimum  value 
on  the  boundary  of  any  region  (not  at  an  interior  point).  Thus,  for  example,  the 
electrostatic  potential  V  in  a  region  containing  no  free  charge  takes  on  its  largest 
and  smallest  values  on  the  boundary  of  the  region;  similarly,  the  temperature  T  of  a 
body  containing  no  sources  of  heat  takes  its  largest  and  smallest  values  on  the  surface 
of  the  body.  Prove  this  fact  (for  two-dimensional  regions)  as  follows:  Suppose  that 
it  is  claimed  that  u(x,  y)  takes  its  maximum  value  at  some  interior  point  a;  this 
means  that,  at  all  points  of  some  small  disk  about  a,  the  values  of  u(x,  y)  are  no 
larger  than  at  a.  Show  by  Problem  36  that  such  a  claim  leads  to  a  contradiction 
(unless  u  =  const.).  Similarly  prove  that  u(x,y)  cannot  take  its  minimum  value  at 
an  interior  point. 


Section  11 


Miscellaneous  Problems  721 


38.  Show  that  a  Dirichlet  problem  (see  Chapter  13,  Section  3)  for  Laplace’s  equation  in 
a  finite  region  has  a  unique  solution;  that  is,  two  solutions  ui  and  112  with  the  same 
boundary  values  are  identical.  Hint:  Consider  U2  —  «i  and  use  Problem  37.  [Also 
see  Chapter  13,  discussion  following  equation  (2.17).] 

39.  Use  the  following  sequence  of  mappings  to  find  the  steady  state  temperature  T(x,  y) 

in  the  semi-infinite  strip  y>0,  0<x<nii  T(x,  0)  =  100°,  T(0,y)  =  T(n,y)  =  0, 
and  T(x,y)  — >  0  as  y  — >  00.  (See  Chapter  13,  Section  2  and  Problem  2.6.) 

(a)  Use  w  =  (z!  —  1  )/(z'  +  1)  to  map  the  half  plane  v  >  0  on  the  upper  half  plane 

y'  >  0,  with  the  positive  u  axis  corresponding  to  the  two  rays  x'  >  1  and 

x'  <  —1,  and  the  negative  u  axis  corresponding  to  the  interval  —  1  <  x  <  1  of 

the  x'  axis. 

(b)  Use  z'  =  —  cos  z  to  map  the  half-strip  0  <  x  <  7r,  1/  >  0  on  the  z'  half  plane 
described  in  (a).  The  interval  —  1  <  x'  <1,  y'  =  0  corresponds  to  the  base 
0  <  X  <  7r,  1/  =  0  of  the  strip. 

Comments:  The  temperature  problem  in  the  (u,  v)  plane  is  like  the  problems  shown 
in  the  z  plane  of  Figures  10.1  and  10.2,  and  so  is  given  by  T  =  (100/7r)  arctan(u/u). 
In  the  2  plane  you  will  find 


T(x,y)  =  arc  tan 

7T 


2  sin  x  sinh  y 
sinh2  y  —  sin2  x 


Put  tana  = 


sinh  y 


and  use  the  formula  for  tan  2a  to  get 


T{x,y)=™  arctan^. 

7 r  sinh  y 

Note  that  this  is  the  same  answer  as  in  Chapter  13,  Problem  2.6,  if  we  replace  10 

by  7T. 

40.  Use  L13  of  the  Laplace  transform  table  to  find  the  Laplace  transform  of  sin  at  sinh  at. 
Verify  your  result  by  finding  its  inverse  transform  using  the  Bromwich  integral. 

cos2(a7r/2) 


41.  Evaluate  by  contour  integration 


ion  [ 

Jo 


(1  -  a2)2 


■  da. 


1  +  e*' 


■  dz 


Hint:  cos2(a7r/2)  =  (1  +  cosa7r)/2.  Evaluate  r  ,  . 

J  (z  —  1  )^(z  +  l)2 

around  the  upper  half  plane;  note  that  the  poles  are  actually  simple  poles  (see 
Section  7,  Example  4). 
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Probability  and  Statistics 


1.  INTRODUCTION 


The  theory  of  probability  has  many  applications  in  the  physical  sciences.  It  is  of 
basic  importance  in  quantum  mechanics  where  results  may  be  expressed  in  terms 
of  probabilities  (see  Chapter  13,  Schrodinger  equation).  It  is  needed  whenever  we 
are  dealing  with  large  numbers  of  particles  or  variables  where  it  is  impossible  or 
impractical  to  have  complete  information  about  each  one,  such  as  in  kinetic  theory 
and  statistical  mechanics  and  a  great  variety  of  engineering  problems.  Statistics  is 
the  part  of  probability  theory  which  deals  with  the  interpretation  of  sets  of  data. 
You  need  statistical  terms  and  methods  every  time  you  make  a  set  of  laboratory 
measurements.  In  this  chapter,  we  shall  discuss  some  of  the  basic  ideas  of  probability 
and  statistics  which  are  most  useful  in  applications. 

The  word  “probably”  is  frequently  used  in  everyday  life.  We  say  “The  test 
will  probably  be  hard,”  “It  will  probably  snow  today,”  “We  will  probably  win  this 
game,”  and  so  on.  Such  statements  always  imply  a  state  of  partial  ignorance  about 
the  outcome  of  some  event;  we  do  not  say  “probably”  about  something  whose 
outcome  we  know.  The  theory  of  probability  tries  to  express  more  precisely  just 
what  our  state  of  ignorance  is.  We  say  that  the  probability  of  getting  a  head  in 
one  toss  of  a  coin  is  |,  and  similarly  for  a  tail.  We  mean  by  this  that  there  are  two 
possible  outcomes  of  the  experiment  (if  we  do  not  consider  the  possibility  of  the 
coin’s  standing  on  edge)  and  that  we  have  no  reason  to  expect  one  outcome  more 
than  the  other;  therefore  we  assign  equal  probabilities  to  the  two  possible  outcomes. 
(See  end  of  Section  2  for  further  discussion  of  this.) 

Consider  the  following  problem.  You  and  I  each  toss  a  coin  and  look  at  our 
own  coins  but  not  each  other’s.  The  question  is  “What  is  the  probability  that 
both  coins  show  heads?”  Suppose  you  see  that  your  coin  shows  tails;  you  say  that 
the  probability  that  both  coins  are  heads  is  zero  because  you  know  that  yours  is 
tails.  On  the  other  hand,  suppose  I  see  that  my  coin  is  heads;  then  I  say  that  the 
probability  of  both  heads  is  tj  because  I  don’t  know  whether  your  coin  shows  heads 
or  tails.  Now  suppose  neither  of  us  looks  at  either  coin,  but  a  third  person  looks 
at  both  coins  and  gives  us  the  information  that  at  least  one  is  heads.  Without  this 
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information,  there  are  four  possibilities,  namely 
(1.1)  hh  tt  th  ht 

to  each  of  which  we  would  ordinarily  assign  the  probability  ^  (see  end  of  Section  2, 
and  Section  3).  The  information  “at  least  one  head”  rules  out  tt,  but  gives  no  new 
information  about  the  other  three  cases.  Since  hh,  th,  ht  were  equally  likely  before, 
we  still  consider  them  equally  likely  and  say  that  the  probability  of  hh  is 

Notice  in  the  above  discussion  that  the  answer  to  a  probability  problem  depends 
on  the  state  of  knowledge  (or  ignorance)  of  the  person  giving  the  answer.  Notice 
also  that  in  order  to  find  the  probability  of  an  event,  we  consider  all  the  different 
equally  likely  outcomes  which  are  possible  according  to  our  information.  We  say 
that  these  are  mutually  exclusive  (for  example,  if  a  coin  is  heads  it  cannot  be  tails), 
collectively  exhaustive  (we  must  consider  all  possibilities),  and  equally  likely  (we 
have  no  information  which  makes  us  expect  one  result  more  than  another  so  we 
assume  the  same  probability  for  each  one  of  the  set  of  outcomes).  Let  us  now 
formalize  this  notion  of  probability  as  a  definition  (also  see  Section  2) . 


If  there  are  several  equally  likely,  mutually  exclusive,  and  collec¬ 
tively  exhaustive  outcomes  of  an  experiment,  the  probability  of  an 
event  E  is 

(1.2) 

number  of  outcomes  favorable  to  E 

P  =  - • 

total  number  of  outcomes 


Example  1.  Find  the  probability  that  a  single  card  drawn  from  a  shuffled  deck  of  cards 
will  be  either  a  diamond  or  a  king  (or  both). 

There  are  52  different  possible  outcomes  of  the  drawing;  since  the  deck  is  shuffled, 
we  assume  all  cards  equally  likely.  Of  the  52  cards,  16  are  favorable  (13  diamonds 
and  the  3  other  kings);  therefore  by  (1.2)  the  desired  probability  is  gf  =  yjp 

Example  2.  A  three-digit  number  (that  is,  a  number  from  100-999)  is  selected  “at  ran¬ 
dom.”  (“At  random”  means  that  we  assume  all  numbers  to  have  the  same  proba¬ 
bility  of  being  selected.)  What  is  the  probability  that  all  three  digits  are  the  same? 

There  are  900  three-digit  numbers;  9  of  them  (namely  111,  222,  •  •  • ,  999)  have 
all  three  digits  the  same.  Hence  the  desired  probability  is  ^  =  -hj. 

PROBLEMS,  SECTION  1 

1.  If  you  select  a  three-digit  number  at  random,  what  is  the  probability  that  the  units 
digit  is  7?  What  is  the  probability  that  the  hundreds  digit  is  7? 

2.  Three  coins  are  tossed;  what  is  the  probability  that  two  are  heads  and  one  tails? 
That  the  first  two  are  heads  and  the  third  tails?  If  at  least  two  are  heads,  what  is 
the  probability  that  all  are  heads? 

3.  In  a  box  there  are  2  white,  3  black,  and  4  red  balls.  If  a  ball  is  drawn  at  random, 
what  is  the  probability  that  it  is  black?  That  it  is  not  red? 
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4.  A  single  card  is  drawn  at  random  from  a  shuffled  deck.  What  is  the  probability  that 
it  is  red?  That  it  is  the  ace  of  hearts?  That  it  is  either  a  three  or  a  five?  That  it  is 
either  an  ace  or  red  or  both? 

5.  Given  a  family  of  two  children  (assume  boys  and  girls  equally  likely,  that  is,  proba¬ 
bility  1/2  for  each),  what  is  the  probability  that  both  are  boys?  That  at  least  one 
is  a  girl?  Given  that  at  least  one  is  a  girl,  what  is  the  probability  that  both  are 
girls?  Given  that  the  first  two  are  girls,  what  is  the  probability  that  an  expected 
third  child  will  be  a  boy? 

6.  A  trick  deck  of  cards  is  printed  with  the  hearts  and  diamonds  black,  and  the  spades 
and  clubs  red.  A  card  is  chosen  at  random  from  this  deck  (after  it  is  shuffled).  Find 
the  probability  that  it  is  either  a  red  card  or  the  queen  of  hearts.  That  it  is  either 
a  red  face  card  or  a  club.  That  it  is  either  a  red  ace  or  a  diamond. 

7.  A  letter  is  selected  at  random  from  the  alphabet.  What  is  the  probability  that  it  is 
one  of  the  letters  in  the  word  “probability?”  What  is  the  probability  that  it  occurs 
in  the  first  half  of  the  alphabet?  What  is  the  probability  that  it  is  a  letter  after  xl 

8.  An  integer  N  is  chosen  at  random  with  1  <  IV  <  100.  What  is  the  probability  that 
N  is  divisible  by  11?  That  N  >  90?  That  N  <  3?  That  N  is  a  perfect  square? 

9.  You  are  trying  to  find  instrument  A  in  a  laboratory.  Unfortunately,  someone  has 
put  both  instruments  A  and  another  kind  (which  we  shall  call  B)  away  in  identical 
unmarked  boxes  mixed  at  random  on  a  shelf.  You  know  that  the  laboratory  has 
3  A’s  and  7  B’s.  If  you  take  down  one  box,  what  is  the  probability  that  you  get 
an  A?  If  it  is  a  B  and  you  put  it  on  the  table  and  take  down  another  box,  what  is 
the  probability  that  you  get  an  A  this  time? 

10.  A  shopping  mall  has  four  entrances,  one  on  the  North,  one  on  the  South,  and  two 
on  the  East.  If  you  enter  at  random,  shop  and  then  exit  at  random,  what  is  the 
probability  that  you  enter  and  exit  on  the  same  side  of  the  mall? 

2.  SAMPLE  SPACE 

It  is  frequently  convenient  to  make  a  list  of  the  possible  outcomes  of  an  experiment 
[as  we  did  in  (1.1)].  Such  a  set  of  all  possible  mutually  exclusive  outcomes  is  called  a 
sample  space ;  each  individual  outcome  is  called  a  point  of  the  sample  space.  There 
are  many  different  sample  spaces  for  any  given  problem.  For  example,  instead  of 

(1.1) ,  we  could  say  that  a  set  of  all  mutually  exclusive  outcomes  of  two  tosses  of  a 
coin  is 

(2.1)  2  heads,  1  head,  no  heads. 

Still  another  sample  space  for  the  same  problem  is 

(2.2)  no  heads,  at  least  1  head. 

(Can  you  list  some  more  examples?)  On  the  other  hand,  the  set  of  outcomes 

2  heads,  at  least  1  head,  exactly  1  tail. 

cannot  be  used  as  a  sample  space,  because  these  outcomes  are  not  mutually  exclu¬ 
sive.  “At  least  1  head”  includes  “2  heads”  and  also  includes  “exactly  1  tail”  (which 
means  also  “exactly  1  head”). 
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In  order  to  use  a  sample  space  to  solve  problems,  we  need  to  have  the  probabil¬ 
ities  corresponding  to  the  different  points  in  the  sample  space.  We  usually  assign 
probability  1/4  to  each  of  the  outcomes  listed  in  (1.1).  (See  end  of  Section  2  and  Sec¬ 
tion  3.)  We  call  such  a  list  of  equally  likely  outcomes  a  uniform  sample  space.  Now 
suppose  the  outcomes  are  not  equally  likely.  Satisfy  yourself  that  the  probabilities 
associated  with  the  points  of  (2.1)  and  (2.2)  are  as  follows. 


For  (2.1):  ^  ^  n\h  and  for  (2.2): 

4  2  4 


no  h  at  least  1  h 
1  3 

4  4 


The  sample  spaces  (2.1)  and  (2.2)  with  different  probabilities  associated  with  dif¬ 
ferent  points  are  called  nonuniform  sample  spaces.  For  some  problems,  there  may 
be  both  uniform  and  nonuniform  sample  spaces;  for  example,  (1.1)  is  a  uniform 
sample  space,  and  (2.1)  and  (2.2)  are  nonuniform  sample  spaces  for  a  toss  of  two 
coins.  But  sometimes  there  is  no  uniform  sample  space;  for  example,  consider  a 
weighted  coin  which  has  a  probability  |  for  heads  and  |  for  tails.  In  such  cases, 
we  cannot  use  the  definition  (1.2)  of  probability,  and  we  need  the  following  more 
general  definition. 


Definition  of  Probability.  Given  any  sample  space  (uniform  or  not)  and  the 
probabilities  associated  with  the  points,  we  find  the  probability  of  an  event  by 
adding  the  probabilities  associated  with  all  the  sample  points  favorable  to  the 
event. 


For  a  given  nonuniform  sample  space,  we  must  use  this  definition  since  (1.2) 
does  not  apply.  If  the  given  sample  space  is  uniform,  or  if  there  is  an  underlying 
uniform  sample  space  [for  example,  (1.1)  is  the  uniform  space  underlying  (2.1)  and 
(2.2)],  then  this  definition  is  consistent  with  the  definition  (1.2)  by  equally  likely 
cases  (Problems  15  and  16),  and  we  may  use  either  definition.  As  an  example,  let 
us  find  from  (2.1)  the  probability  of  at  least  one  head;  this  is  the  probability  of  one 
head  plus  the  probability  of  two  heads  or  \  +  \  =  |.  We  get  the  same  result  from 
the  uniform  sample  space  (1.1)  using  either  (1.2)  or  the  definition  above. 

If  we  can  easily  construct  several  sample  spaces  for  a  given  problem,  we  must 
choose  an  appropriate  one  for  the  question  we  want  to  answer.  Suppose  we  ask  the 
question:  In  two  tosses  of  a  coin,  what  is  the  probability  that  both  are  heads?  From 
either  (1.1)  or  (2.1)  we  find  the  answer  1;  (2.2)  is  not  an  appropriate  sample  space 
to  use  in  answering  this  question.  (Why  not?)  To  find  the  probability  of  both  tails, 
we  could  use  any  of  the  three  listed  sample  spaces,  and  to  find  the  probability  that 
the  first  toss  gave  a  head  and  the  second  a  tail,  we  could  use  only  (1.1)  since  the 
other  sample  spaces  do  not  give  enough  information.  Let  us  now  consider  some  less 
trivial  examples. 

Example  1.  A  coin  is  tossed  three  times.  A  uniform  sample  space  for  this  problem  contains 
eight  points, 

,  .  hhh  hth  ttt  tht 

^  '  '  hht  thh  tth  htt 

and  we  attach  probability  |  to  each.  Now  let  us  use  this  sample  space  to  answer 
some  questions. 
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What  is  the  probability  of  at  least  two  tails  in  succession?  By  actual  count,  we 
see  that  there  are  three  such  cases,  so  the  probability  is  |. 

What  is  the  probability  that  two  consecutive  coins  fall  the  same?  Again  by 
actual  count,  this  is  true  in  six  cases,  so  the  probability  is  |  or 

If  we  know  that  there  was  at  least  one  tail,  what  is  the  probability  of  all  tails? 
The  point  hhh  is  now  ruled  out;  we  have  a  new  sample  space  consisting  of  seven 
points.  Since  the  new  information  (at  least  one  tail)  tells  us  nothing  new  about 
these  seven  outcomes,  we  consider  them  equally  probable,  each  with  probability  y. 
Thus  the  probability  of  all  tails  when  all  heads  is  ruled  out  is  y. 

(See  problems  11  and  12  for  further  discussion  of  this  example.) 


Example  2.  Let  two  dice  be  thrown;  the  first  die  can  show  any  number  from  1  to  6  and 
similarly  for  the  second  die.  Then  there  are  36  possible  outcomes  or  points  in  a 
uniform  sample  space  for  this  problem;  with  each  point  we  associate  the  probability 
A,.  We  can  indicate  a  3  on  the  first  die  and  a  2  on  the  second  die  by  the  symbol 
3,2.  Then  the  sample  space  is  as  shown  in  (2.4).  (Ignore  the  circling  of  some  points 
and  the  letters  a  and  b  right  now;  they  are  for  use  in  the  problems  below.) 


(2.4) 


1,1 

1,2 

1,3/ 

1,5 

1,6 

2,1 

2,2/ 

^3/ 

-"2d 

2,5 

2,6 

3+/ 

-^2/ 

^3,3 

3,4 

3,5 

3,6 

a  ^4,1/ 

/+2 

4,3 

4,4 

4,5/ 

^L6 

5,1 

5,2 

5,3 

54/ 

''5,5/ 

^5,6 

6,1 

6,2 

6,3 

<6,4/ 

/6,5 

6,6 

Let  us  now  ask  some  questions  and  use  the  sample  space  (2.4)  to  answer  them. 

(a)  What  is  the  probability  that  the  sum  of  the  numbers  on  the  dice  will  be  5? 
The  sample  space  points  circled  and  marked  a  in  (2.4)  give  all  the  cases  for  which 
the  sum  is  5.  There  are  four  of  these  sample  points;  therefore  the  probability  that 
the  sum  is  5  is  ^  or  1. 

(b)  What  is  the  probability  that  the  sum  on  the  dice  is  divisible  by  5?  This 
means  a  sum  of  5  or  10;  the  four  points  circled  and  marked  a  in  (2.4)  correspond  to 
a  sum  of  5,  and  the  three  points  circled  and  marked  b  correspond  to  a  sum  of  10. 
Thus  there  are  seven  points  in  the  sample  space  corresponding  to  a  sum  divisible 
by  5,  so  the  probability  of  a  sum  divisible  by  5  is  (7  favorable  cases  out  of  36 
possible  cases,  or  7  times  the  probability  ^  of  each  of  the  favorable  sample  points). 

(c)  Set  up  a  sample  space  in  which  the  points  correspond  to  the  possible  sums  of 
the  two  numbers  on  the  dice,  and  find  the  probabilities  associated  with  the  points 
of  this  nonuniform  sample  space.  The  possible  sums  range  from  2  (that  is,  1  +  1) 
to  12  (that  is,  6  +  6).  From  (2.4)  we  see  that  the  points  corresponding  to  any  given 
sum  lie  on  a  diagonal  (parallel  to  the  diagonal  elements  marked  a  or  b).  There  is 
one  point  corresponding  to  the  sum  2;  there  are  two  points  giving  the  sum  3,  three 
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points  for  sum  4,  etc.  Thus  we  have: 


(2.5) 

Sample  Space 

2 

3 

4 

5 

6 

7 

8 

9 

10  11  12 

Associated 

1 

2 

3 

4 

5 

6 

5 

4 

3  2  1 

probabilities 

36 

36 

36 

36 

36 

36 

36 

36 

36  36  36 

(d) 

What  is  the  most  probable 

sum 

in 

a  toss  of  two  dice? 

Although  we  can 

answer  this  from  the  sample  space  (2.4)  (Try  it!),  it  is  easier  from  (2.5).  We  see 
that  the  sum  7  has  the  largest  probability,  namely  ^  =  g- 

(e)  What  is  the  probability  that  the  sum  on  the  dice  is  greater  than  or  equal 
to  9?  Using  (2.5),  we  add  the  probabilities  associated  with  the  sums  9,  10,  11, 
and  12.  Thus  the  desired  probability  is 

_4_  A  A  A-A-A 

36  +  36  +  36  +  36~36~18' 

So  far  we  have  been  talking  as  if  it  were  perfectly  obvious  and  unquestionable 
that  heads  and  tails  are  equally  likely  in  the  toss  of  a  coin.  If  you  have  felt  skeptical 
about  this,  you  are  perfectly  right.  It  is  not  obvious;  it  is  not  even  necessarily 
true,  as  a  bent  or  weighted  coin  would  show.  We  must  distinguish  here  between 
the  mathematical  theory  of  probability  and  its  application  to  a  problem  about 
the  physical  world.  Mathematical  probability  (like  all  of  mathematics)  starts  with 
a  set  of  assumptions  and  shows  that  if  the  assumptions  are  true,  then  various 
results  follow.  The  basic  assumptions  in  a  mathematical  probability  problem  are 
the  probabilities  associated  with  the  points  of  the  sample  space.  Thus  in  a  coin 
tossing  problem,  we  assume  that  for  each  toss  the  probability  of  heads  and  the 
probability  of  tails  are  both  1,  and  then  we  show  that  the  probability  of  both  heads 
in  two  tosses  is  (See  Section  3.)  The  question  of  whether  the  assumptions  are 
correct  is  not  a  mathematical  one.  Here  we  must  ask  what  physical  problem  we 
are  trying  to  solve.  If  we  are  dealing  with  a  weighted  coin,  and  if  we  know  or 
can  somehow  estimate  experimentally  the  probability  p  of  heads  (and  so  1  —  p  of 
tails) ,  then  the  mathematical  theory  starts  with  these  values  instead  of  \ \ .  In  the 
absence  of  any  information  as  to  whether  heads  or  tails  is  more  likely,  we  often  make 
the  “natural”  or  “intuitive”  assumption  that  the  probabilities  are  both  i .  The  only 
possible  answer  to  the  question  of  whether  this  is  correct  or  not  lies  in  experiment. 
If  the  results  predicted  on  the  basis  of  our  assumptions  agree  with  experiment, 
then  the  assumptions  are  good;  otherwise  we  must  revise  the  assumptions.  (See 
Section  4,  Example  5.) 

In  this  chapter  we  shall  consider  mainly  the  mathematical  methods  of  calcu¬ 
lating  the  probabilities  of  complicated  happenings  if  we  are  given  the  probabilities 
associated  with  the  points  of  the  sample  space.  For  simplicity,  we  shall  often  assume 
these  probabilities  to  be  the  “natural”  ones;  the  mathematical  theory  we  develop 
applies,  however,  if  we  replace  these  “natural”  probabilities  (|,  ^  in  the  coin  toss 
problem,  etc.)  by  any  set  of  non-negative  fractions  whose  sum  is  1. 

PROBLEMS,  SECTION  2 

1  to  10.  Set  up  an  appropriate  sample  space  for  each  of  Problems  1.1  to  1.10  and  use  it 
to  solve  the  problem.  Use  either  a  uniform  or  nonuniform  sample  space  or  try  both. 

11.  Set  up  several  nonuniform  sample  spaces  for  the  problem  of  three  tosses  of  a  coin 
(Example  1,  above). 
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12.  Use  the  sample  space  of  Example  1  above,  or  one  or  more  of  your  sample  spaces  in 
Problem  11,  to  answer  the  following  questions. 

(a)  If  there  were  more  heads  than  tails,  what  is  the  probability  of  one  tail? 

(b)  If  two  heads  did  not  appear  in  succession,  what  is  the  probability  of  all  tails? 

(c)  If  the  coins  did  not  all  fall  alike,  what  is  the  probability  that  two  in  succession 
were  alike? 

(d)  If  Nt  =  number  of  tails  and  Nh  =  number  of  heads,  what  is  the  probability 
that  |  Nh  -  Nt  |  =  1? 

(e)  If  there  was  at  least  one  head,  what  is  the  probability  of  exactly  two  heads? 

13.  A  student  claims  in  Problem  1.5  that  if  one  child  is  a  girl,  the  probability  that 
both  are  girls  is  i- .  Use  appropriate  sample  spaces  to  show  what  is  wrong  with 
the  following  argument:  It  doesn’t  matter  whether  the  girl  is  the  older  child  or  the 
younger;  in  either  case  the  probability  is  i  that  the  other  child  is  a  girl. 

14.  Two  dice  are  thrown.  Use  the  sample  space  (2.4)  to  answer  the  following  questions. 

(a)  What  is  the  probability  of  being  able  to  form  a  two-digit  number  greater  than 
33  with  the  two  numbers  on  the  dice?  (Note  that  the  sample  point  1,  4  yields 
the  two-digit  number  41  which  is  greater  than  33,  etc.) 

(b)  Repeat  part  (a)  for  the  probability  of  being  able  to  form  a  two-digit  number 
greater  than  or  equal  to  42. 

(c)  Can  you  find  a  two-digit  number  (or  numbers)  such  that  the  probability  of 
being  able  to  form  a  larger  number  is  the  same  as  the  probability  of  being  able 
to  form  a  smaller  number?  [See  note,  part  (a).] 

15.  Use  both  the  sample  space  (2.4)  and  the  sample  space  (2.5)  to  answer  the  following 
questions  about  a  toss  of  two  dice. 

(a)  What  is  the  probability  that  the  sum  is  >  4? 

(b)  What  is  the  probability  that  the  sum  is  even? 

(c)  What  is  the  probability  that  the  sum  is  divisible  by  3? 

(d)  If  the  sum  is  odd,  what  is  the  probability  that  it  is  equal  to  7? 

(e)  What  is  the  probability  that  the  product  of  the  numbers  on  the  two  dice  is  12? 

16.  Given  an  nonuniform  sample  space  and  the  probabilities  associated  with  the  points, 
we  defined  the  probability  of  an  event  A  as  the  sum  of  the  probabilities  associated 
with  the  sample  points  favorable  to  A.  [You  used  this  definition  in  Problem  15  with 
the  sample  space  (2.5).]  Show  that  this  definition  is  consistent  with  the  definition 
by  equally  likely  cases  if  there  is  also  a  uniform  sample  space  for  the  problem  (as 
there  was  in  Problem  15).  Hint:  Let  the  uniform  sample  space  have  N  points  each 
with  the  probability  AY1.  Let  the  nonuniform  sample  space  have  n  <  N  points, 
the  first  point  corresponding  to  N\  points  of  the  uniform  space,  the  second  to  N2 
points,  etc.  What  is 

Nt  +  N2  +  ■  ■  ■  +  Nn? 

What  are  pi,p2, ,  the  probabilities  associated  with  the  first,  second,  etc.,  points 
of  the  nonuniform  space?  What  is  pi  +  p2  +  ■  ■  ■  +  Pn^  Now  consider  an  event  for 
which  several  points,  say  i,j,  k,  of  the  nonuniform  sample  space  are  favorable.  Then 
using  the  nonuniform  sample  space,  we  have,  by  definition  of  the  probability  p  of 
the  event,  p  =  pt  +pj  +  pt ■  Write  this  in  terms  of  the  N’s  and  show  that  the  result 
is  the  same  as  that  obtained  by  equally  likely  cases  using  the  uniform  space.  Refer 
to  Problem  15  as  a  specific  example  if  you  need  to. 
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17.  Two  dice  are  thrown.  Given  the  information  that  the  number  on  the  first  die  is 
even,  and  the  number  on  the  second  is  <  4,  set  up  an  appropriate  sample  space  and 
answer  the  following  questions. 

(a)  What  are  the  possible  sums  and  their  probabilities? 

(b)  What  is  the  most  probable  sum? 

(c)  What  is  the  probability  that  the  sum  is  even? 

18.  Are  the  following  correct  nonuniform  sample  spaces  for  a  throw  of  two  dice?  If 
so,  find  the  probabilities  of  the  given  sample  points.  If  not  show  what  is  wrong. 
Suggestion:  Copy  sample  space  (2.4)  and  circle  on  it  the  regions  corresponding  to 
the  points  of  the  proposed  nonuniform  spaces. 


(a) 

First  die  shows  an  even  number. 

First  die  shows  an  odd  number. 

(b) 

Sum  of  two  numbers  on  dice  is  even. 

First  die  is  even  and  second  odd. 

First  die  is  odd  and  second  even. 

(c) 

First  die  shows  a  number  <  3. 

At  least  one  die  shows  a  number  >  3. 

19.  Consider  the  set  of  all  permutations  of  the  numbers  1,  2,  3.  If  you  select  a 
tion  at  random,  what  is  the  probability  that  the  number  2  is  in  the  middle 

permuta- 

position? 

In  the  first  position?  Do  your  answers  suggest  a  simple  way  of  answering  the  same 
questions  for  the  set  of  all  permutations  of  the  numbers  1  to  7? 

3.  PROBABILITY  THEOREMS 

It  is  not  always  easy  to  make  direct  use  of  our  definitions  to  calculate  probabilities. 
Definition  (1.2)  asks  us  to  find  a  uniform  sample  space  for  a  problem,  that  is,  a 
set  of  all  possible  equally  likely ,  mutually  exclusive  outcomes  of  an  experiment,  and 
then  determine  how  many  of  these  are  favorable  to  a  given  event.  The  definition  in 
Section  2  similarly  requires  a  sample  space,  that  is,  a  list  of  the  possible  outcomes 
and  their  probabilities.  Such  lists  may  be  prohibitively  long;  we  want  to  consider 
some  theorems  which  will  shorten  our  work. 

Suppose  there  are  5  black  balls  and  10  white  balls  in  a  box;  we  draw  one  ball 
“at  random”  (this  means  we  are  assuming  that  each  ball  has  probability  yt  of  being 
drawn),  and  then  without  replacing  the  first  ball,  we  draw  another.  Let  us  ask 
for  the  probability  that  the  first  ball  is  white  and  the  second  one  is  black.  The 
probability  of  drawing  a  white  ball  the  first  time  is  y|  (10  of  the  15  balls  are  white). 
The  probability  of  then  drawing  a  black  ball  is  since  there  are  14  balls  left  and 
5  of  them  are  black.  We  are  going  to  show  that  the  probability  of  drawing  first  a 
white  ball  and  then  (without  replacement)  a  black  is  the  product  We  reason 

in  the  following  way,  using  a  uniform  sample  space.  Imagine  that  the  balls  are 
numbered  1  to  15.  The  symbol  5,3  will  mean  that  ball  5  was  drawn  the  first  time 
and  ball  3  the  second  time.  In  such  pairs  of  two  (different)  numbers  representing 
a  drawing  of  two  balls  in  succession,  there  are  15  choices  for  the  first  number  and 
14  for  the  second  (the  first  ball  was  not  replaced).  Thus  the  uniform  sample  space 
representing  all  possible  drawings  consists  of  a  rectangular  array  of  symbols  (like 
5,3)  with  15  columns  (for  the  15  different  choices  for  the  first  number)  and  14  rows 
(for  the  14  choices  for  the  second  number).  Thus  there  are  15  •  14  points  in  the 
sample  space.  [See  also  (4.1)].  How  many  of  these  sample  points  correspond  to 
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drawing  first  a  white  ball  and  then  a  black  ball?  Ten  numbers  correspond  to  white 
balls  and  the  other  five  to  black  balls.  Thus  to  obtain  a  sample  point  corresponding 
to  drawing  first  a  white  and  then  a  black  ball,  we  can  choose  the  first  number  in 
10  ways  and  then  the  second  number  in  5  ways,  and  so  choose  the  sample  point  in 
10  •  5  ways;  that  is,  there  are  10  •  5  sample  points  favorable  to  the  desired  drawing. 
Then  by  the  definition  (1.2),  the  desired  probability  is  (10  •  5) / (15  •  14)  as  claimed. 

Let  us  state  in  general  the  theorem  we  have  just  illustrated.  We  are  interested 
in  two  successive  events  A  and  B.  Let  P(A)  be  the  probability  that  A  will  happen, 
P(AB)  be  the  probability  that  both  A  and  B  will  happen,  and  Pa{B)  be  the 
probability  that  B  will  happen  if  know  that  A  has  happened.  Then 


(3.1) 


P{AB)  =  P(A)  ■  Pa{B) 


or  in  words,  the  probability  of  the  compound  event  “ A  and  Bv  is  the  product  of 
the  probability  that  A  will  happen  times  the  probability  that  B  will  happen  if  A 
does.  Using  the  idea  of  a  uniform  sample  space,  we  can  prove  (3.1)  by  following 
the  method  in  the  ball  drawing  problem.  Let  N  be  the  total  number  of  sample 
points  in  a  uniform  sample  space,  N(A)  and  N(B)  be  the  numbers  of  sample  points 
corresponding  to  the  events  A  and  B  respectively,  and  N(AB)  be  the  number 
of  sample  points  corresponding  to  the  compound  event  A  and  B.  It  is  useful  to 
picture  the  sample  space  geometrically  (Figure  3.1)  as  an  array  of  N  points  [compare 
with  sample  space  (2.4)].  We  can  then  circle  all  points  which  correspond  to  T’s 
happening  and  mark  this  region  A;  it  contains  N(A)  points.  Similarly,  we  can  circle 
the  N(B)  points  which  correspond  to  B’s  happening  and  call  this  region  B.  The 
overlapping  region  we  call  AB\  it  is  part  of 
both  A  and  B  and  contains  N(AB)  points 
which  correspond  to  the  compound  event 
A  and  B.  Then  by  the  definition  (1.2): 


(3.2) 


P{AB)  = 
P(A)  = 
Pa(B)  = 


N(AB) 
N  : 
N(A) 

N  ’ 
N(AB) 


Figure  3.1 


Perhaps  this  last  formula  for  Pa{B )  needs  some  discussion.  Recall  from  Sec¬ 
tion  2,  Example  1,  the  uniform  sample  space  (2.3)  for  three  tosses  of  a  coin.  To 
find  the  probability  of  all  tails  given  that  there  was  at  least  one  tail,  we  reduced 
our  sample  space  to  seven  points  (eliminating  hhh).  We  then  assumed  that  the 
seven  points  of  the  new  sample  space  had  the  same  relative  probability  as  before 
the  deletion  of  the  point  hhh',  thus  each  of  the  seven  points  had  probability  l. 
(This  is  no  more  and  no  less  “obvious”  than  the  original  assumption  that  the  eight 
points  had  equal  probability;  it  is  an  additional  assumption  which  we  make  in  the 
absence  of  any  information  to  the  contrary;  see  end  of  Section  2.)  Now  let  us  look 
at  the  third  equation  of  (3.2).  N(A)  is  the  number  of  sample  points  corresponding 
to  event  A;  the  N  points  in  the  original  sample  apace  all  had  the  same  probability 
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so  we  now  assume  that  when  we  cross  off  all  the  points  corresponding  to  A’s  not 
happening,  the  remaining  N(A)  points  also  have  equal  probability.  Thus  we  have  a 
new  uniform  sample  space  consisting  of  N(A)  points.  N(AB)  of  these  N(A)  points 
correspond  to  the  event  B  (assuming  A).  Thus  by  (1.2),  the  probability  of  “ B  if  A ” 
is  N(AB)/N(A).  From  the  three  equations  (3.2),  we  then  have  (3.1).  In  a  similar 
way  we  can  show  that 

(3.3)  P{BA)  =  P{B)  ■  PB{A)  =  P(AB) 

(see  Problem  1).  [We  have  proved  (3.1)  assuming  a  uniform  sample  space.  This 
assumption  is  not  necessary;  (3.1)  is  true  whether  or  not  we  can  construct  a  uniform 
sample  space;  see  Problem  2.] 

Suppose,  now,  in  our  example  of  5  black  and  10  white  balls  in  a  box,  we  draw  a 
ball  and  replace  it  and  then  draw  a  second  ball.  The  probability  of  a  black  ball  on 
the  second  drawing  is  then  ^  this  is  exactly  the  same  result  we  would  get  if 
we  had  not  drawn  and  replaced  the  first  ball.  In  the  notation  of  the  last  paragraph 

(3.4)  P(B)  =  Pa{B),  A  and  B  independent. 

When  (3.4)  is  true,  we  say  that  the  event  B  is  independent  of  event  A  and  (3.1) 
becomes 


(3.5)  P(AB)  =  P(A)  ■  P{B),  A  and  B  independent. 


Because  of  the  symmetry  of  (3.5),  we  may  simply  say  that  A  and  B  are  independent 
if  (3.5)  is  true.  (Also  see  Problem  7.) 

Example  1.  (a)  In  three  tosses  of  a  coin,  what  is  the  probability  that  all  three  are  heads? 

We  found  p  =  |  for  this  problem  in  Section  2  by  seeing  that  one  sample  point  out  of 
eight  corresponds  to  all  heads.  Now  we  can  do  the  problem  more  simply  by  saying 
that  the  probability  of  heads  on  each  toss  is  the  tosses  are  independent,  and 
therefore 

_  1  1  1  _  1 
P~  2  '  2  '  2  ~  8' 

(b)  If  we  should  want  the  probability  of  all  heads  when  a  coin  is  tossed  ten  times, 
the  sample  space  would  be  unwieldy;  instead  of  using  the  sample  space,  we  can  say 
that  since  the  tosses  are  independent,  the  desired  probability  is  p  =  (^)10. 

(c)  To  find  the  probability  of  at  least  one  tail  in  ten  tosses,  we  see  that  this  event 
corresponds  to  all  the  rest  of  the  sample  space  except  the  “all  heads”  point.  Since 
the  sum  of  the  probabilities  of  all  the  sample  points  is  1,  the  desired  probability  is 

1-(I)10. 

In  Figure  3.1  or  Figure  3.2  the  region  AB  corresponds  to  the  happening  of  both 
A  and  B.  The  whole  region  consisting  of  points  in  A  or  B  or  both  corresponds  to 
the  happening  of  either  A  or  B  or  both.  We  write  P(AB)  for  the  probability  that 
both  A  and  B  occur.  We  shall  write  P(A  +  B)  for  the  probability  that  either  or 
both  occur.  Then  we  can  prove  that 
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Figure  3.2  Figure  3.3 


(3.6)  P{A  +  B)  =  P{A)  +  P{B)  -P(AB). 


To  see  why  this  is  true,  consider  Figure  3.2.  To  find  P(A  +  B)  we  add  the  probabil¬ 
ities  of  all  the  sample  points  in  the  region  consisting  of  A  or  B  or  both.  But  if  we 
add  P(A)  and  P(B ),  we  have  included  the  probabilities  of  all  the  sample  points  in 
AB  twice  [once  in  P(A)  and  once  in  P(B)].  Thus  we  must  subtract  P(AB),  which 
is  the  sum  of  the  probabilities  of  all  the  sample  points  in  AB.  This  is  just  what 

(3.6)  says. 

If  the  sample  space  diagram  is  like  the  one  in  Figure  3.3,  so  that  P(AB)  =  0, 
we  say  that  A  and  B  are  mutually  exclusive.  Then  (3.6)  becomes 


(3.7)  P(A  +  B)  =  P{A)  +  P(B),  A  and  B  mutually  exclusive. 


Example  2.  Two  students  are  working  separately  on  the  same  problem.  If  the  first  student 
has  probability  \  of  solving  it  and  the  second  student  has  probability  |  of  solving 
it,  what  is  the  probability  that  at  least  one  of  them  solves  it? 

Let  A  be  the  event  “first  student  succeeds,”  and  B  be  the  event  “second  student 
succeeds.”  Then  P(AB)  =  \  •  |  =  |  (assume  A  and  B  independent  since  the 
students  work  separately).  Then  by  (3.6)  the  probability  that  one  or  the  other  or 
both  students  solve  the  problem  is 


P{A  +  B) 


7 

8' 


Conditional  Probability;  Bayes’  Formula  If  we  are  asked  for  the  probability 
of  event  B  assuming  that  event  A  occurs  [that  is,  Pa{B)\ ,  it  is  often  useful  to  find 
it  from  (3.1): 


(3.8) 


Pa(B)  = 


P(AB) 

W' 


Equation  (3.8)  is  called  Bayes’  formula.  In  any  conditional  probability  problem  to 
which  the  answer  is  not  immediately  obvious,  you  should  consider  whether  you 
can  easily  find  P(A)  and  P(AB );  if  so,  the  conditional  probability  Pa(B )  is  given 
by  (3.8). 


Section  3 


Probability  Theorems  733 


Example  3.  A  preliminary  test  is  customarily  given  to  the  students  at  the  beginning  of  a 
certain  course.  The  following  data  are  accumulated  after  several  years: 

(a)  95%  of  the  students  pass  the  course,  5%  fail. 

(b)  96%  of  the  students  who  pass  the  course  also  passed  the  preliminary  test. 

(c)  25%  of  the  students  who  fail  the  course  passed  the  preliminary  test. 


What  is  the  probability  that  a  student  who  has  failed  the  preliminary  test  will  pass 
the  course? 


95% 

pass 

course 


5%  fail 


Let  A  be  the  event  “fails  preliminary  test”  and  B  be  the  event  “Passes  course.” 
The  probability  we  want  is  then  Pa{B)  in  (3.8),  so  we  need  P(AB)  and  P(A). 
P(AB)  is  the  probability  that  the  student  both  fails  the  preliminary  test  and  passes 
the  course;  this  is  P(AB)  =  (0.95) (0.04)  =  0.038.  (See  Figure  3.4;  95%  of  the 
students  passed  the  course  and  of  these  4%  had  failed  the  preliminary  test.)  We 
also  want  P(A),  the  probability  that  a  students  fails  the  preliminary  test;  this 
event  corresponds  to  the  shaded  area  in  Figure  3.4.  Thus  P(A)  is  the  sum  of  the 
probabilities  of  the  two  events  “passes  course  after  failing  test,”  “fails  course  after 
failing  test.”  Then 


P(A)  =  (0.095)(0.04)  +  (0.05)(0.75)  =  0.0755 


(See  Figure  3.4;  of  the  95%  of  students  who  passed  the  course,  4%  failed  the  prelimi¬ 
nary  test;  of  the  5%  of  the  students  who  failed  the  course,  75%  failed  the  preliminary 
test  since  we  are  given  that  25%  passed.)  By  (3.8)  we  have 


Pa(B) 


P(AB) 

w 


0.038 

0.0755 


50%, 


that  is,  half  of  the  students  who  fail  the  preliminary  test  succeed  in  passing  the 
course. 

Note  that  in  Figure  3.4,  the  shaded  area  corresponds  to  event  A  (fails  preliminary 
test).  We  are  interested  in  event  B  (passes  course)  given  event  A.  Thus  instead 
of  the  original  sample  space  (whole  rectangle  in  Figure  3.4)  we  consider  a  smaller 
sample  space  (shaded  area  in  Figure  3.4).  We  then  want  to  know  what  part  of  this 
sample  space  corresponds  to  event  B  (passes  course).  This  fraction  is  P(AB)/P(A) 
which  we  computed. 
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PROBLEMS,  SECTION  3 

1.  (a)  Set  up  a  sample  space  for  the  5  black  and  10  white  balls  in  a  box  discussed 

above  assuming  the  first  ball  is  not  replaced.  Suggestions:  Number  the  balls, 
say  1  to  5  for  black  and  6  to  15  for  white.  Then  the  sample  points  form  an 
array  something  like  (2.4),  but  the  point  3,3  for  example  is  not  allowed.  (Why? 
What  other  points  are  not  allowed?)  You  might  find  it  helpful  to  write  the 
numbers  for  black  balls  and  the  numbers  for  white  balls  in  different  colors. 

(b)  Let  A  be  the  event  “first  ball  is  white”  and  B  be  the  event  “second  ball  is 
black.”  Circle  the  region  of  your  sample  space  containing  points  favorable  to 
A  and  mark  this  region  A.  Similarly,  circle  and  mark  region  B.  Count  the 
number  of  sample  points  in  A  and  in  B\  these  are  N(A)  and  N(B).  The  region 
AB  is  the  region  inside  both  A  and  B;  the  number  of  points  in  this  region  is 
N(AB).  Use  the  numbers  you  have  found  to  verify  (3.2)  and  (3.1).  Also  find 
P(B)  and  Pb(A)  and  verify  (3.3)  numerically. 

(c)  Use  Figure  3.1  and  the  ideas  of  part  (b)  to  prove  (3.3)  in  general. 

2.  Prove  (3.1)  for  a  nonuniform  sample  space.  Hints:  Remember  that  the  probability 
of  an  event  is  the  sum  of  the  probabilities  of  the  sample  points  favorable  to  it.  Using 
Figure  3.1,  let  the  points  in  A  but  not  in  AB  have  probabilities  pi,P2,  ■  ■  ■  ,Pn,  the 
points  in  AB  have  probabilities  Pn+i,Pn+2,  ■  ■  ■  ,pn+k,  and  the  points  in  B  but  not 
in  AB  have  probabilities  pn+k+i,Pn+k+2,  ■  ■  ■  ,Pn+k+i ■  Find  each  of  the  probabilities 
in  (3.1)  in  terms  of  the  p’s  and  show  that  you  then  have  an  identity. 

3.  What  is  the  probability  of  getting  the  sequence  hhhttt  in  six  tosses  of  a  coin?  If  you 
know  the  first  three  are  heads,  what  is  the  probability  that  the  last  three  are  tails? 

4.  (a)  A  weighted  coin  has  probability  of  |  of  showing  heads  and  4  of  showing  tails. 

Find  the  probabilities  of  hh,  ht,  th  and  tt  in  two  tosses  of  the  coin.  Set  up 
the  sample  space  and  the  associated  probabilities.  Do  the  probabilities  add  to 
1  as  they  should?  What  is  the  probability  of  at  least  one  head?  What  is  the 
probability  of  two  heads  if  you  know  there  was  at  least  one  head? 

(b)  For  the  coin  in  (a),  set  up  the  sample  space  for  three  tosses,  find  the  associated 
probabilities,  and  use  it  to  answer  the  questions  in  Problem  2.12. 

5.  What  is  the  probability  that  a  number  n,  1  <  n  <  99,  is  divisible  by  both  6  and  10? 
By  either  6  or  10  or  both? 

6.  A  card  is  selected  from  a  shuffled  deck.  What  is  the  probability  that  it  is  either  a 
king  or  a  club?  That  it  is  both  a  king  and  a  club? 

7.  (a)  Note  that  (3.4)  assumes  P(A)  ^  0  since  Pa(B)  is  meaningless  if  P(A)  =  0. 

Assuming  both  P(A)  ^  0  and  P(B)  ^  0,  show  that  if  (3.4)  is  true,  then 
P(A)  —  Pb(A)-,  that  is  if  B  is  independent  of  A,  then  A  is  independent  of  B. 
If  either  P(A)  or  P(B)  is  zero,  then  we  use  (3.5)  to  define  independence. 

(b)  When  is  an  event  E  independent  of  itself?  When  is  E  independent  of  “not  E ”  ? 

8.  Show  that 

P(A  +  B  +  C)  =  P{A)  +  P(B)  +  P(C)  -  P(AB )  -  P{AC)  -  P(BC)  +  P(ABC). 

Hint:  Start  with  Figure  3.2  and  sketch  in  a  region  C  overlapping  some  of  the  points 
of  each  of  the  regions  A,  B,  and  AB. 

9.  Two  cards  are  drawn  at  random  from  a  shuffled  deck  and  laid  aside  without  being 
examined.  Then  a  third  card  is  drawn.  Show  that  the  probability  that  the  third 
card  is  a  spade  is  4  just  as  it  was  for  the  first  card.  Hint:  Consider  all  the  (mutually 
exclusive)  possibilities  (two  discarded  cards  spades,  third  card  spade  or  not  spade, 
etc.). 
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10.  (a)  Three  typed  letters  and  their  envelopes  are  piled  on  a  desk.  If  someone  puts  the 

letters  into  the  envelopes  at  random  (one  letter  in  each),  what  is  the  probability 
that  each  letter  gets  into  its  own  envelope?  Call  the  envelopes  A,  B,  C,  and  the 
corresponding  letters  a,  b,  c,  and  set  up  the  sample  space.  Note  that  “a  in  C, 
b  in  B,  c  in  A”  is  one  point  in  the  sample  space. 

(b)  What  is  the  probability  that  at  least  one  letter  gets  into  its  own  envelope? 
Hint:  What  is  the  probability  that  no  letter  gets  into  its  own  envelope? 

(c)  Let  A  mean  that  a  got  into  envelope  A,  and  so  on.  Find  the  probability  P(A) 
that  a  got  into  A.  Find  P(B)  and  P(C).  Find  the  probability  P(A  +  B) 
that  either  a  or  b  or  both  got  into  their  correct  envelopes,  and  the  probability 
P(AB)  that  both  got  into  their  correct  envelopes.  Verify  equation  (3.6). 


11.  In  paying  a  bill  by  mail,  you  want  to  put  your  check  and  the  bill  (with  a  return 
address  printed  on  it)  into  a  window  envelope  so  that  the  address  shows  right  side 
up  and  is  not  blocked  by  the  check.  If  you  put  check  and  bill  at  random  into  the 
envelope,  what  is  the  probability  that  the  address  shows  correctly? 

12.  (a)  A  loaded  die  has  probabilities  A-,  4L,  ^y,of  showing  1,  2,  3,  4,  5,  6. 

What  is  the  probability  of  throwing  two  3’s  in  succession? 

(b)  What  is  the  probability  of  throwing  a  4  the  first  time  and  not  a  4  the  second 
time  with  a  die  loaded  as  in  (a)? 

(c)  If  two  dice  loaded  as  in  (a)  are  thrown,  and  we  know  that  the  sum  of  the 
numbers  on  the  faces  is  greater  than  or  equal  to  10,  what  is  the  probability 
that  both  are  5’s? 

(d)  How  many  times  must  we  throw  a  die  loaded  as  in  (a)  to  have  probability 
greater  than  |  of  getting  an  ace? 

(e)  A  die,  loaded  as  in  (a),  is  thrown  twice.  What  is  the  probability  that  the 
number  on  the  die  is  even  the  first  time  >  4  the  second  time? 


13.  (a)  A  candy  vending  machine  is  out  of  order.  The  probability  that  you  get  a  candy 

bar  (with  or  without  return  of  your  money)  is  5,  the  probability  that  you  get 
your  money  back  (with  or  without  candy)  is  i,  and  the  probability  that  you 
get  both  the  candy  and  your  money  back  is  -L  •  What  is  the  probability  that  you 
get  nothing  at  all?  Suggestion:  Sketch  a  geometric  diagram  similar  to  Figure 
3.1,  indicate  regions  representing  the  various  possibilities  and  their  probabili¬ 
ties;  then  set  up  a  four-point  sample  space  and  the  associated  probabilities  of 
the  points. 

(b)  Suppose  you  try  again  to  get  a  candy  bar  as  in  part  (a).  Set  up  the  16-point 
sample  space  corresponding  to  the  possible  results  of  your  two  attempts  to 
buy  a  candy  bar,  and  find  the  probability  that  you  get  two  candy  bars  (and 
no  money  back);  that  you  get  no  candy  and  lose  your  money  both  times;  that 
you  just  get  your  money  back  both  times. 


14.  A  basketball  player  succeeds  in  making  a  basket  3  tries  out  of  4.  How  many  tries 
are  necessary  in  order  to  have  probability  >  0.99  of  at  least  one  basket? 

15.  Use  Bayes’  formula  (3.8)  to  repeat  these  simple  problems  previously  done  by  using 
a  reduced  sample  space. 

(a)  In  a  family  of  two  children,  what  is  the  probability  that  both  are  girls  if  at 
least  one  is  a  girl? 

(b)  What  is  the  probability  of  all  heads  in  three  tosses  of  a  coin  if  you  know  that 
at  least  one  is  a  head? 


736  Probability  and  Statistics 


Chapter  15 


16.  Suppose  you  have  3  nickels  and  4  dimes  in  your  right  pocket  and  2  nickels  and  a 
quarter  in  your  left  pocket.  You  pick  a  pocket  at  random  and  from  it  select  a  coin 
at  random.  If  it  is  a  nickel,  what  is  the  probability  that  it  came  from  your  right 
pocket? 

17.  (a)  There  are  3  red  and  5  black  balls  in  one  box  and  6  red  and  4  white  balls  in 

another.  If  you  pick  a  box  at  random,  and  then  pick  a  ball  from  it  at  random, 
what  is  the  probability  that  it  is  red?  Black?  White?  That  it  is  either  red  or 
white? 

(b)  Suppose  the  first  ball  selected  is  red  and  is  not  replaced  before  a  second  ball 
is  drawn.  What  is  the  probability  that  the  second  ball  is  red  also? 

(c)  If  both  balls  are  red,  what  is  the  probability  that  they  both  came  from  the 
same  box? 

18.  Two  cards  are  drawn  at  random  from  a  shuffled  deck. 

(a)  What  is  the  probability  that  at  least  one  is  a  heart? 

(b)  If  you  know  that  at  least  one  is  a  heart,  what  is  the  probability  that  both  are 
hearts? 

19.  Suppose  it  is  known  that  1%  of  the  population  have  a  certain  kind  of  cancer.  It  is 
also  known  that  a  test  for  this  kind  of  cancer  is  positive  in  99%  of  the  people  who 
have  it  but  is  also  positive  in  2%  of  the  people  who  do  not  have  it.  What  is  the 
probability  that  a  person  who  tests  positive  has  cancer  of  this  type? 

20.  Some  transistors  of  two  different  kinds  (call  them  N  and  P)  are  stored  in  two  boxes. 
You  know  that  there  are  6  TV’s  in  one  box  and  that  2  N’s  and  3  P’s  got  mixed  in 
the  other  box,  but  you  don’t  know  which  box  is  which.  You  select  a  box  and  a 
transistor  from  it  at  random  and  find  that  it  is  an  IV;  what  is  the  probability  that 
it  came  from  the  box  with  the  6  N’s?  From  the  other  box?  If  another  transistor  is 
picked  from  the  same  box  as  the  first,  what  is  the  probability  that  it  is  also  an  N? 

21.  Two  people  are  taking  turns  tossing  a  pair  of  coins;  the  first  person  to  toss  two  alike 
wins.  What  are  the  probabilities  of  winning  for  the  first  player  and  for  the  second 
player?  Hint:  Although  there  are  an  infinite  number  of  possibilities  here  (win  on 
first  turn,  second  turn,  third  turn,  etc.),  the  sum  of  the  probabilities  is  a  geometric 
series  which  can  be  summed;  see  Chapter  1  if  necessary. 

22.  Repeat  Problem  21  if  the  players  toss  a  pair  of  dice  trying  to  get  a  double  (that  is, 
both  dice  showing  the  same  number). 

23.  A  thick  coin  has  probability  |  of  falling  heads,  |  of  falling  tails,  and  i  of  standing  on 
edge.  Show  that  if  it  is  tossed  repeatedly  it  has  probability  1  of  eventually  standing 
on  edge. 

4.  METHODS  OF  COUNTING 

Let  us  digress  for  a  bit  to  review  some  ideas  and  formulas  we  need  in  computing 
probabilities  in  more  complicated  problems. 

Let  us  ask  how  many  two-digit  numbers  have  either  5  or  7  for  the  tens  digit  and 
either  3,  4,  or  6  for  the  units  digit.  The  answer  becomes  obvious  if  we  arrange  the 
possible  numbers  in  a  rectangle 


53  54  56 
73  74  76 
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with  two  rows  corresponding  to  the  two  choices  of  the  tens  digit  and  three  columns 
corresponding  to  the  three  choices  of  the  units  digit.  This  is  an  example  of  the 
fundamental  principle  of  counting: 


If  one  thing  can  be  done  N±  ways,  and  after  that  a  second  thing 
can  be  done  in  N2  ways,  the  two  things  can  be  done  in  succession 
in  that  order  in  Ni  ■  N2  ways.  This  can  be  extended  to  doing  any 
number  of  things  one  after  the  other,  the  first  Ni  ways,  the  second 
N2  ways,  the  third  IV3  ways,  etc.  Then  the  total  number  of  ways 
to  perform  the  succession  of  acts  is  the  product  N\N2N^  ■  ■  ■ . 


Now  consider  a  set  of  n  things  lined  up  in  a  row;  we  ask  how  many  ways  we 
can  arrange  (permute)  them.  This  result  is  called  the  number  of  permutations  of  n 
things  n  at  a  time,  and  is  denoted  by  nPn  or  P(n,  n )  or  P".  To  find  this  number, 
we  think  of  seating  n  people  in  a  row  of  n  chairs.  We  can  place  anyone  in  the 
first  chair,  that  is,  we  have  n  possible  ways  of  filling  the  first  chair.  Once  we  have 
selected  someone  for  the  first  chair,  there  are  (n  —  1)  choices  left  for  the  second 
chair,  then  ( n  —  2)  choices  for  the  third  chair,  and  so  on.  Thus  by  the  fundamental 
principle,  there  are  n(n  —  l)(n  —  2)  •  •  •  2  •  1  =  n!  ways  of  arranging  the  n  people  in 
the  row  of  n  chairs.  The  number  of  permutations  of  n  things  n  at  a  time  is 

(4.2)  P(n,  n )  =  n\. 

Next  suppose  there  are  n  people  but  only  r  <  n  chairs  and  we  ask  how  many 
ways  we  can  select  groups  of  r  people  and  seat  them  in  the  r  chairs.  The  result  is 
called  the  number  of  permutations  of  n  things  r  at  a  time  and  is  denoted  by  nPr 
or  P(n,  r)  or  P".  Arguing  as  before,  we  find  that  there  are  n  ways  to  fill  the  first 
chair,  ( n  —  1)  ways  to  fill  the  second  chair,  (n  —  2)  ways  for  the  third  [note  that  we 
could  write  [n  —  2)  as  [n  —  3  +  1)],  etc.,  and  finally  [n  —  r  +  1)  ways  of  filling  chair 
r.  Thus  we  have  for  the  number  of  permutations  of  n  things  r  at  a  time 

P(n,  r)  =  n(n  —  l)(n  —  2)  ■  •  ■  (n  —  r  +  1). 

By  multiplying  and  dividing  by  (n  —  r) !  we  can  write  this  as 


(4.3)  P(n ,  r)  =  n(n  —  l)(n  —  2)  •  •  •  (n  —  r  +  1) 


(n  —  r)!  n! 

(n  —  r)!  (n  —  r)\ 


So  far  we  have  been  talking  about  arranging  things  in  a  definite  order.  Suppose, 
instead  that  we  ask  how  many  committees  of  r  people  can  be  chosen  from  a  group  of 
n  people  (n  >  r).  Here  the  order  of  the  people  in  the  committee  is  not  considered; 
the  committee  made  up  of  people  A,  B ,  C,  is  the  same  as  the  committee  made  up 
of  people  B,  A,  C.  We  call  the  number  of  such  committees  of  r  people  which  we 
can  select  from  n  people,  the  number  of  combinations  or  selections  of  n  things  r 
at  a  time,  and  denote  this  number  by  nCr  or  C(n,r)  or  (”).  To  find  C(n,  r),  we 
go  back  to  the  problem  of  selecting  r  people  from  a  group  of  n  and  seating  them 
in  r  chairs;  we  found  that  the  number  of  ways  of  doing  this  is  P(n,  r)  as  given  in 
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(4.3) .  We  can  perform  this  job  by  first  selecting  r  people  from  the  total  n  and  then 
arranging  the  r  people  in  r  chairs.  The  selection  of  r  people  can  be  done  in  C(n ,  r) 
ways  (this  is  the  number  we  are  trying  to  find),  and  after  r  people  are  selected,  they 
can  be  arranged  in  r  chairs  in  P(r,r)  ways  by  (4.2).  By  the  fundamental  principle 
(4.1),  the  total  number  of  ways  P(n ,  r )  of  selecting  and  seating  r  people  out  of  n  is 
the  product  C(n,  r)  ■  P(r ,  r).  Thus  we  have 

(4.4)  P(n,r)  =  C(n,r)  ■  P(r,r). 

We  can  solve  this  equation  to  find  the  value  C(n,r )  which  we  wanted.  Substituting 
the  values  of  P(n,r)  and  P(r,r)  from  (4.3)  and  (4.2)  into  (4.4)  and  solving  for 
C(n,  r),  we  find  for  the  number  of  combinations  of  n  things  r  at  a  time 


(4.5) 


C(n,r) 


P(n,  r ) 
P(r,  r) 


n! 

(n  —  r)!r! 


Each  time  we  select  r  people  to  be  seated,  we  leave  n  —  r  people  without  chairs. 
Then  there  are  exactly  the  same  number  of  combinations  of  n  things  n  —  r  at  a  time 
as  there  are  combinations  of  n  things  r  at  a  time.  Hence  we  write 

77,1 

(4.6)  C(n,n-r)  =  C(n,r)  =  - - 

( n  —  r)!r! 

We  can  also  obtain  (4.6)  from  (4.5)  by  replacing  r  by  (n  —  r). 


Example  1.  A  club  consists  of  50  members.  In  how  many  ways  can  a  president,  vice- 
president,  secretary,  and  treasurer  be  chosen?  In  how  many  ways  can  a  committee 
of  4  members  be  chosen? 

In  the  selection  of  officers,  we  must  not  only  select  4  people,  but  decide  which  one 
is  president,  etc.;  we  could  think  of  seating  the  4  people  in  chairs  labeled  president, 
vice-president,  etc.  Thus  the  number  of  ways  of  selecting  the  officers  is 

501  5f)l 

r(50, 4)  —  =  4g|  =  5°  -  49  ■  48  ■  47. 


The  committee  members,  however,  are  all  equivalent  (we  are  neglecting  the  pos¬ 
sibility  that  one  is  named  chairman) ,  so  the  number  of  ways  of  selecting  committees 
of  4  people  is 


C(50, 4) 


50! 

46144 


50  •  49  •  48  •  47 
24 


Example  2.  Find  the  coefficient  of  x8  in  the  binomial  expansion  of  (1  +  x)15. 
Think  of  multiplying  out 


(1  +  x)(l  +  x)(l  +  x)  •  •  •  (1  +  x),  (with  15  factors). 

We  obtain  a  term  in  x8  each  time  we  multiply  l’s  from  seven  of  the  parentheses  by 
x’s  from  eight  of  the  parentheses.  The  number  of  ways  of  selecting  8  parentheses 
out  of  15  is 
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This  is  the  desired  coefficient  of  x8. 

Generalizing  this  example,  we  see  that  in  the  expansion  of  (a+b)n,  the  coefficient 
of  an~rbr  is  (7(n,  r),  usually  written  when  used  in  connection  with  a  binomial 
expansion  (see  Chapter  1,  Section  13C).  Thus  the  expressions  (7(n,r )  are  just  the 
binomial  coefficients,  and  we  can  write 

(4.7)  (a  +  b)n  =  J2  (n)an~rbr- 

r=0  ' r  ' 


Example  3.  A  basic  problem  in  statistical  mechanics  is  this:  Given  N  balls,  and  n  boxes, 
in  how  many  ways  can  the  balls  be  put  into  the  boxes  so  that  there  will  be  given 
numbers  of  balls  in  the  boxes,  say  N\  balls  in  the  first  box,  N%  balls  in  the  second 
box,  IV3  in  the  third,  •  •  • ,  Nn  in  the  nth,  and  what  is  the  probability  that  this 
given  distribution  will  occur  when  the  balls  are  put  into  the  boxes?  In  statistical 
mechanics  the  “balls”  may  be  molecules,  electrons,  photons,  etc.,  and  each  “box” 
corresponds  to  a  small  range  of  values  of  position  and  momentum  of  a  particle.  We 
can  state  many  other  problems  in  this  same  language  of  putting  balls  into  boxes. 
For  example,  in  tossing  a  coin,  we  can  equate  heads  with  box  1,  and  tails  with 
box  2;  in  tossing  a  die,  there  are  six  “boxes.”  In  putting  letters  into  envelopes,  the 
letters  are  the  balls,  and  the  envelopes  are  the  boxes.  In  dealing  cards,  the  cards 
are  the  balls  and  the  players  who  receive  them  are  the  boxes.  In  an  alpha  scattering 
experiment,  the  alpha  particles  are  the  balls,  and  the  boxes  are  elements  of  area 
on  the  detecting  screen  which  the  particles  hit  after  they  are  scattered.  (Also  see 
Problems  14  and  21  and  Feller,  pp.  10-11.) 

Let  us  do  a  special  case  of  this  problem  in  which  we  have  15  balls  and  6  boxes, 
and  the  numbers  of  balls  we  are  to  put  into  the  various  boxes  are: 

Number  of  balls:  3  1  4  2  3  2 

In  box  number:  1  2  3  4  5  6 

We  first  ask  how  many  ways  we  can  select  3  balls  to  go  in  the  first  box  from  the 
15  balls;  this  is  (7(15,3).  (Note  that  the  order  of  the  balls  in  the  boxes  is  not 
considered;  this  is  like  the  committee  problem  in  Example  1.)  Now  we  have  12  balls 
left,  of  which  we  are  to  select  1  for  box  2;  we  can  do  this  in  (7(12, 1)  ways.  We  can 
then  select  the  4  balls  for  box  3  from  the  remaining  11  balls  in  (7(11,4)  ways,  the 
2  balls  for  box  4  in  C( 7,  2)  ways,  the  3  balls  for  box  5  in  (7(5, 3)  ways,  and  finally  the 
balls  for  box  6  in  C( 2,  2)  ways  (verify  that  this  is  1).  By  the  fundamental  principle, 
the  total  number  of  ways  of  putting  the  required  numbers  of  balls  into  the  boxes  is 

(7(15,  3)  •  (7(12, 1)  •  (7(11, 4)  •  (7(7, 2)  •  (7(5, 3)  •  (7(2,  2) 

15!  12!  11!  7!  5!  2! 

~  3!  •  12!  '  1!  •  11!  '  4!  •  7!  '  2!  -5!  '  3!  •  2!  '  2!  •  0! 

15! 

““  3!  •  1!  •  4!  •  2!  •  3!  •  2! ' 

(Remember  from  Chapters  1  and  11  that  0!  =  1.) 

Next  we  want  the  probability  of  this  particular  distribution.  Let  us  assume  that 
the  balls  are  distributed  “at  random”  into  the  boxes;  by  this  we  mean  that  a  ball 
has  the  same  probability  (namely  ^ )  of  being  put  into  any  one  box  as  into  any  other 
box.  We  can  put  the  first  ball  into  any  one  of  the  6  boxes,  the  second  ball  into  any 


740  Probability  and  Statistics 


Chapter  15 


one  of  the  6  boxes,  and  so  on.  Thus  by  the  fundamental  principle,  the  total  number 
of  ways  of  distributing  the  15  balls  into  the  6  boxes  is6-6-6-6---6  =  615  and  we 
are  assuming  that  these  distributions  are  equally  probable.  Then  the  probability 
that,  when  15  balls  are  distributed  “at  random”  into  6  boxes,  there  will  be  3  balls 
in  box  1,  1  in  box  2,  etc.,  as  given,  is,  by  (1.2)  (favorable  cases  4-  total) 


3!  •  1!  •  4!  ■  2!  •  3!  -  2!  ' 

Example  4.  In  Example  3,  we  assumed  that  the  615  possible  distributions  of  15  balls  into 
6  boxes  were  equally  likely.  This  seems  very  reasonable  if  we  think  of  putting  the 
balls  into  the  boxes  by  tossing  a  die  for  each  ball;  if  the  die  shows  1  we  put  the  ball 
into  box  1,  etc.  However,  we  can  think  of  situations  to  which  this  method  and  result 
do  not  apply.  For  example,  suppose  we  are  putting  letters  into  envelopes  or  seating 
people  in  chairs;  then  we  may  reasonably  require  only  one  letter  per  envelope,  not 
more  than  one  person  per  chair,  that  is,  one  ball  (or  none)  per  box.  Consider  the 
problem  of  seating  4  people  in  6  chairs,  that  is  of  putting  4  balls  into  6  boxes.  If  we 
number  the  chairs  from  1  to  6  and  let  each  person  choose  a  chair  by  tossing  a  die, 
we  may  have  two  or  more  people  choosing  the  same  chair.  The  result  64  (which  the 
method  of  Example  3  gives  for  the  problem  of  4  balls  in  6  boxes)  then  does  not  apply 
to  this  problem.  However,  let  us  consider  the  uniform  sample  space  of  64  points  and 
select  from  it  the  points  corresponding  to  our  restriction  (one  ball  or  none  per  box) . 
The  new  sample  space  contains  C(6,4)  •  4!  points  (number  of  ways  of  selecting  the 
4  chairs  to  be  occupied  times  the  number  of  ways  of  then  arranging  4  people  in  4 
chairs).  Since  these  points  were  equally  probable  in  the  original  (uniform)  sample 
space,  we  still  consider  them  equally  probable.  Now  let  us  ask  for  the  probability 
that  the  first  two  chairs  are  vacant  when  the  4  people  are  seated.  The  number  of 
sample  points  corresponding  to  this  event  is  4!  (the  number  of  ways  of  arranging 
the  4  people  in  the  last  4  chairs).  Thus  the  desired  probability  is 

4!  1 

C(6, 4)  •  4!  "  C(6, 4) ' 

We  can  now  see  an  easier  way  of  doing  problems  of  this  kind.  The  factor  4!, 
which  canceled  in  the  probability  calculation,  was  the  number  of  rearrangements  of 
the  4  people  among  the  4  occupied  chairs.  Since  this  is  the  same  for  any  given  set 
of  4  chairs,  we  can  lump  together  all  the  sample  points  corresponding  to  each  given 
set  of  4  chairs,  and  have  a  smaller  (still  uniform)  sample  space  of  C(6,4)  points. 
Each  point  now  corresponds  to  a  given  set  of  4  occupied  chairs;  the  quantity  C( 6, 4) 
is  just  the  number  of  ways  of  picking  4  occupied  chairs  out  of  6.  The  probability 
that  the  first  two  chairs  are  vacant  when  4  people  are  seated  is  1/C(6, 4)  since  there 
is  only  one  way  to  select  4  occupied  chairs  leaving  the  first  two  chairs  vacant. 

Another  useful  way  of  looking  at  this  problem  is  to  consider  a  set  of  4  identical 
balls  to  be  put  into  6  boxes.  Since  the  balls  are  identical,  the  4!  arrangements 
of  the  4  balls  in  4  given  boxes  all  look  alike.  We  can  say  that  there  are  C(6,4) 
distinguishable  arrangements  of  the  4  identical  balls  in  6  boxes  (one  ball  or  none 
per  box).  Since  all  these  arrangements  are  equally  probable,  the  probability  of  any 
one  arrangement  (say  the  first  two  boxes  empty)  is  1/C(6,4)  as  we  found  previously. 
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Example  5.  In  Example  4  we  found  the  same  answer  for  the  probability  that  two  partic¬ 
ular  boxes  were  empty  whether  or  not  we  considered  the  balls  distinguishable.  This 
was  true  because  the  allowed  distinguishable  arrangements  were  equally  probable. 
Without  the  restriction  of  one  ball  or  none  per  box,  all  distinguishable  arrangements 
are  not  equally  probable  according  to  the  methods  of  Examples  3  and  4.  For  exam¬ 
ple,  the  probability  of  all  balls  in  box  1  is  1/64;  compare  this  with  the  probability 
of  no  balls  in  the  first  2  boxes  and  one  ball  in  each  of  the  other  4  boxes,  which  is 
4!  4-  64  =  We  see  that  the  concentrated  arrangements  (all  or  several  balls  in  one 
box)  are  less  probable  than  the  more  uniform  arrangements. 

Now  we  are  going  to  try  to  imagine  a  situation  in  which  all  distinguishable  ar¬ 
rangements  are  equally  probable.  Suppose  the  6  boxes  are  benches  in  a  waiting  room 
and  the  4  balls  are  people  who  are  going  to  come  in  and  sit  on  the  benches.  Then 
if  the  people  are  friends,  there  will  be  a  certain  tendency  for  them  to  sit  together 
and  the  probabilities  we  have  been  calculating  will  not  apply — the  probabilities  of 
the  concentrated  arrangements  will  increase.  Consider  the  following  mathematical 
model.  (This  is  a  modification  of  Polya’s  urn  model.)  We  have  6  boxes  labeled  1 
to  6,  and  4  balls.  From  6  cards  labeled  1  to  6  we  draw  one  at  random  and  place  a 
ball  in  the  box  numbered  the  same  as  the  card  drawn.  We  then  replace  the  card  and 
also  add  another  card  of  the  same  number  so  that  there  are  now  7  cards,  two  with 
the  number  first  drawn.  We  now  select  a  card  at  random  from  these  7,  put  a  ball 
in  the  corresponding  box  and  again  replace  the  card  adding  a  duplicate  to  make 
8  cards.  We  repeat  this  process  two  more  times  (until  all  balls  are  distributed). 
Then  the  probability  that  all  balls  are  in  box  1  is  ~  ■  |  ■  |  •  The  probability 
of  one  ball  in  each  of  the  first  4  boxes  is  i  •  j  ■  |  |  •  4!  (here  i  •  =  •  |  |  is  the 
probability  that  the  first  ball  is  in  box  1,  the  second  in  box  2,  etc.;  we  must  add  to 
this  the  probability  that  the  first  ball  is  in  box  3,  the  second  in  box  1,  etc.;  there 
are  4!  such  possibilities  all  giving  one  ball  in  each  of  the  first  4  boxes).  We  see  that 
the  distributions  “all  balls  in  box  1”  and  “one  ball  in  each  of  the  first  4  boxes”  are 
equally  probable.  Further  calculation  (Problem  20)  shows  that  all  distinguishable 
arrangements  are  equally  probable. 

To  find  the  number  of  distinguishable  arrangements,  consider  the  following  pic¬ 
ture  of  the  4  balls  in  the  6  boxes. 


Box  number: 
Number  of  balls: 


o  I 
1  2 

1  0 


o  o 
3 
2 


The  lines  mean  the  sides  of  the  boxes  and  the  circles  are  the  balls;  note  that  it 
requires  7  lines  to  picture  the  6  boxes.  This  picture  shows  one  of  many  possible 
arrangements  of  the  4  balls  in  6  boxes.  In  any  such  picture  there  must  be  a  line  at 
the  beginning  and  at  the  end,  but  the  rest  of  the  lines  (5  of  them)  and  the  4  circles 
can  be  arranged  in  any  order.  You  should  convince  yourself  that  every  arrangement 
of  the  balls  in  the  boxes  can  be  so  pictured.  Then  the  number  of  such  distinguishable 
arrangements  is  just  the  number  of  ways  we  can  select  4  positions  for  the  4  circles 
out  of  9  positions  for  the  5  lines  and  4  circles.  Thus  there  are  (7(9,4)  equally  likely 
arrangements  in  this  problem. 

We  see  then  that  putting  balls  in  boxes  is  not  quite  as  simple  as  we  thought;  we 
must  say  how  we  propose  to  distribute  them  and  even  before  that  we  must  think 
what  practical  problem  we  are  trying  to  solve;  this  is  what  determines  the  sample 
space  and  the  probabilities  to  be  associated  with  the  sample  points.  Unfortunately, 
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it  may  not  always  be  clear  what  the  sample  space  probabilities  should  be;  then  the 
best  we  can  do  is  to  try  various  assumptions.  In  statistical  mechanics  it  is  found  that 
certain  particles  (for  example,  the  molecules  of  a  gas)  are  correctly  described  if  we 
assume  that  they  behave  like  the  balls  of  Example  3  (all  615  arrangements  equally 
likely);  we  then  say  that  they  obey  Maxwell-Boltzmann  statistics.  Other  particles 
(for  example,  electrons)  behave  like  the  people  to  be  seated  in  Example  4  (one 
particle  or  none  per  box);  we  say  that  such  particles  obey  Fermi-Dirac  statistics. 
Finally  some  particles  (for  example,  photons)  act  something  like  the  friends  who 
want  to  sit  near  each  other  (all  distinguishable  arrangements  of  identical  particles 
are  equally  likely);  we  say  that  these  particles  obey  Bose-Einstein  statistics.  For 
the  problem  of  4  particles  in  6  boxes,  there  are  then  64  equally  likely  arrangements 
for  Maxwell-Boltzmann  particles,  C”(6,4)  for  Fermi-Dirac  particles,  and  C( 9,4)  for 
Bose-Einstein  particles.  (See  Problems  15  to  20.) 

PROBLEMS,  SECTION  4 

1.  (a)  There  are  10  chairs  in  a  row  and  8  people  to  be  seated.  In  how  many  ways  can 

this  be  done? 

(b)  There  are  10  questions  on  a  test  and  you  are  to  do  8  of  them.  In  how  many 
ways  can  you  choose  them? 

(c)  In  part  (a)  what  is  the  probability  that  the  first  two  chairs  in  the  row  are 
vacant? 

(d)  In  part  (b),  what  is  the  probability  that  you  omit  the  first  two  problems  in  the 
test? 

(e)  Explain  why  the  answer  to  parts  (a)  and  (b)  are  different,  but  the  answers  to 
(c)  and  (d)  are  the  same. 

2.  In  the  expansion  of  (a  +  b)n  (see  Example  2),  let  a  =  b  =  1,  and  interpret  the  terms 
of  the  expansion  to  show  that  the  total  number  of  combinations  of  n  things  taken 
1,  2,  3,  ■  •  •  ,  n  at  a  time,  is  2n  —  1. 

3.  A  bank  allows  one  person  to  have  only  one  savings  account  insured  to  $100,000. 
However,  a  larger  family  may  have  accounts  for  each  individual,  and  also  accounts 
in  the  names  of  any  2  people,  any  3  and  so  on.  How  many  accounts  are  possible  for 
a  family  of  2?  Of  3?  Of  5?  Of  n?  Hint:  See  Problem  2. 

4.  Five  cards  are  dealt  from  a  shuffled  deck.  What  is  the  probability  that  they  are  all 
of  the  same  suit?  That  they  are  all  diamond?  That  they  are  all  face  cards?  That 
the  five  cards  are  a  sequence  in  the  same  suit  (for  example,  3,  4,  5,  6,  7  of  hearts)? 

5.  A  bit  (meaning  binary  digit)  is  0  or  1.  An  ordered  array  of  eight  bits  (such  as 
01101001)  is  a  byte.  How  many  different  bytes  are  there?  If  you  select  a  byte  at 
random,  what  is  the  probability  that  you  select  11000010?  What  is  the  probability 
that  you  select  a  byte  containing  three  l’s  and  five  0’s? 

6.  A  so-called  7-way  lamp  has  three  60-watt  bulbs  which  may  be  turned  on  one  or  two 
or  all  three  at  a  time,  and  a  large  bulb  which  may  be  turned  to  100  watts,  200  watts 
or  300  watts.  How  many  different  light  intensities  can  the  lamp  be  set  to  give  if  the 
completely  off  position  is  not  included?  (The  answer  is  not  7.) 

7.  What  is  the  probability  that  the  2  and  3  of  clubs  are  next  to  each  other  in  a  shuffled 
deck?  Hint:  Imagine  the  two  cards  accidentally  stuck  together  and  shuffled  as 
one  card. 
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8.  Two  cards  are  drawn  from  a  shuffled  deck.  What  is  the  probability  that  both  are 
aces?  If  you  know  that  at  least  one  is  an  ace,  what  is  the  probability  that  both  are 
aces?  If  you  know  that  one  is  the  ace  of  spades,  what  is  the  probability  that  both 
are  aces? 

9.  Two  cards  are  drawn  from  a  shuffled  deck.  What  is  the  probability  that  both  are 
red?  If  at  least  one  is  red,  what  is  the  probability  that  both  are  red?  If  at  least  one 
is  a  red  ace,  what  is  the  probability  that  both  are  red?  If  exactly  one  is  a  red  ace, 
what  is  the  probability  that  both  are  red? 

10.  What  is  the  probability  that  you  and  a  friend  have  different  birthdays?  (For  sim¬ 
plicity,  let  a  year  have  365  days.)  What  is  the  probability  that  three  people  have 
three  different  birthdays?  Show  that  the  probability  that  n  people  have  n  different 
birthdays  is 


Estimate  this  for  n  <C  365  by  calculating  lnp  [recall  that  ln(l  +  a;)  is  approximately  x 
for  x  1].  Find  the  smallest  (integral)  n  for  which  p  <  i.  Hence,  show  that  for  a 
group  of  23  people  or  more,  the  probability  is  greater  than  |  that  two  of  them  have 
the  same  birthday.  (Try  it  with  a  group  of  friends  or  a  list  of  people  such  as  the 
presidents  of  the  United  States.) 

11.  The  following  game  was  being  played  on  a  busy  street:  Observe  the  last  two  digits 
on  each  license  plate.  What  is  the  probability  of  observing  at  least  two  cars  with 
the  same  last  two  digits  among  the  first  5  cars?  10  cars?  15  cars?  How  many  cars 
must  you  observe  in  order  for  the  probability  to  be  greater  than  |  of  observing  two 
with  the  same  last  two  digits? 

12.  Consider  Problem  10  for  different  months  of  birth.  What  is  the  smallest  number  of 
people  for  which  the  probability  is  greater  than  i  that  two  of  them  were  born  in  the 
same  month? 

13.  Generalize  Example  3  to  show  that  the  number  of  ways  of  putting  N  balls  in  n  boxes 
with  Ni  in  box  1,  N2  in  box  2,  etc.,  is 

( _ ™ _ V 

■  n2\  ■  n3\  ■  ■  ■  nj  j 

14.  (a)  Find  the  probability  that  in  two  tosses  of  a  coin,  one  is  heads  and  one  tails. 

That  in  six  tosses  of  a  die,  all  six  of  the  faces  show  up.  That  in  12  tosses  of 
a  12-sided  die,  all  12  faces  show  up.  That  in  n  tosses  of  an  n-sided  die,  all  n 
faces  show  up. 

(b)  The  last  problem  in  part  (a)  is  equivalent  to  finding  the  probability  that,  when 
n  balls  are  distributed  at  random  into  n  boxes,  each  box  contains  exactly  one 
ball.  Show  that  for  large  n,  this  is  approximately  e~rLy/2nn. 

15.  Set  up  the  uniform  sample  spaces  for  the  problem  of  putting  2  particles  in  3  boxes: 
for  Maxwell-Boltzmann  particles,  for  Fermi-Dirac  particles,  and  for  Bose-Einstein 
particles.  See  Example  5.  (You  should  find  9  sample  points  for  MB,  3  for  FD,  and 
6  for  BE.) 

16.  Do  Problem  15  for  2  particles  in  2  boxes.  Using  the  model  discussed  in  Example  5, 
find  the  probability  of  each  of  the  three  sample  points  in  the  Bose-Einstein  case. 
(You  should  find  that  each  has  probability  i,  that  is,  they  are  equally  probable.) 

17.  Find  the  number  of  ways  of  putting  2  particles  in  4  boxes  according  to  the  three 
kinds  of  statistics. 
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18.  Find  the  number  of  ways  of  putting  3  particles  in  5  boxes  according  to  the  three 
kinds  of  statistics. 

19.  (a)  Following  the  methods  of  Examples  3,  4,  and  5,  show  that  the  number  of 

equally  likely  ways  of  putting  N  particles  in  n  boxes,  n  >  N,  is  nN  for  Maxwell- 
Boltzmann  particles,  C{n,  N)  for  Fermi-Dirac  particles,  and  C(n  —  1  +  N,  N) 
for  Bose-Einstein  particles. 

(b)  Show  that  if  n  is  much  larger  than  N  (think  ,  for  example,  of  n  =  106,  N  =  10), 
then  both  the  Bose-Einstein  and  the  Fermi-Dirac  results  in  part  (a)  contain 
products  of  N  numbers,  each  number  approximately  equal  to  n.  Thus  show 
that  for  n  N,  both  the  BE  and  the  FD  results  are  approximately  equal  to 
nN /N\,  which  is  1/N\  times  the  MB  result. 

20.  (a)  In  Example  5,  a  mathematical  model  is  discussed  which  claims  to  give  a  dis¬ 

tribution  of  identical  balls  into  boxes  in  such  a  way  that  all  distinguishable 
arrangements  are  equally  probable  (Bose-Einstein  statistics).  Prove  this  by 
showing  that  the  probability  of  a  distribution  of  N  balls  into  n  boxes  (accord¬ 
ing  to  this  model)  with  Ni  balls  in  the  first  box,  N2  in  the  second,  •  •  •  ,  Nn  in 
the  nth,  is  1/C(n—1+N,  N)  for  any  set  of  numbers  Ni  such  that  1  ^  =  ?V. 

(b)  Show  that  the  model  in  (a)  leads  to  Maxwell-Boltzmann  statistics  if  the  drawn 
card  is  replaced  (but  no  extra  card  added)  and  to  Fermi-Dirac  statistics  if  the 
drawn  card  is  not  replaced.  Hint:  Calculate  in  each  case  the  number  of  possible 
arrangements  of  the  balls  in  the  boxes.  First  do  the  problem  of  4  particles  in 
6  boxes  as  in  the  example,  and  then  do  N  particles  in  n  boxes  (n  >  N)  to  get 
the  results  in  Problem  19. 

21.  The  following  problem  arises  in  quantum  mechanics  (see  Chapter  13,  Problem  7.21). 
Find  the  number  of  ordered  triples  of  nonnegative  integers  a,  b,  c  whose  sum  a  +  b  +  c 
is  a  given  positive  integer  n.  (For  example,  if  n  =  2,  we  could  have  (a,  b,  c)  =  (2,  0,  0) 
or  (0,  2,  0)  or  (0,  0,  2)  or  (0,  1,  1)  or  (1,  0,  1)  or  (1,  1,  0).)  Hint:  Show  that  this 
is  the  same  as  the  number  of  distinguishable  distributions  of  n  identical  balls  in  3 
boxes,  and  follow  the  method  of  the  diagram  in  Example  5. 

22.  Suppose  13  people  want  to  schedule  a  regular  meeting  one  evening  a  week.  What 
is  the  probability  that  there  is  an  evening  when  everyone  is  free  if  each  person  is 
already  busy  one  evening  a  week? 

23.  Do  Problem  22  if  one  person  is  busy  3  evenings,  one  is  busy  2  evenings,  two  are  each 
busy  one  evening,  and  the  rest  are  free  every  evening. 


5.  RANDOM  VARIABLES 

In  the  problem  of  tossing  two  dice  (Example  2,  Section  2),  we  may  be  more  interested 
in  the  value  of  the  sum  of  the  numbers  on  the  two  dice  than  we  are  in  the  individual 
numbers.  Let  us  call  this  sum  x\  then  for  each  point  of  the  sample  space  in  (2.4),  x 
has  a  value.  For  example,  for  the  point  2,1,  we  have  x  =  2  + 1  =  3;  for  the  point  6,2, 
we  have  x  =  8,  etc.  Such  a  variable,  x ,  which  has  a  definite  value  for  each  sample 
point,  is  called  a  random  variable.  We  can  easily  construct  many  more  examples 
of  random  variables  for  the  sample  space  (2.4);  here  are  a  few  (Can  you  construct 
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some  more?): 

x  =  number  on  first  die  minus  number  on  second; 
x  =  number  on  second  die; 

x  =  probability  p  associated  with  the  sample  point; 
f  1  if  the  sum  is  7  or  1 1 . 

X  ——  < 

I  0  otherwise. 


For  each  of  these  random  variables  x1  we  could  set  up  a  table  listing  all  the  sample 
points  in  (2.4)  and,  next  to  each  sample  point,  the  corresponding  value  of  x.  This 
table  may  remind  you  of  the  tables  of  values  we  could  use  in  plotting  the  graph 
of  a  function.  In  analytical  geometry  or  in  a  physics  problem,  knowing  x  as  a 
function  of  t  means  that  for  any  given  t  we  can  find  the  corresponding  value  of  x. 
In  probability  the  sample  point  corresponds  to  the  independent  variable  t;  given 
the  sample  point,  we  can  find  the  corresponding  value  of  the  random  variable  x  if 
we  are  given  a  description  of  x  (for  example,  x  =  the  sum  of  numbers  on  dice).  The 
“description”  corresponds  to  the  formula  x(t)  that  we  use  in  plotting  a  graph  in 
analytic  geometry.  Thus  we  may  say  that  a  random  variable  x  is  a  function  defined 
on  a  sample  space. 


Probability  Functions  Let  us  consider  further  the  random  variable  x  -  “sum 
of  numbers  on  dice”  for  a  toss  of  two  dice  [sample  space  (2.4)].  We  note  that  there 
are  several  sample  points  for  which  x  =  5,  namely  the  points  marked  a  in  (2.4). 
Similarly,  there  are  several  sample  points  for  most  of  the  other  values  of  x.  It  is  then 
convenient  to  lump  together  all  the  sample  points  corresponding  to  a  given  value 
of  x,  and  consider  a  new  sample  space  in  which  each  point  corresponds  to  one  value 
of  x\  this  is  the  sample  space  (2.5).  The  probability  associated  with  each  point 
of  the  new  sample  space  is  obtained  as  in  Section  2,  by  adding  the  probabilities 
associated  with  all  the  points  in  the  original  sample  space  corresponding  to  the 
particular  value  of  x.  Each  value  of  x,  say  Xi,  has  a  probability  pt  of  occurrence; 
we  may  write  pi  =  f{xf)  =  probability  that  x  =  Xi,  and  call  the  function  f{x)  the 
probability  function  for  the  random  variable  x.  In  (2.5)  we  have  listed  on  the  first 
line  the  values  of  x  and  on  the  second  line  the  values  of  f(x).  [In  this  problem,  x 
and  f(x)  take  on  only  a  finite  number  of  discrete  values;  in  some  later  problems 
they  will  take  on  a  continuous  set  of  values.]  We  could  also  exhibit  these  values 
graphically  (Figure  5.1). 
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Now  that  we  have  the  table  of  values  (2.5)  or  the  graph  (Figure  5.1)  to  describe 
the  random  variable  x  and  its  probability  function  f(x),  we  can  dispense  with  the 
original  sample  space  (2.4).  But  since  we  used  (2.4)  in  defining  what  is  meant  by 
a  random  variable,  let  us  now  give  another  definition  using  (2.5)  or  Figure  5.1.  We 
can  say  that  x  is  a  random  variable  if  it  takes  various  values  Xi  with  probabilities 
Pi  =  f(xi).  This  definition  may  explain  the  name  random  variable;  x  is  called  a 
variable  since  it  takes  various  values.  A  random  (or  stochastic)  process  is  one  whose 
outcome  is  not  known  in  advance.  The  way  the  two  dice  fall  is  such  an  unknown 
outcome,  so  the  value  of  x  is  unknown  in  advance,  and  we  call  x  a  random  variable. 

You  may  note  that  at  first  we  thought  of  x  as  a  dependent  variable  or  function 
with  the  sample  point  as  the  independent  variable.  Although  we  didn’t  say  much 
about  it,  there  was  also  a  value  of  the  probability  p  attached  to  each  sample  point, 
that  is  p  and  x  were  both  functions  of  the  sample  point.  In  the  last  paragraph,  we 
have  thought  of  x  as  an  independent  variable  with  p  as  a  function  of  x.  This  is 
quite  analogous  to  having  both  x  and  p  given  as  functions  of  t  and  eliminating  t 
to  obtain  p  as  a  function  of  x.  We  have  here  eliminated  the  sample  point  from 
the  forefront  of  our  discussion  in  order  to  consider  directly  the  probability  function 
p=  f(x). 


Example  1.  Let  x  =  number  of  heads  when  three  coins  are  tossed.  The  uniform  sample 
space  is  (2.3)  and  we  could  write  the  value  of  x  for  each  sample  point  in  (2.3). 
Instead,  let  us  go  immediately  to  a  table  of  x  and  p  =  f(x).  [Can  you  verify  this 
table  by  using  (2.3),  or  otherwise?] 


(5.1) 


x  0  12  3 

P=f(x )  I  I  I  I 


Other  terms  used  for  the  probability  function  p  =  f(x)  are:  probability  density 
function ,  frequency  function ,  or  probability  distribution  ( caution :  not  distribution 
function,  which  means  the  cumulative  distribution  as  we  will  discuss  later;  see  Fig¬ 
ure  5.2).  The  origins  of  these  terms  will  become  clearer  as  we  go  on  (Sections  6 
and  7)  but  we  can  get  some  idea  of  the  terms  frequency  and  distribution  from  (5.1). 
Suppose  we  toss  three  coins  repeatedly;  we  might  reasonably  expect  to  get  three 
heads  in  about  |  of  the  tosses,  two  heads  in  about  |  of  the  tosses,  etc.  That  is, 
each  value  of  p  =  f(x)  is  proportional  to  the  frequency  of  occurrence  of  that  value 
of  x — hence  the  term  frequency  function  (see  also  Section  7).  Again  in  (5.1),  imag¬ 
ine  four  boxes  labeled  x  =  0,  1,  2,  3,  and  put  a  marble  into  the  appropriate  box  for 
each  toss  of  three  coins.  Then  p  =  f{x)  indicates  approximately  how  the  marbles 
are  distributed  into  the  boxes  after  many  tosses — hence  the  term  distribution. 


Mean  Value;  Standard  Deviation  The  probability  function  f(x)  of  a  ran¬ 
dom  variable  x  gives  us  detailed  information  about  it,  but  for  many  purposes  we 
want  a  simpler  description.  Suppose,  for  example,  that  x  represents  experimental 
measurements  of  the  length  of  a  rod,  and  that  we  have  a  large  number  N  of  mea¬ 
surements  Xi .  We  might  reasonably  take  pi  =  f(xi)  proportional  to  the  number  of 
times  Ni  we  obtained  the  value  Xi,  that  is  pi  =  Ni/N.  We  are  especially  interested 
in  two  numbers,  namely  a  mean  or  average  value  of  all  our  measurements,  and  some 
number  which  indicates  how  widely  the  original  set  of  values  spreads  out  about  that 
average.  Let  us  define  two  such  quantities  which  are  customarily  used  to  describe  a 
random  variable.  To  calculate  the  average  of  a  set  of  N  numbers,  we  add  them  and 
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divide  by  N.  Instead  of  adding  the  large  number  of  measurements,  we  can  multiply 
each  measurement  by  the  number  of  times  it  occurs  and  add  the  results.  This  gives 
for  the  average  of  the  measurements,  the  value 

^■Y/NiXi  =  ^2piXi. 
i  i 

By  analogy  with  this  calculation,  we  now  define  the  average  or  mean  value  p,  of  a 
random  variable  x  whose  probability  function  is  /( x)  by  the  equation 


(5.2) 


fj,  =  average  of  x 


Y,x*Pi = y ^xif(xj). 

i  i 


To  obtain  a  measure  of  the  spread  or  dispersion  of  our  measurements,  we  might 
first  list  how  much  each  measurement  differs  from  the  average.  Some  of  these 
deviations  are  positive  and  some  are  negative;  if  we  average  them,  we  get  zero 
(Problem  10).  Instead,  let  us  square  each  deviation  and  average  the  squares.  We 
define  the  variance  of  a  random  variable  x  by  the  equation 


(5.3)  Var(x)  =  ^(x*  -  p)2f(xi). 

i 


(The  variance  is  sometimes  called  the  dispersion.)  If  nearly  all  the  measurements  Xi 
are  very  close  to  p,  then  Var(a;)  is  small;  if  the  measurements  are  widely  spread, 
Var(x)  is  large.  Thus  we  have  a  number  which  indicates  the  spread  of  the  mea¬ 
surements;  this  is  what  we  wanted.  The  square  root  of  Var(x),  called  the  standard 
deviation  of  x,  is  often  used  instead  of  Var(x): 


(5.4) 


<jx  =  standard  deviation  of  x  =  ar{x). 


Example  2.  For  the  data  in  (5.1)  we  can  compute: 

By  (5.2),  p  =  average  ofx  =  0-  i  +  l-  |  +  2-  |  +  3-  i  =  ^  =  |. 

By  (5.3),  Var(x)  =  (0  -  |)2  •  I  +  (l  -  |)2  •  |  +  (2  -  |)2  •  |  +  (3  -  |)2  •  | 

-9  1  i  1  .  3  ,  1  .  3  ,  9  .  1  _  3 

4  8  ”r  4  8  ”r  4  8  ”r  4  8  4' 

By  (5.4),  ax  =  standard  deviation  of  x  =  i/Var(x)  =  \  ^3. 

The  mean  or  average  value  of  a  random  variable  x  is  also  called  its  expectation 
or  its  expected  value  or  (especially  in  quantum  mechanics)  its  expectation  value. 
Instead  of  /x,  the  symbols  x  or  E{x)  or  (x)  may  be  used  to  denote  the  mean  value 
of  x. 
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(5.5) 

x  =  E(x)  =  (x)  =  PL  =  Xif(Xi). 

i 

The  term  expectation  comes  from  games  of  chance. 

Example  3.  Suppose  you  will  be  paid  $5  if  a  die  shows  a  5,  $2  if  it  shows  a  2  or  a  3, 
and  nothing  otherwise.  Let  x  represent  your  gain  in  playing  the  game.  Then  the 
possible  values  of  x  and  the  corresponding  probabilities  are  x  =  5  with  p  =  i ,  x  =  2 
with  p  =  and  x  =  0  with  p  =  \.  We  find  for  the  average  or  expectation  of  x: 

E(x)  =  ^  XiPi  =  $5  ■  -  +  $2  •  -  +  $0  •  -  =  $1.50. 

If  you  play  the  game  many  times,  this  is  a  reasonable  estimate  of  your  average  gain 
per  game;  this  is  what  your  expectation  means.  It  is  also  a  reasonable  amount  to 
pay  as  a  fee  for  each  game  you  play.  The  term  expected  value  (which  means  the  same 
as  expectation  or  average)  may  be  somewhat  confusing  and  misleading  if  you  try  to 
interpret  “expected”  in  an  everyday  sense.  Note  that  the  expected  value  ($1.50)  of  x 
is  not  one  of  the  possible  values  of  x,  so  you  cannot  ever  “expect”  to  have  x  =  $1.50. 
If  you  think  of  expected  value  as  a  technical  term  meaning  the  same  as  average, 
then  there  is  no  difficulty.  Of  course,  in  some  cases,  it  makes  reasonable  sense  with 
its  everyday  meaning;  for  example,  if  a  coin  is  tossed  n  times,  the  expected  number 
of  heads  is  n/2  (Problem  11)  and  it  is  true  that  we  may  reasonably  “expect”  a  fair 
approximation  to  this  result  (see  Section  7). 

Cumulative  Distribution  Functions  So  far  we  have  been  using  the  probability 
function  /( x)  which  gives  the  probability  p,;  =  f(xf)  that  x  is  exactly  Xj.  In  some 
problems  we  may  be  more  interested  in  the  probability  that  x  is  less  than  some 
particular  value.  For  example,  in  an  election  we  would  like  to  know  the  probability 
that  less  than  half  the  votes  would  be  cast  for  the  opposing  candidate,  that  is,  that 
our  candidate  would  win.  In  an  experiment  on  radioactivity,  we  would  like  to  know 
the  probability  that  the  background  radiation  always  remains  below  a  certain  level. 
Given  the  probability  function  /(x),  we  can  obtain  the  probability  that  x  is  less 
than  or  equal  to  a  certain  value  Xj  by  adding  all  the  probabilities  of  values  of  x  less 
than  or  equal  to  Xi.  For  example,  consider  the  sum  of  the  numbers  on  two  dice;  the 
probability  function  p  =  f(x)  is  plotted  in  Figure  5.1.  The  probability  that  x  is, 
say,  less  than  or  equal  to  4  is  the  sum  of  the  probabilities  that  x  is  2  or  3  or  4,  that 
is,  +  ^  +  Similarly,  we  could  find  the  probability  that  x  is  less  than 

or  equal  to  any  given  number.  The  resulting  function  of  x  is  plotted  in  Figure  5.2. 
Such  a  function  F{x)  is  called  a  cumulative  distribution  function;  we  can  write 


(5.6)  F(xi)  =  (probability  that  x  <  Xi)  =  ^  f{xj)- 

Xj  <Xi 


Note  carefully  that,  although  the  probability  function  f(x)  may  be  referred  to  as  a 
probability  distribution ,  the  term  distribution  function  means  the  cumulative  distri¬ 
bution  F(x). 
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PROBLEMS,  SECTION  5 

Set  up  sample  spaces  for  Problems  1  to  7  and  list  next  to  each  sample  point  the  value  of 
the  indicated  random  variable  x,  and  the  probability  associated  with  the  sample  point. 
Make  a  table  of  the  different  values  Xi  of  x  and  the  corresponding  probabilities  pt  =  f(xi). 
Compute  the  mean,  the  variance,  and  the  standard  deviation  for  x.  Find  and  plot  the 
cumulative  distribution  function  F(x). 

1.  Three  coins  are  tossed;  x  =  number  of  heads  minus  number  of  tails. 

2.  Two  dice  are  thrown;  x  =  sum  of  the  numbers  on  the  dice. 

3.  A  coin  is  tossed  repeatedly;  x  =  number  of  the  toss  at  which  a  head  first  appears. 

4.  Suppose  that  Martian  dice  are  4-sided  (tetrahedra)  with  points  labeled  1  to  4.  When 
a  pair  of  these  dice  is  tossed,  let  x  be  the  product  of  the  two  numbers  at  the  tops  of 
the  dice  if  the  product  is  odd;  otherwise  x  =  0. 

5.  A  random  variable  x  takes  the  values  0,  1,  2,  3,  with  probabilities  yy,  yy, 

6.  A  card  is  drawn  from  a  shuffled  deck.  Let  x  =  10  if  it  is  an  ace  or  a  face  card; 
x  =  —1  if  it  is  a  2;  and  x  =  0  otherwise. 

7.  A  weighted  coin  with  probability  p  of  coming  down  heads  is  tossed  three  times;  x  = 
number  of  heads  minus  number  of  tails. 

8.  Would  you  pay  $10  per  throw  of  two  dice  if  you  were  to  receive  a  number  of  dollars 
equal  to  the  product  of  the  numbers  on  the  dice?  Hint:  What  is  your  expectation? 
If  it  is  more  than  $10,  then  the  game  would  be  favorable  for  you. 

9.  Show  that  the  expectation  of  the  sum  of  two  random  variables  defined  over  the 
same  sample  space  is  the  sum  of  the  expectations.  Hint:  Let  pi,  p2,  •  •  ■ ,  pn  be  the 
probabilities  associated  with  the  n  sample  points;  let  xi,  X2,  ■■■,  x„,  and  y i,  j/2, 
•  •  •  ,  yn,  be  the  values  of  the  random  variables  x  and  y  for  the  n  sample  points.  Write 
out  E(x),  E(y),  and  E(x  +  y). 

10.  Let  p  be  the  average  of  the  random  variable  x.  Then  the  quantities  ( Xi  —  p)  are  the 
deviations  of  x  from  its  average.  Show  that  the  average  of  these  deviations  is  zero. 
Hint:  Remember  that  the  sum  of  all  the  pi  must  equal  1. 

11.  Show  that  the  expected  number  of  heads  in  a  single  toss  of  a  coin  is  i.  Show  in  two 

ways  that  the  expected  number  of  heads  in  two  tosses  of  a  coin  is  1: 

(a)  Let  x  =  number  of  heads  in  two  tosses  and  find  x. 

(b)  Let  x  =  number  of  heads  in  toss  1  and  y  =  number  of  heads  in  toss  2;  find  the 

average  of  x  +  y  by  Problem  9.  Use  this  method  to  show  that  the  expected 
number  of  heads  in  n  tosses  of  a  coin  is  |n. 
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12.  Use  Problem  9  to  find  the  expected  value  of  the  sum  of  the  numbers  on  the  dice  in 
Problem  2. 

13.  Show  that  adding  a  constant  K  to  a  random  variable  increases  the  average  by  K 
but  does  not  change  the  variance.  Show  that  multiplying  a  random  variable  by  K 
multiplies  both  the  average  and  the  standard  deviation  by  K. 

14.  As  in  Problem  11,  show  that  the  expected  number  of  5’s  in  n  tosses  of  a  die  is  n/6. 

15.  Use  Problem  9  to  find  x  in  Problem  7. 

16.  Show  that  a2  =  E( x2)  —  yi2.  Hint:  Write  the  definition  of  a2  from  (5.3)  and  (5.4) 
and  use  Problems  9  and  13. 

17.  Use  Problem  16  to  find  a  in  Problems  2,  6,  and  7. 

6.  CONTINUOUS  DISTRIBUTIONS 

In  Section  5,  we  discussed  random  variables  x  which  took  a  discrete  set  of  values  Xi. 
It  is  not  hard  to  think  of  cases  in  which  a  random  variable  takes  a  continuous  set 
of  values. 

Example  1.  Consider  a  particle  moving  back  and  forth  along  the  x  axis  from  x  =  0  to 
x  =  l,  rebounding  elastically  at  the  turning  points  so  that  its  speed  is  constant. 
(This  could  be  a  simple-minded  model  of  an  alpha  particle  in  a  radioactive  nucleus, 
or  of  a  gas  molecule  bouncing  back  and  forth  between  the  walls  of  a  container.)  Let 
the  position  x  of  the  particle  be  the  random  variable;  then  x  takes  a  continuous  set 
of  values  from  x  =  0  to  x  =  l.  Now  suppose  that,  following  Section  5,  we  ask  for 
the  probability  that  the  particle  is  at  a  particular  point  x\  this  probability  must  be 
the  same,  say  k,  for  all  points  (because  the  speed  is  constant).  In  Section  5,  with  a 
finite  number  of  points,  we  would  say  k  =  1/N.  In  the  continuous  case,  there  are 
an  infinite  number  of  points  so  we  would  find  k  =  0,  that  is,  the  probability  that 
the  particle  is  at  a  given  point)  must  be  zero.  But  this  is  not  a  very  useful  result. 
Let  us  instead  divide  (0,1)  into  small  intervals  dx ;  since  the  particle  has  constant 
speed,  the  time  it  spends  in  each  dx  is  proportional  to  the  length  of  dx.  In  fact, 
since  the  particle  spends  the  fraction  (dx)/l  of  its  time  in  a  given  interval  dx,  the 
probability  of  finding  it  in  dx  is  just  ( dx)/l . 


Figure  6.1 
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Comparison  of  Discrete  and  Continuous  Probability  Functions  To  see 

how  to  define  a  probability  function  for  the  continuous  case  and  to  correlate  this 
discussion  with  the  discrete  case,  let  us  return  for  a  moment  to  Figure  5.1.  There 
we  plotted  a  vertical  distance  to  represent  the  probability  p  =  f(x)  of  each  value 
of  x.  Instead  of  a  dot  (as  in  Figure  5.1)  to  indicate  p  for  each  x,  let  us  now  draw  a 
horizontal  line  segment  of  length  1  centered  on  each  dot,  as  in  Figure  6.1.  Then  the 
area  under  the  horizontal  line  segment  at  a  particular  x*  is  f(xi)  ■  1  =  f(xi)  =  pi 
(since  the  length  of  each  horizontal  line  segment  is  1),  and  we  could  use  this  area 
instead  of  the  ordinate  as  a  measure  of  the  probability.  Such  a  graph  is  called  a 
histogram. 


Example  2.  Now  let  us  apply  this  area  idea  to  Example  1. 
plotted  the  function 


Consider  Figure  6.2.  We  have 


f(x) 


l/l,  0  <  x  <  l, 

0,  x  <  0  and  x  >  l. 


If  we  consider  any  interval  x  to  x  +  dx 
on  (0,  l),  the  area  under  the  curve  f(x)  = 
l/l  for  this  interval  is  (l/l)  dx  or  f(x)  dx, 
and  this  is  just  the  probability  that  the 
particle  is  in  this  interval.  The  proba¬ 
bility  that  the  particle  is  in  some  longer 
subinterval  of  (0,  l),  say  (a,  6),  is  (b  —  a)/l 


(x  +  dx) 


or  f/  f(x)  dx,  that  is,  the  area  under  the  Figure  6.2 

curve  from  a  to  b.  If  the  interval  (a,  b)  is 

outside  (0,1),  then  f/  f(x)  dx  =  0  since  f(x)  is  zero,  and  again  this  is  the  correct 
value  of  the  probability  of  finding  the  particle  on  the  given  interval. 


When  f(x)  is  constant  over  an  interval  (as  in  Figure  6.2),  we  say  that  x  is 
uniformly  distributed  on  that  interval.  Let  us  consider  an  example  in  which  f(x) 
is  not  constant. 


Example  3.  This  time  suppose  the  particle  of  Example  1  is  sliding  up  and  down  an  inclined 
plane  (no  friction)  rebounding  elastically  (no  energy  loss)  against  a  spring  at  the 
bottom  and  reaching  zero  speed  at  height  y  =  h  (Figure  6.3).  The  total  energy, 
namely  |m»2  +  mgy  is  constant  and  equal  to  mgh  since  v  =  0  at  y  =  h.  Thus  we 
have 


(6.1) 


v2  =  —  ( mgh  -  mgy)  =  2 g(h  -  y). 
m 


The  probability  of  finding  the  particle  within  an  interval  dy  at  a  given  height  y 
is  proportional  to  the  time  dt  spent  in  that  interval.  From  v  =  ds/dt ,  we  have 
dt  =  (ds)/v,  from  Figure  6.3,  we  find  ds  =  (dy)  esc  a.  Combining  these  with  (6.1) 
we  have 

^  ds  (dy)  esc  a 
v  y/2gVh  -  y 

Since  the  probability  f(y)  dy  of  finding  the  particle  in  the  interval  dy  at  height  y 
is  proportional  to  dt,  we  can  drop  the  constant  factor  (esc  a)/y/2g,  and  say  that 
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f(y)  dy  is  proportional  to  dy/\Jh  —  y.  In  order  to  find  f(y),  we  must  multiply  by  a 
constant  factor  which  makes  the  total  probability  /)|'  f(y)  dy  equal  to  1  since  this  is 
the  probability  that  the  particle  is  somewhere.  You  can  easily  verify  that 

f(y)  dy  =  — -j=  =  or  f(y)  =  -  = 

W  2 VhVh^y  2 y/h(h  -  y) 

A  graph  of  f(y)  is  plotted  in  Figure  6.4.  Note  that  although  f(y)  becomes  infinite 
at  y  =  h,  the  area  under  the  f(y)  curve  for  any  interval  is  finite;  this  area  represents 
the  probability  that  the  particle  is  in  that  height  interval. 


Figure  6.4 

We  can  now  extend  the  definitions  of  mean  (expectation),  variance,  standard 
deviation,  and  cumulative  distribution  function  to  the  continuous  case.  Let  f{x)  be 
a  probability  density  function;  remember  that  f_oa  f(x )  dx  =  1  just  as  Pi  =  1- 
The  average  of  a  random  variable  x  with  probability  density  function  f(x)  is 


(6.2) 


y  =  x  =  E(  x) 


xf(x)  dx. 


(In  writing  the  limits  — oo,oo  here,  we  assume  that  f(x)  is  defined  to  be  zero  on 
intervals  where  the  probability  is  zero.)  Note  that  (6.2)  is  a  natural  extension  of 
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the  sum  in  (5.5).  Having  found  the  mean  of  x,  we  now  define  the  variance  as  in 
Section  5  as  the  average  of  (x  —  /x)2,  that  is, 


(6.3) 


/OO 

(x  -  /x)2/(x) 

-OO 


dx  =  az. 


As  before,  the  standard  deviation  ax  is  the  square  root  of  the  variance.  Finally, 
the  cumulative  distribution  function  F(x)  gives  for  each  x  the  probability  that  the 
random  variable  is  less  than  or  equal  to  that  x.  But  this  probability  is  just  the  area 
under  the  /(x)  curve  from  — oo  up  to  the  point  x.  Also,  of  course,  the  integral  of 
/(x)  from  —  oo  to  oo  must  =  1  since  that  is  the  total  probability  for  all  values  of  x. 
Thus  we  have 


(6.4) 


/x  /»oo 

f(u)du ,  /  f(x)dx  =  F(  oo)  =  1. 

-OO  J  —  oo 


Example  4.  For  the  problem  in  Example  3,  we  find: 

By  (6.2),  ny  =  f  yf{y)  dy  =  — f  y  J- - dy  =  \h. 

Jo  2  y/h  Jo  Vh-y  3 

rh  .  rh  /  q  \ 2  1  Ah2 

By  (6.3),  Var  id)  =  J  id  ~  dy)  f  (y)  dy  =  j  \y  -  -hj  -j==dy  =  —, 

so  standard  deviation  ay  =  yVa r(y)  =  2h/v/45. 

rv 

By  (6.4),  cumulative  distribution  function  F(y)  =  /  f{u)  du 

Jo 

1  rv  du 

2\/h  Jo  Vh  —  u 


Why  “density  function”?  In  Section  5,  we  mentioned  that  the  probability  func¬ 
tion  /(x)  is  often  called  the  probability  density.  We  can  now  explain  why.  Con¬ 
sider  (6.2).  If  /(x)  represents  the  density  (mass  per  unit  length)  of  a  thin  rod,  then 
the  center  of  mass  of  the  rod  is  given  by  [see  Chapter  5,  (3.3)] 


(6.5) 


'=  J  x/(x)  dxj  J  f(x)  dx, 


where  the  integrals  are  over  the  length  of  the  rod,  or  from  —  oo  to  oo  as  in  (6.2) 
with  /(x)  =  0  outside  the  rod.  But  in  (6.2),  f  /(x)  dx  is  the  total  probability  that 
x  has  some  value,  and  so  this  integral  is  equal  to  1.  Then  (6.5)  and  (6.2)  are  really 
the  same;  we  see  that  it  is  reasonable  to  call  /(x)  a  density,  and  also  that  the  mean 
of  x  corresponds  to  the  center  of  mass  of  a  linear  mass  distribution  of  density  /(x). 
In  a  similar  way,  we  can  interpret  (6.3)  as  giving  the  moment  of  inertia  of  the  mass 
distribution  about  the  center  of  mass  (see  Chapter  5,  Section  3). 
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Joint  Distributions  We  can  easily  generalize  the  ideas  and  formulas  above  to 
two  (or  more)  dimensions.  Suppose  we  have  two  random  variables  x  and  y\  we 
define  their  joint  probability  density  function  f(x,y)  so  that  f(xi,yj)dxdy  is  the 
probability  that  the  point  (x,y)  is  in  an  element  of  area  dxdy  at  x  =  Xi,  y  =  yj. 
Then  the  probability  that  the  point  ( x ,  y)  is  in  a  given  region  of  the  ( x ,  y)  plane,  is 
the  integral  of  f(x,y)  over  that  area.  The  average  or  expected  values  of  x  and  y , 
the  variances  and  standard  deviations  of  x  and  y,  and  the  covariance  of  x,  y  (see 
Problems  13  to  16)  are  given  by 


(6.6) 


X  = 


y  = 


/oo  poo 

/  xf(x,y)  dxdy, 

-oo  J — oo 

/oo 

yf(x,y)dxdy , 

-oo 

/oo  poo 

/  (x -x)2 f(x,y)  dxdy  =  crl, 

-oo  J  —  oo 

/oo  poo 

/  (y  -y)2f(x,y)dxdy  =  a2, 

-oo  J — oo 

POO  POO 

Co  v(x,y)=  /  (x-x)(y-y)f(x,y)dxdy. 


•  —  oo  «/  — oo 


You  should  see  that  these  are  generalizations  of  (6.2)  and  (6.3);  that  (6.6)  can  be 
interpreted  as  giving  the  coordinates  of  the  center  of  mass  and  the  moments  of 
inertia  of  a  two-dimensional  mass  distribution;  and  that  similar  formulas  can  be 
written  for  three  (or  more)  random  variables  (that  is,  in  three  or  more  dimensions). 
Also  note  that  the  formulas  in  (6.6)  could  be  written  in  terms  of  polar  coordinates 
(see  Problems  6  to  9). 

We  have  discussed  a  number  of  probability  distributions  both  discrete  and  con¬ 
tinuous,  and  you  will  find  others  in  the  problems.  We  will  discuss  three  very  impor¬ 
tant  named  distributions  (binomial,  normal,  and  Poisson)  in  the  following  sections. 
Learning  about  these  and  related  graphs,  formulas,  and  terminology  should  make 
it  possible  for  you  to  cope  with  any  of  the  many  other  named  distributions  you  find 
in  texts,  reference  books,  and  computer  programs. 


PROBLEMS,  SECTION  6 

1.  (a)  Find  the  probability  density  function  f(x)  for  the  position  i  of  a  particle 

which  is  executing  simple  harmonic  motion  on  (—a,  a)  along  the  x  axis.  (See 
Chapter  7,  Section  2,  for  a  discussion  of  simple  harmonic  motion.)  Hint:  The 
value  of  x  at  time  t  is  x  =  a  cos  uit.  Find  the  velocity  dx/dt\  then  the  probability 
of  finding  the  particle  in  a  given  dx  is  proportional  to  the  time  it  spends  there 
which  is  inversely  proportional  to  its  speed  there.  Don’t  forget  that  the  total 
probability  of  finding  the  particle  somewhere  must  be  1. 

(b)  Sketch  the  probability  density  function  f(x)  found  in  part  (a)  and  also  the 
cumulative  distribution  function  F(x)  [see  equation  (6.4)]. 

(c)  Find  the  average  and  the  standard  deviation  of  x  in  part  (a). 

2.  It  is  shown  in  the  kinetic  theory  of  gases  that  the  probability  for  the  distance  a 
molecule  travels  between  collisions  to  be  between  x  and  x  +  dx,  is  proportional  to 
e~x'x  dx,  where  A  is  a  constant.  Show  that  the  average  distance  between  collisions 
(called  the  “mean  free  path”)  is  A.  Find  the  probability  of  a  free  path  of  length  >  2A. 
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3.  A  ball  is  thrown  straight  up  and  falls  straight  back  down.  Find  the  probability 
density  function  f(h)  so  that  f(h)  dh  is  the  probability  of  finding  it  between  height 
h  and  h  +  dh.  Hint:  Look  at  Example  3. 

4.  In  Problem  1  we  found  the  probability  density  function  for  a  classical  harmonic 
oscillator.  In  quantum  mechanics,  the  probability  density  function  for  a  harmonic 
oscillator  (in  the  ground  state)  is  proportional  to  e  a  s  ,  where  a  is  a  constant  and 
x  takes  values  from  — oo  to  oo.  Find  f(x)  and  the  average  and  standard  deviation 
of  x.  (In  quantum  mechanics,  the  standard  deviation  of  x  is  called  the  uncertainty 
in  position  and  is  written  Ax.) 

5.  The  probability  for  a  radioactive  particle  to  decay  between  time  t  and  time  t  +  dt  is 
proportional  to  e~  4.  Find  the  density  function  f(t)  and  the  cumulative  distribution 
function  F(t).  Find  the  expected  lifetime  (called  the  mean  life)  of  the  radioactive 
particle.  Compare  the  mean  life  and  the  so-called  “half  life”  which  is  defined  as  the 
value  of  t  when  e~xt  =  1/2. 

6.  A  circular  garden  bed  of  radius  1  m  is  to  be  planted  so  that  N  seeds  are  uniformly 
distributed  over  the  circular  area.  Then  we  can  talk  about  the  number  n  of  seeds  in 
some  particular  area  A,  or  we  can  call  n/N  the  probability  for  any  one  particular 
seed  to  be  in  the  area  A.  Find  the  probability  F(r)  that  a  seed  (that  is,  some 
particular  seed)  is  within  r  of  the  center.  (Hint:  What  is  F(l)?)  Find  f(r)dr,  the 
probability  for  a  seed  to  be  between  r  and  r  +  dr  from  the  center.  Find  r  and  a. 

7.  (a)  Repeat  Problem  6  where  the  “circular”  area  is  now  on  the  curved  surface  of  the 

earth,  say  all  points  at  distance  s  from  Chicago  (measured  along  a  great  circle 
on  the  earth’s  surface)  with  s  <  ttR/3  where  R  =  radius  of  the  earth.  The 
seeds  could  be  replaced  by,  say,  radioactive  fallout  particles  (assuming  these  to 
be  uniformly  distributed  over  the  surface  of  the  earth).  Find  F(s)  and  f(s). 

(b)  Also  find  F(s)  and  f(s)  if  s  <  1  <g  R  (say  s  <  1  mile  where  R  =  4000  miles). 
Do  your  answers  then  reduce  to  those  in  Problem  6? 

8.  Given  that  a  particle  is  inside  a  sphere  of  radius  1,  and  that  it  has  equal  probabilities 
of  being  found  in  any  two  volume  elements  of  the  same  size,  find  the  cumulative 
distribution  function  F(r)  for  the  spherical  coordinate  r,  and  from  it  find  the  density 
function  f(r).  Hint:  F(r)  is  the  probability  that  the  particle  is  inside  a  sphere  of 
radius  r.  Find  r  and  a. 

9.  A  hydrogen  atom  consists  of  a  proton  and  an  electron.  According  to  the  Bohr  theory, 
the  electron  revolves  about  the  proton  in  a  circle  of  radius  a  (a  =  5  ■  10~9cm  for 
the  ground  state).  According  to  quantum  mechanics,  the  electron  may  be  at  any 
distance  r  (from  0  to  oo)  from  the  proton;  for  the  ground  state,  the  probability  that 
the  electron  is  in  a  volume  element  dV,  at  a  distance  r  to  r  +  dr  from  the  proton, 
is  proportional  to  e~2r^adV,  where  a  is  the  Bohr  radius.  Write  dV  in  spherical 
coordinates  (see  Chapter  5,  Section  4)  and  find  the  density  function  f(r)  so  that 
f(r)  dr  is  the  probability  that  the  electron  is  at  a  distance  between  r  and  r  +  dr 
from  the  proton.  (Remember  that  the  probability  for  the  electron  to  be  somewhere 
must  be  1.)  Computer  plot  f(r)  and  show  that  its  maximum  value  is  at  r  =  a;  we 
then  say  that  the  most  probable  value  of  r  is  a.  Also  show  that  the  average  value 
of  r-1  is  a-1. 

10.  Do  Problem  5.10  for  a  continuous  distribution. 

11.  Do  Problem  5.13  for  a  continuous  distribution. 

12.  Do  Problem  5.16  for  a  continuous  distribution. 

13.  Given  a  joint  distribution  function  f(x,  y)  as  in  (6.6),  show  that  E(x  +  y)  =  E(x)  + 
E(y)  and  Var(:r  +  y)  =  Var(:r)  +  Var(y)  +  2  Cov(x,y). 
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14.  Recall  that  two  events  A  and  B  are  called  independent  if  p(AB)  =  p(A)p(B).  Sim¬ 
ilarly  two  random  variables  x  and  y  are  called  independent  if  the  joint  probability 
function  f(x,  y)  =  g(x)h(y).  Show  that  if  x  and  y  are  independent,  then  the  expec¬ 
tation  or  average  of  xy  is  E{xy)  =  E(x)E(y)  =  pxpy- 

15.  Show  that  the  covariance  of  two  independent  (see  Problem  14)  random  variables  is 
zero,  and  so  by  Problem  13,  the  variance  of  the  sum  of  two  independent  random 
variables  is  equal  to  the  sum  of  their  variances. 

16.  By  Problem  15,  if  x  and  y  are  independent,  then  Co v(x,y)  =  0.  The  converse 
is  not  always  true,  that  is,  if  Cov(x,y)  =  0,  it  is  not  necessarily  true  that  the 
joint  distribution  function  is  of  the  form  f(x,y)  =  g(x)h(y).  For  example,  suppose 
f(x,y)  =  (3 y2  +cosa:)/4  on  the  rectangle  —  -k/2  <  x  <  n/2,—1  <  y  <  1,  and 
f(x,y)  =  0  elsewhere.  Show  that  Cov(x,y)  =  0,  but  x  and  y  are  not  independent, 
that  is,  f(x,  y)  is  not  of  the  form  g(x)h(y).  Can  you  construct  some  more  examples? 

7.  BINOMIAL  DISTRIBUTION 

Example  1.  Let  a  coin  be  tossed  5  times;  what  is  the  probability  of  exactly  3  heads  out  of 
the  5  tosses?  We  can  represent  any  sequence  of  5  tosses  by  a  symbol  such  as  thhth. 
The  probability  of  this  particular  sequence  (or  any  other  particular  sequence)  is 
(^)5  since  the  tosses  are  independent  (see  Example  1  of  Section  3).  The  number  of 
such  sequences  containing  3  heads  and  2  tails  is  the  number  of  ways  we  can  select  3 
positions  out  of  5  for  heads  (or  2  for  tails),  namely  C(5,3).  Hence,  the  probability 
of  exactly  3  heads  in  5  tosses  of  a  coin  is  C'(5,3)(i)5.  Suppose  a  coin  is  tossed 
repeatedly,  say  n  times;  let  x  be  the  number  of  heads  in  the  n  tosses.  We  want  to 
find  the  probability  density  function  p  =  /( x)  which  gives  the  probability  of  exactly 
x  heads  in  n  tosses.  By  generalizing  the  case  of  3  heads  in  5  tosses,  we  see  that 

(7.1)  /(*)  =  £?(«,  *)(£)" 

Example  2.  Let  us  do  a  similar  problem  with  a  die,  asking  this  time  for  the  probability  of 
exactly  3  aces  in  5  tosses  of  the  die.  If  A  means  ace  and  N  not  ace,  the  probability 
of  a  particular  sequence  such  as  ANN AA  is  ^  |  |  ^  ^  since  the  probability  of 
A  is  i,  the  probability  of  N  is  |,  and  the  tosses  are  independent.  The  number  of 
such  sequences  containing  3  A’s  and  2  N’s  is  C( 5,  3);  thus  the  probability  of  exactly 
3  aces  in  5  tosses  of  a  die  is  C(5, 3)(|)3(|)2.  Generalizing  this,  we  find  that  the 
probability  of  exactly  x  aces  in  n  tosses  of  a  die  is 

(7.2)  /(*)  =  (?(«,  *)(!)*(§)"-*. 

Bernoulli  Trials  In  the  two  examples  we  have  just  done,  we  have  been  concerned 
with  repeated  independent  trials,  each  trial  having  two  possible  outcomes  ( h  or  t , 
A  or  N)  of  given  probability.  There  are  many  examples  of  such  problems;  let’s 
consider  a  few.  A  manufactured  item  is  good  or  defective;  given  the  probability 
of  a  defect  we  want  the  probability  of  x  defectives  out  of  n  items.  An  archer  has 
probability  p  of  hitting  a  target;  we  ask  for  the  probability  of  x  hits  out  of  n  tries. 
Each  atom  of  a  radioactive  substance  has  probability  p  of  emitting  an  alpha  particle 
during  the  next  minute;  we  are  to  find  the  probability  that  x  alpha  particles  will  be 
emitted  in  the  next  minute  from  the  n  atoms  in  the  sample.  A  particle  moves  back 
and  forth  along  the  x  axis  in  unit  jumps;  it  has,  at  each  step,  equal  probabilities  of 
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Graphs  of  the  binomial  distribution,  f(x)  =  C{«,  x)pxcf  x 


Figure  7.3 


jumping  forward  or  backward.  (This  motion  is  called  a  random  walk ;  it  can  be  used 
as  a  model  of  a  diffusion  process.)  We  want  to  know  the  probability  that,  after  n 
jumps,  the  particle  is  at  a  distance 

d  =  number  x  of  positive  jumps  —  number  ( n  —  x)  of  negative  jumps, 

from  its  starting  point;  this  probability  is  the  probability  of  x  positive  jumps  out  of 
a  total  of  n  jumps. 

In  all  these  problems,  something  is  tried  repeatedly.  At  each  trial  there  are  two 
possible  outcomes  of  probabilities  p  (usually  called  the  probability  of  “success” )  and 

Binomial  distribution  graphs  of  nf{x)  plotted  against  x/n 


Figure  7.4 


Figure  7.5 
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q  =  1—  p  (where  q  =  probability  of  “failure”).  Such  repeated  independent  trials 
with  constant  probabilities  p  and  q  are  called  Bernoulli  trials. 

Binomial  Probability  Functions  Let  us  generalize  (7.1)  and  (7.2)  to  obtain  a 
formula  which  applies  to  any  similar  problem,  namely  the  probability  f(x)  of  exactly 
x  successes  in  n  Bernoulli  trials.  Reasoning  as  we  did  to  obtain  (7.1)  and  (7.2),  we 
find  that 


(7.3) 


f(x)  =  C(n,x)pxqn~x 


We  might  also  ask  for  the  probability  of  not  more  than  x  successes  in  n  trials.  This 
is  the  sum  of  the  probabilities  of  0, 1,  2,  •  •  •  ,  x  successes,  that  is,  it  is  the  cumula¬ 
tive  distribution  function  F{x)  for  the  random  variable  x  whose  probability  density 
function  is  (7.3)  [see  (5.6)].  We  can  write 

f{x)  =  m + m  +  •  •  • + /(*) 

=  C(n,  0 )p°qn  +  C{n,  1  )p1qn~l  +  •  •  •  +  C(n,  x)pxqn~x 

(7-4) 

=  J2C(n,u)puqn-u  =  J2 

u= 0  u— 0 

Observe  that  (7.3)  is  one  term  of  the  binomial  expansion  of  (p  +  q)n  and  (7.4) 
is  a  sum  of  several  terms  of  this  expansion  (see  Section  4,  Example  2).  For  this 
reason,  the  functions  f(x)  in  (7.1),  (7.2),  or  (7.3)  are  called  binomial  probability  (or 
density )  functions  or  binomial  distributions ,  and  the  function  F(x)  in  (7.4)  is  called 
a  binomial  cumulative  distribution  function. 

We  shall  find  it  very  useful  to  computer  plot  graphs  of  the  binomial  density 
function  f{x)  for  various  values  of  p  and  n.  (See  Figures  7.1  to  7.5  and  Problems  1 
to  8.)  Instead  of  a  point  at  y  =  /( x)  for  each  x,  we  plot  a  horizontal  line  segment  of 
length  1  centered  on  each  x  as  in  Figure  6.1;  the  probabilities  are  then  represented 
by  areas  under  the  broken  line,  rather  than  by  ordinates.  From  Figures  7.1  to  7.3 
and  similar  graphs,  we  can  draw  a  number  of  conclusions.  The  most  probable  value 
of  x  [corresponding  to  the  largest  value  of  f(x)}  is  approximately  x  =  np  (Problems 
10  and  11);  for  example  for  p  =  4,  the  most  probable  value  of  a;  is  4 n  for  even  n; 
for  odd  n,  there  are  two  consecutive  values  of  x,  namely  4(n  ±  1),  for  which  the 
probability  is  largest.  The  graphs  for  p  =  4  are  symmetric  about  x  =  \n.  For 
p  /  the  curve  is  asymmetric,  favoring  small  x  values  for  small  p  and  large  x 
values  for  large  p.  As  n  increases,  the  graph  of  f(x)  becomes  wider  and  flatter  (the 
total  area  under  the  graph  must  remain  1).  The  probability  of  the  most  probable 
value  of  x  decreases  with  n.  For  example,  the  most  probable  number  of  heads  in 
8  tosses  of  a  coin  is  4  with  probability  0.27;  the  most  probable  number  of  heads 
in  20  tosses  is  10  with  probability  0.17;  for  106  tosses,  the  probability  of  exactly 
500,000  heads  is  less  than  10-3. 

Let  us  redraw  Figures  7.1  and  7.2  plotting  nf{x)  against  the  relative  number  of 
successes  x/n  (Figures  7.4  and  7.5).  Since  this  change  of  scale  (ordinate  times  n, 
abscissa  divided  by  n)  leaves  the  area  unchanged,  we  can  still  use  the  area  to 
represent  probability.  Note  that  now  the  curves  become  narrower  and  taller  as  n 
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increases.  This  means  that  values  of  the  ratio  x/n  tend  to  cluster  about  their  most 
probable  value,  namely  np/n  =  p.  For  example,  if  we  toss  a  coin  repeatedly,  the 
difference  “number  of  heads  —  \  number  of  tosses”  is  apt  to  be  large  and  to  increase 
with  n  (Figures  7.1  and  7.2),  but  the  ratio  “number  of  heads  number  of  tosses” 
is  apt  to  be  closer  and  closer  to  \  as  n  increases  (Figures  7.4  and  7.5).  It  is  for  this 
reason  that  we  can  use  experimentally  determined  values  of  x/n  as  a  reasonable 
estimate  of  p. 

Chebyshev’s  Inequality  This  is  a  simple  but  very  general  result  which  we  will 
find  useful.  We  consider  a  random  variable  x  with  probability  function  f{x),  and 
let  p  be  the  mean  value  and  a  the  standard  deviation  of  x.  We  are  going  to  prove 
that  if  we  select  any  number  t,  the  probability  that  x  differs  from  its  mean  value  p 
by  more  than  t,  is  less  than  a2 /t2.  This  means  that  x  is  unlikely  to  differ  from  p 
by  more  than  a  few  standard  deviations;  for  example,  if  t  is  twice  the  standard 
deviation  cr,  we  find  that  the  probability  for  x  to  differ  from  p  by  more  than  2er  is 
less  than  a 2 /t2  =  cr2 /{2a)2  =  The  proof  is  simple.  By  definition  of  <r,  we  have 

where  the  sum  is  over  all  x.  Then  if  we  sum  just  over  the  values  of  x  for  which 
\x  —  p\  >  t,  we  get  less  than  er2: 

(7.5)  a2  >  Y  (x~P)2f(x)- 


If  we  replace  each  x  —  p  by  the  number  t  in  (7.5),  the  sum  is  decreased,  so  we  have 
(7.6)  cr2  >  Y  t2f(x )  =  <2  /(*)  or  <  ^2  • 

\x— fi\~>t  |a;— /it  |  >t  |  a; — /x| 

But  )C|x-ai| >t  f(x)  Jusf  the  sum  of  all  probabilities  of  x  values  which  differ  from  p 
by  more  than  t ,  and  (7.6)  says  that  this  probability  is  less  than  u2/t2,  as  we  claimed. 


Laws  of  Large  Numbers  Statements  and  proofs  which  make  more  precise  our 
general  comments  about  the  effect  of  large  n  are  known  as  laws  of  large  numbers. 
Let  us  state  and  prove  one  such  law.  We  apply  Chebyshev’s  inequality  to  a  ran¬ 
dom  variable  whose  probability  function  is  the  binomial  distribution  (7.3).  From 
Problems  9  and  13  we  have  p  =  np  and  cr  =  y/npq.  Then  by  Chebyshev’s  inequality, 

(7.7)  (probability  of  \x  —  np\  >  t)  is  less  than  npq/t2. 


Let  us  choose  the  arbitrary  value  of  t  in  (7.7)  proportional  to  n,  that  is,  t  =  ne 
where  e  is  now  arbitrary.  Then  (7.7)  becomes 


(7.8) 


(probability  of  \x  —  np\  >  ne)  is  less  than  npq/n2 


or,  when  we  divide  the  first  inequality  by  n, 


(7.9) 


^probability 


of 


x 

- P 


>  e 


pq_ 

ne2 


is  less  than 
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Recall  that  x/n  is  the  relative  number  of  successes;  we  intuitively  expect  x/n  to  be 
near  p  for  large  n.  Now  (7.9)  says  that,  if  e  is  any  small  number,  the  probability  is 
less  than  pq/(ne 2)  for  x/n  to  differ  from  p  by  e;  that  is,  as  n  tends  to  infinity,  this 
probability  tends  to  zero.  (Note,  however,  that  x/n  need  not  tend  to  p.)  This  is 
one  form  of  the  law  of  large  numbers  and  it  justifies  our  intuitive  ideas. 

PROBLEMS,  SECTION  7 

For  the  values  of  n  indicated  in  Problems  1  to  4: 

(a)  Write  the  probability  density  function  f(x)  for  the  probability  of  x  heads  in  n  tosses 
of  a  coin  and  computer  plot  a  graph  of  f(x)  as  in  Figures  7.1  and  7.2.  Also  computer 
plot  a  graph  of  the  corresponding  cumulative  distribution  function  F{x). 

(b)  Computer  plot  a  graph  of  nf(x)  as  a  function  of  x/n  as  in  Figures  7.4  and  7.5. 

(c)  Use  your  graphs  and  other  calculations  if  necessary  to  answer  these  questions:  What 
is  the  probability  of  exactly  7  heads?  Of  at  most  7  heads?  [Hint:  Consider  F(x).]  Of 
at  least  7  heads?  What  is  the  most  probable  number  of  heads?  The  expected  number 
of  heads? 

1.  n  =  7  2.  n  =  12  3.  n  =  15  4.  n  =  18 

5.  Write  the  formula  for  the  binomial  density  function  f(x)  for  the  case  n  =  6 ,p  =  1/6, 
representing  the  probability  of,  say,  x  aces  in  6  throws  of  a  die.  Computer  plot  f(x) 
as  in  Figure  (7.3).  Also  plot  the  cumulative  distribution  function  F(x).  What  is 
the  probability  of  at  least  2  aces  out  of  6  tosses  of  a  die?  Hint:  Can  you  read  the 
probability  of  at  most  one  ace  from  one  of  your  graphs? 

For  the  given  values  of  n  and  p  in  Problems  6  to  8,  computer  plot  graphs  of  the  binomial 
density  function  for  the  probability  of  x  successes  in  n  Bernoulli  trials  with  probability  p 
of  success. 

6.  n  =  6,  p  =  5/6  (Compare  Problem  5) 

7.  n  =  50,  p  =  1/5  8.  n  =  50,  p  =  4/5 

9.  Use  the  second  method  of  Problem  5.11  to  show  that  the  expected  number  of  suc¬ 
cesses  in  n  Bernoulli  trials  with  probability  p  of  success  is  *  =  np.  Hint:  What  is 
the  expected  number  of  successes  in  one  trial? 

10.  Show  that  the  most  probable  number  of  heads  in  n  tosses  of  a  coin  is  in  for  even  n 
[that  is,  f(x)  in  (7.1)  has  its  largest  value  for  x  =  n/2]  and  that  for  odd  n,  there 
are  two  equal  “largest”  values  of  f(x),  namely  for  x  =  |(n  +  1)  and  x  =  \{n  —  1). 
Hint:  Simplify  the  fraction  f(x  +  l)//(x),  and  then  find  the  values  of  x  for  which 
it  is  greater  than  1  [that  is,  f(x  +  1)  >  /(*)],  and  less  than  or  equal  to  1  [that  is, 
f(x  +  1)  <  f(x)].  Remember  that  x  must  be  an  integer. 

11.  Use  the  method  of  Problem  10  to  show  that  for  the  binomial  distribution  (7.3),  the 
most  probable  value  of  x  is  approximately  np  (actually  within  1  of  this  value). 

12.  Let  x  =  number  of  heads  in  one  toss  of  a  coin.  What  are  the  possible  values  of  x  and 
their  probabilities?  What  is  px7  Hence  show  that  Var(a:)  =  [average  of  (x  —  px)2] 
=  L,  so  the  standard  deviation  is  |.  Now  use  the  result  from  Problem  6.15  “variance 
of  a  sum  of  independent  random  variables  =  sum  of  their  variances”  to  show  that  if 
x  —  number  of  heads  in  n  tosses  of  a  coin,  Var(a;)  =  |n  and  the  standard  deviation 
ax  =  |  y/n. 

13.  Generalize  Problem  12  to  show  that  for  the  general  binomial  distribution  (7.3), 
Var(a:)  =  npq,  and  a  =  y/npq. 
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8.  THE  NORMAL  OR  GAUSSIAN  DISTRIBUTION 

The  graph  of  the  normal  or  Gaussian  distribution  is  the  bell-shaped  curve  you  may 
know  as  the  normal  error  curve  (Figure  8.1).  The  normal  distribution  is  used  a 
great  deal  because,  as  we  shall  see,  it  is  not  only  of  interest  in  itself  (see  Problems 
2  and  3),  but  also  other  distributions  become  almost  normal  when  n  (the  number 
of  trials  or  measurements)  becomes  large  (see  Figures  8.2  and  8.3). 

The  probability  density  function  f{x)  and  the  cumulative  distribution  function 
F{x)  for  the  normal  or  Gaussian  distribution  are  given  by 


(8.1) 


f(x) 

F(x) 


1 


g\/2tt 

1 


a 


\/27 r  J_c 


e-(t-„)2/( 20  dt 


Normal  distribution 


It  is  straightforward  to  show  (Problem  1)  that  if  a;  is  a  random  variable  with  prob¬ 
ability  density  f(x)  in  (8.1),  then  the  mean  of  x  is  /i  and  the  standard  deviation 
is  tx.  Also  we  can  show  that  the  integral  of  f(x)  from  — oo  to  oo  is  equal  to  1  as  it 
must  be  for  a  probability  function.  Then  the  probability  that  a  normally  distributed 
random  variable  x  lies  between  x\  and  X2  is  the  area  under  the  f(x)  curve  between 
X\  and  X2  which  is 


(8.2)  F(x 2)  —  F(x  1)  =  probability  that  x\  <  x  <  x^- 


A  normal  density  function  graph  (Figure  8.1)  has  its  peak  at  x  =  fi  and  is 
symmetric  with  respect  to  the  line  x  =  fi.  Since  the  area  from  —00  to  00  is  1,  the 
area  from  —00  to  /x  is  ^  (that  is,  F(fi)  =  |),  and  similarly  the  area  from  /x  to  00  is  i. 
A  change  in  /x  merely  translates  the  graph  with  no  change  in  shape.  An  increase  in 
g  widens  and  flattens  the  graph  so  that  the  area  remains  1,  and  similarly  a  decrease 
in  a  makes  the  graph  taller  and  narrower.  (Problems  4  to  6).  The  area  from  /i  —  g 
to  n  +  o'  is  0.6827,  that  is,  the  probability  that  x  differs  from  its  mean  value  by 
1  standard  deviation  or  less,  is  just  over  68%.  The  probability  that  \x  —  fi\  <  2 g 
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is  over  95%  and  the  probability  that  \x  —  p\  <  3a  is  over  99.7%.  Note  that  these 
probabilities  are  independent  of  the  values  of  p  and  a  (Problem  7). 

Normal  Approximation  to  the  Binomial  Distribution  As  an  example  of 
approximating  another  distribution  by  a  normal  distribution,  let’s  consider  the  bi¬ 
nomial  distribution  (7.3).  For  large  n  and  large  np ,  we  can  use  Stirling’s  formula 
(Chapter  11,  Section  11)  to  approximate  the  factorials  in  C(n,x)  in  (7.3)  and  make 
other  approximations  to  find 


(8.3) 


/Or)  =  C(n,x)pxqn-X 


1  g-(x-np)2 /(2npq) 

-y/27r  npq 


f(x) 

0.3 

0.2 

0.1 

1  1  1  — 

0 

i  2 

3  4  5  6  7 

8 

Figure  8.2  Binomial  distribution  for  n  =  8,  p= 
and  the  normal  approximation. 

The  sign  ~  means  (as  in  Chapter  11,  Section  11)  that  the  ratio  of  the  exact  binomial 
distribution  (7.3)  and  the  right-hand  side  of  (8.3)  tends  to  1  as  n  — >  00.  An  outline 
of  a  derivation  of  (8.3)  is  given  in  Problem  8,  but  you  may  be  more  impressed  by 
doing  some  computer  plotting  of  graphs  like  Figures  8.2  and  8.3  (Problems  9  and  10). 
Although  we  have  said  that  equation  (8.3)  gives  an  approximation  valid  for  large  n, 
the  agreement  is  quite  good  even  for  fairly  small  values  of  n.  Figure  8.2  shows  this 
for  the  case  n  =  8.  The  binomial  distribution  f(x)  is  defined  only  for  integral  x ; 
you  should  compare  the  values  of  f(x)  with  the  values  of  the  approximating  normal 
curve  at  integral  values  of  x.  When  n  is  very  large  (Figure  8.3),  a  graph  of  the  exact 
binomial  distribution  is  very  close  to  the  normal  approximation  (Problem  9). 


Figure  8.3  Binomial  distribution  for  n  =  100,  p  = 

In  (8.3),  the  left-hand  side  is  the  exact  binomial  distribution  and  the  right- 
hand  side  is  a  normal  distribution  with  p  =  np  and  er  =  y/npq  as  we  see  by 
comparing  (8.3)  and  (8.1).  Recall  from  Problems  7.9  and  7.13  that  the  mean  value 
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p  and  standard  deviation  a  for  a  random  variable  whose  probability  function  is  the 
binomial  distribution  (7.3)  are  also  p  =  np  and  a  =  npq . 


For  the  binomial  distribution  and  its  normal  approximation, 
p  =  np,  a  =  \Jnpq . 


We  can  expect  this  in  general;  whatever  the  /i  and  a  are  for  a  given  distribution, 
the  normal  approximation  will  have  the  same  p  and  a. 

Example  1.  Find  the  probability  of  exactly  52  heads  in  100  tosses  of  a  coin  using  the 
binomial  distribution  and  using  the  normal  approximation. 

See  Figure  (8.3)  which  is  a  plot  of  the  binomial  probability  density  function  with 
n  =  100, p  =  We  find  by  computer  for  x  =  52,  binomial  /( 52)  =  0.07353,  which 
you  could  also  read  approximately  from  Figure  (8.3). 

For  the  normal  approximation,  we  find  from  (8.4),  p  =  np  =  100  •  i  =  50, 

a  =  sjnpq  =  yj  100  •  \  \  =  5.  Then  for  the  normal  approximation  with  p  =  50, 
<7  =  5,  we  find  by  computer  for  x  =  52,  normal  /( 52)  =  0.07365. 

Example  2.  Find  the  probability  P(45,  55)  of  between  45  and  55  heads  in  100  tosses  of  a 
coin,  that  is  45  <  x  <  55. 

As  in  Example  1,  for  the  binomial  distribution  we  have  n  =  100,  p  =  ^ .  The 
cumulative  binomial  distribution  function  F(x)  in  (7.4)  gives  P(45,  55)  as  a  sum  of 
terms;  we  want  the  sum  of  the  11  terms  with  x  =  45,46,  •  •  -55.  By  computer,  we 
can  find  P( 55),  the  binomial  cumulative  distribution  function  with  x  =  55,  which  is 
the  probability  of  55  heads  or  less,  and  then  find  and  subtract  F(44),  the  probability 
of  44  heads  or  less.  Thus  we  find  P(45,  55)  =  binomial  F( 55)  —  binomial  F(44)  = 
0.72875. 

For  the  normal  approximation,  we  find  by  computer  from  (8.2),  P(45,55)  = 
normal  F(55)  —  normal  P(45)  =  0.68269.  We  can  get  a  better  approximation  by 
integrating  from  44.5  to  55.5;  this  corresponds  more  closely  to  the  appropriate  area 
under  the  exact  binomial  graph  in  Figure  8.3  by  including  the  whole  steps  at  x  =  45 
and  x  =  55.  This  gives  P(44.5, 55.5)  =  normal  P(55. 5)  —normal  P(44.5)  =  0.72867. 

Standard  Normal  Distribution  This  is  just  the  normal  distribution  in  (8.1) 
for  the  special  case  p  =  0  and  a  =  1.  The  density  function  is  often  denoted  by 
and  the  corresponding  cumulative  distribution  function  by  4>(z): 


0(A)  =  -j=e  z2/2, 

(8.5)  1  ,z  Standard  normal  distribution 

$(z)  =  -==  /  e"“2/2  du. 

V2^J-  oo 


The  cumulative  distribution  function  <E>(z)  is  related  to  the  error  function  (see  Chap¬ 
ter  11,  Section  9). 
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It  is  sometimes  convenient  to  write  the  functions  in  (8.1)  in  terms  of  (j>(z)  and 
$(z).  We  can  do  this  by  making  the  change  of  variables  z=  (x  —  yt)/ a.  The  result 
is  (Problem  21) 


(8.6) 


f(x)  =  u  {x-n) 

<j  where  z  =  - . 

F(:r)  =  $(*), 


The  functions  cj>(z)  and  $(z)  [or  sometimes  $(2)  —  5]  are  tabulated  so  you  can  use 
either  tables  or  computer  to  do  problems. 

Example  3.  Find  the  number  r  such  that  the  area  under  the  normal  distribution  curve 
y  =  /(x)  from  y,  —  r  to  y  +  r  is  equal  to  1/2. 

Look  at  Figure  8.1  and  recall  that  the  area  from  —00  to  00  is  1  and  that  the 
graph  is  symmetric  about  x  =  y.  Then  the  integral  from  —00  to  y,  —  r  and  the 
integral  from  fi  +  r  to  00  are  equal  to  each  other  and  so  each  is  equal  to  1  /4.  Thus 
the  integral  from  —00  to  +  r  must  be  3/4,  that  is  F(/j,  +  r)  =  3/4.  By  (8.6)  this 
is  $(z)  =  3/4  where  z  =  (n  +  r  —  y)/a  =  r /a.  By  computer  or  tables  we  find  that 
if  $(z)  =  3/4,  then  z  =  0.6745.  Thus  r  =  0.6745cr. 

Example  4.  You  have  taken  a  test  (academic  like  the  SAT,  or  medical  like  a  bone  density 
test)  and  a  report  gives  your  z-score  as  1.14.  What  percent  of  your  peers  scored 
higher  than  you? 

If  we  call  the  actual  test  scores  x,  and  their  average  is  y  and  standard  deviation  a , 
then  the  term  z-score  means  the  value  of  z  =  (x  — /r) /cr  as  in  (8.6).  (In  words,  the  z- 
score  is  the  difference  between  x  and  its  average,  measured  in  units  of  the  standard 
deviation.)  Now  we  want  the  area  1  —  F(x)  =  1  —  $(z)  by  (8.6).  By  computer  (or 
tables)  we  find  $(1.14)  =  0.87;  then  1  —  0.87  =  0.13,  so  13%  of  your  peers  scored 
higher  than  you.  If  your  z-score  is  negative,  then  you  are  below  average — bad  if 
it’s  a  physics  test,  good  if  it’s  your  cholesterol!  For  example,  if  z  =  —0.25,  then 
$(z)  =  0.40,  so  60%  of  your  peers  scored  higher  than  you. 

Example  5.  Suppose  that  boxes  of  a  certain  kind  of  cereal  have  an  average  weight  of  16 
ounces  and  it  is  known  that  70%  of  the  boxes  weigh  within  1  ounce  of  the  average. 
What  is  the  probability  that  the  box  you  buy  weighs  less  than  14  ounces? 

If  x  represents  the  weight  of  a  box,  then  we  are  given  that  the  probability  of 
15  <  x  <  17  is  0.7.  Assuming  a  normal  distribution,  the  area  under  the  /(x)  curve 
up  to  x  =  n  =  16  is  ^  and  the  area  from  x  =  16  to  x  =  17  is  half  of  0.7  (by  symmetry; 
see  Figure  8.1).  Thus  F(17)  =  0.5  +  0.35  =  0.85.  We  want  to  find  the  probability 
that  x  <  14;  this  is  F(14).  Using  (8.6),  x  =  17  gives  z  =  (17  —  16)/<r  =  1/cr,  and 
similarly  x  =  14  gives  z  =  —  2/<r.  So  we  are  given  4>(1/<t)  =  0.85,  and  we  want 
to  find  $(— 2/cr).  By  computer  (or  tables)  we  find  that  if  4>(1  /a)  =  0.85,  then 
1/cr  =  1.0364,  so  2/cr  =  2.0728,  and  $(—2/(7)  =  0.019.  So  there  is  almost  a  2% 
chance  that  we  would  get  a  box  weighing  less  than  14  ounces. 

Note  that  in  Examples  4  and  5  we  assumed  a  normal  distribution  with  no  obvious 
justification.  It  is  a  very  interesting  and  useful  fact  that  such  an  assumption  is 
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reasonable  if  the  number  of  measurements  is  very  large.  We  will  discuss  this  further 
at  the  end  of  Section  10. 


PROBLEMS,  SECTION  8 


1.  Verify  that  for  a  random  variable  x  with  normal  density  function  f(x)  as  in  (8.1), 
the  mean  value  of  x  is  p  ,  the  standard  deviation  is  a,  and  the  integral  of  f(x)  from 
—oo  to  oo  is  1  as  it  must  be  for  a  probability  function.  Hint:  Write  and  evaluate 
the  integrals  fxoof(x)dx,f^°ooxf(x)dx,f^°oo(x  —  p)2f(x)dx.  See  equations  (6.2), 
(6.3),  and  (6.4). 

2.  Do  Problem  6.4  by  comparing  e~ax  with  f(x)  in  (8.1). 

3.  The  probability  density  function  for  the  x  component  of  the  velocity  of  a  molecule 

of  an  ideal  gas  is  proportional  to  e-m”  where  v  is  the  x  component  of  the 

velocity,  m  is  the  mass  of  the  molecule,  T  is  the  temperature  of  the  gas  and  k  is 
the  Boltzmann  constant.  By  comparing  this  with  (8.1),  find  the  mean  and  standard 
deviation  of  v,  and  write  the  probability  density  function  f(v). 

4.  Computer  plot  on  the  same  axes  the  normal  probability  density  functions  with  p  =  0, 

cr  =  1,  and  with  p  =  3,  cr  =  1  to  note  that  they  are  identical  except  for  a  translation. 

5.  Computer  plot  on  the  same  axes  the  normal  density  functions  with  p  =  0  and  a  =  1, 

2,  and  5.  Label  each  curve  with  its  a. 

6.  Do  Problem  5  for  cr  =  1  i  1. 

6  ’  3  5 

7.  By  computer  find  the  value  of  the  normal  cumulative  distribution  function  at  p  +  a, 
p  +  2a,  p  +  3a,  and  satisfy  yourself  that  these  are  independent  of  your  choices  for 
p  and  a.  Find  the  probabilities  that  x  is  within  1,  2,  or  3  standard  deviations  of 
its  mean  value  p  to  verify  the  results  stated  in  the  paragraph  following  (8.2).  Hint: 
See  Figure  (8.1).  The  probability  that  x  is  within  1  standard  deviation  of  its  mean 
value  is  the  area  from  p  —  a  to  p  +  a ;  this  is  twice  the  area  from  p  to  p  +  a.  Subtract 
T  (that  is  the  area  from  — oo  to  p)  from  your  value  of  F(p  +  a)  and  then  double  the 
result. 

8.  Carry  through  the  following  details  of  a  derivation  of  (8.3).  Start  with  (7.3);  we  want 
an  approximation  to  (7.3)  for  large  n.  First  approximate  the  factorials  in  C(n,x) 
by  Stirling’s  formula  (Chapter  11,  Section  11)  and  simplify  to  get 


/(*) 


\  n—x  i - 

nq  \  /  n 

n  —  xj  V  2?ra :(n  —  x)  ’ 


Show  that  if  5  =  x—np,  then  x  =  np+S  and  n—x  =  nq  —  S.  Make  these  substitutions 
for  x  and  n  —  a:  in  the  approximate  f{x).  To  evaluate  the  first  two  factors  in  f(x) 
(ignore  the  square  root  for  now):  Take  the  logarithm  of  the  first  two  factors;  show 


that 


In  —  =  -  In  (  1  +  — 
np 


and  a  similar  formula  for  ln [nq/(n  —  a;)];  expand  the  logarithms  in  a  series  of  powers 
of  5/(np),  collect  terms  and  simplify  to  get 


ln 


2  npq 


1  +  powers  of  — 
n 


( 

V  * 


X 


e~S2/(2npq) 


Hence 


n—x 
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for  large  n.  [We  really  want  5/n  small,  that  is,  x  near  enough  to  its  average  value  np 
so  that  5/n  =  (x  —  np)/n  is  small.  This  means  that  our  approximation  is  valid  for 
the  central  part  of  the  graph  (see  Figures  7.1  to  7.3)  around  x  =  np  where  f(x)  is 
large.  Since  f(x)  is  negligibly  small  anyway  for  x  far  from  np,  we  ignore  the  fact 
that  our  approximation  may  not  be  good  there.  For  more  detail  on  this  point,  see 
Feller,  p.  192].  Returning  to  the  square  root  factor  in  f(x),  approximate  x  by  np 
and  n  —  x  by  nq  (assuming  5  np  or  nq )  and  obtain  (8.3). 

9.  Computer  plot  a  graph  like  Figure  8.3  of  the  binomial  distribution  with  n  =  1000, 
p  =  | ,  and  observe  that  you  have  practically  the  corresponding  normal  approxima¬ 
tion. 

10.  Computer  plot  graphs  like  Figure  8.2  but  with  p  ^  \  to  see  that  as  n  increases,  the 
normal  approximation  becomes  good  (at  least  in  the  region  around  x  =  p  where 
the  probabilities  are  large)  even  though  the  binomial  graph  is  not  symmetric  (see 
Figure  7.3). 

As  in  Examples  1  and  2,  use  (a)  the  binomial  distribution;  (b)  the  corresponding  normal 

approximation,  to  find  the  probabilities  of  each  of  the  following: 

11.  Exactly  50  heads  in  100  tosses  of  a  coin. 

12.  Exactly  120  aces  in  720  tosses  of  a  die. 

13.  Between  100  and  140  aces  in  720  tosses  of  a  die. 

14.  Between  499,000  and  501,000  heads  in  106  tosses  of  a  coin. 

15.  Exactly  195  tails  in  400  tosses  of  a  coin. 

16.  Between  195  and  205  tails  in  400  tosses  of  a  coin. 

17.  Exactly  31  4’s  in  180  tosses  of  a  die. 

18.  Between  29  and  33  4’s  in  180  tosses  of  a  die. 

19.  Exactly  21  successes  in  100  Bernoulli  trials  with  probability  of  success. 

20.  Between  17  and  21  successes  in  100  Bernoulli  trials  with  probability  |  of  success. 

21.  Verify  equations  (8.6).  Hints:  In  F(x),  let  u  =  (t  —  p)/cr,  note  that  dt  =  adu.  What 
is  u  when  t  =  — oo?  When  t  =  xl  Remember  that  by  definition  z  =  (x  —  p)/a. 

22.  Using  (8.6),  do  Problem  7. 

23.  Using  (8.6),  find  h  such  that  90%  of  the  area  under  a  normal  f(x)  lies  between  p—h 
and  p  +  h.  Repeat  for  95%.  Hint:  See  Example  3. 

24.  Write  out  a  proof  of  Chebyshev’s  inequality  (see  end  of  Section  7)  for  the  case  of  a 
continuous  probability  function  f(x). 

25.  An  instructor  who  grades  “on  the  curve”  computes  the  mean  and  standard  deviation 
of  the  grades,  and  then,  assuming  a  normal  distribution  with  this  p  and  er,  sets  the 
border  lines  between  the  grades  at:  C  from  p  —  to  p  +  i<7,  B  from  p  +  i<r  to 
p  +  !<t,  A  from  p  +  |cr  up,  etc.  Find  the  percentages  of  the  students  receiving 
the  various  grades.  Where  should  the  border  lines  be  set  to  give  the  percentages: 
A  and  F,  10%;  B  and  D,  20%;  C,  40%? 
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9.  THE  POISSON  DISTRIBUTION 

The  Poisson  distribution  is  useful  in  a  variety  of  problems  in  which  the  probability 
of  some  occurrence  is  small  and  constant.  (See  Example  1  and  Problems  3  to  9.)  It 
is  also  a  good  approximation  to  the  binomial  distribution  when  p  is  so  small  that 
np  is  small  even  though  n  is  large  (see  Example  2). 

Let’s  derive  the  Poisson  distribution  by  considering  the  following  experiment. 
Suppose  we  observe  and  count  the  number  of  particles  emitted  per  unit  time  by  a 
radioactive  substance.  We  assume  that  our  period  of  observation  is  much  less  than 
the  half-life  of  the  substance,  so  that  the  average  counting  rate  does  not  decrease 
during  the  experiment.  Then  the  probability  that  one  particle  is  emitted  during  a 
small  time  interval  At  is  pAt.  p  =const.,  if  At  is  short  enough  so  that  the  probability 
of  two  particles  during  At  is  negligible.  We  want  to  find  the  probability  Pn{t )  of 
observing  exactly  n  counts  during  a  time  interval  t.  The  probability  Pn(t  +  At)  is 
the  probability  of  observing  n  counts  in  the  time  interval  t  +  At.  For  n  >  0,  this  is 
the  sum  of  the  probabilities  of  the  two  mutually  exclusive  events,  “n  particles  in  t, 
none  in  At”  and  “(n  —  1)  particles  in  t,  one  in  At”;  in  symbols, 

(9.1)  Pn(t  +  At)  =  P„(t)P0(At)  +  P„_r(f)Pi(Af). 

Now  Pi  (At)  is  the  probability  of  one  particle  in  At;  this,  by  assumption,  is  pA t. 
Then  the  probability  of  no  particles  in  At  is  1  —  Pl  (At)  =  1  —  pAt.  Substituting 
these  values  into  (9.1),  we  get 

(9.2)  Pn(t  +  At)  =  P„(t)(  1  -  pAt)  +  Pn_i(t)pAt, 


or, 

(9.3) 


Pn{t  +  At)  -  P»(t) 
At 


pPn_i{t)  -  pPn(t). 


Letting  At  — >  0,  we  have 

(9.4)  pp-  =  pPn—  i (t)  -  pPn{t). 


For  n  =  0,  (9.1)  simplifies  since  the  only  possible  event  is  “no  particles  in  t,  no 
particles  in  At,”  and  (9.4)  becomes,  for  n  =  0, 


(9.5)  pp  =  -pP0(t). 

Then,  since  Po(0)  =  “probability  that  no  particle  is  emitted  during  a  zero  time 
interval”  =  1,  integration  of  (9.5)  gives 


(9.6) 


Po  =  e-MT 


Substituting  (9.6)  into  (9.4)  with  n  =  1  gives  a  differential  equation  for  Pi(t);  its 
solution  (Problem  1)  is  Pi(t)  =  /rte_/i*.  Solving  (9.4)  successively  (Problem  1)  for 
P2,  P3,  ■  ■  • ,  Pn,  we  obtain 


Pn{t) 


{p£)_ 

n\ 


e 


—fit 


(9.7) 
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Putting  t  =  1,  we  get  for  the  probability  of  exactly  n  counts  per  unit  time 


(9.8) 

n 

P  =  —  e~P 

Poisson  distribution 

n! 

The  probability  density  function  (9.8)  is  called  the  Poisson  distribution  or  the  Pois¬ 
son  probability  density  function.  You  can  show  (Problem  2)  that  for  the  random 
variable  n,  the  mean  (that  is  the  average  number  of  counts  per  unit  time)  is  p,  and 
the  variance  is  also  p  so  the  standard  deviation  is  ^fp. 


Example  1.  The  number  of  particles  emitted  each  minute  by  a  radioactive  source  is 
recorded  for  a  period  of  10  hours;  a  total  of  1800  counts  are  registered.  During 
how  many  1-minute  intervals  should  we  expect  to  observe  no  particles;  exactly  one; 
etc.? 

The  average  number  of  counts  per  minute  is  1800/(10-60)  =  3  counts  per  minute; 
this  is  the  value  of  p.  Then  by  (9.8),  the  probability  of  n  counts  per  minute  is 


Pn  =  —re 
n! 


-3 


A  graph  of  this  probability  function  is  shown  in  Figure  9.1.  For  n  =  0,  we  find 
Po  =  e~3  =  0.05;  then  we  should  expect  to  observe  no  particles  in  about  5%  of 
the  600  1-minute  intervals,  that  is,  during  30  1-minute  intervals.  Similarly  we  could 
compute  the  expected  number  of  1-minute  intervals  during  which  1,  2,  •  •  • ,  particles 
would  be  observed. 


P„ 
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0.15 


0.10 


10 


Figure  9.1  Poisson  distribution  p  =  3. 


Poisson  Approximation  of  the  Binomial  Distribution  In  Section  8,  we  dis¬ 
cussed  the  fact  that  the  binomial  distribution  can  be  approximated  by  the  normal 
distribution  for  large  n  and  large  np.  If  p  is  very  small  so  that  np  is  very  much  less 
than  n  (say,  for  example,  p  =  10“  ,n  =  2000,  np  =  2),  the  normal  approximation 
is  not  good.  In  this  case  you  can  show  (Problem  10)  that  the  Poisson  distribution 
gives  a  good  approximation  to  the  binomial  distribution  (7.3),  that  is,  that 
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(9.9) 

(rrn)x  p~np 

C(n,x)pxqn-m~{  P> 

XI 

Large  n,  small  p. 

[The  exact  meaning  of  (9.9)  is  that,  for  any  fixed  x,  the  ratio  of  the  two  sides  ap¬ 
proaches  1  as  n  — >  oo  and  p  — >  0  with  np  remaining  constant.] 

Example  2.  If  1500  people  each  select  a  number  at  random  between  1  and  500,  what  is 
the  probability  that  2  people  selected  the  number  29? 

The  answer  is  given  by  the  binomial  distribution  (7.3)  with  n  =  1500,  p  =  1/500, 
x  =  2.  This  is 


C(n,  x)pxqn~x 


1500!  /  1  \2  /499\998 
211498!  V500/  V500/ 


(Or  from  your  computer:  the  binomial  probability  density  function  with  n  =  1500, 
p  =  1/500,  x  =  2,  is  0.2241  to  four  decimal  places.).  A  simpler  formula  from  (9.9)  is 
the  Poisson  approximation  with  p  =  np  =  3,  x  =  2,  namely  pxe~x/x\  =  32e~2/2!  = 
0.2240.  (Or  from  your  computer,  the  Poisson  probability  density  function  with 
p  =  3,  x  =  2,  is  0.2240  to  four  decimal  places.)  It  is  interesting  to  computer  plot  on 
the  same  axes  the  binomial  distribution  with  n  =  1500,  p  =  1/500,  and  the  Poisson 
distribution  with  p  =  3  as  in  Figure  9.1  to  discover  that  they  are  almost  identical 
(Problem  12). 


Approximations  by  the  Normal  Distribution  We  have  commented  that  many 
distributions  can  be  approximated  by  the  normal  distribution  when  n  and  p  =  np 
are  both  large,  and  have  shown  this  for  the  binomial  distribution  in  (8.1).  The 
Poisson  distribution  when  p  is  large  is  also  fairly  well  approximated  by  the  normal 
distribution  as  in  (9.10). 


(9.10) 


M  *  _J_e-(x-A.)a/(aM) 
a;!  \/2np 


p  large. 


Note  that  the  normal  distribution  in  (9.10)  has  the  same  mean  and  variance  as  the 
Poisson  distribution  it  is  approximating  (see  Problem  2  for  the  Poisson  mean  and 
variance).  It  is  useful  to  computer  plot  on  the  same  axes  graphs  of  the  Poisson 
distribution  and  their  normal  approximations  (Problem  13). 

PROBLEMS,  SECTION  9 

1.  Solve  the  sequence  of  differential  equations  (9.4)  for  successive  n  values  [as  started 
in  (9.5)  and  (9.6)]  to  obtain  (9.7). 

2.  Show  that  the  average  value  of  a  random  variable  n  whose  probability  function  is 
the  Poisson  distribution  (9.8)  is  the  number  p  in  (9.8).  Also  show  that  the  standard 
deviation  of  the  random  variable  is  y rp.  Hint:  Write  the  infinite  series  for  ex , 
differentiate  it  and  multiply  by  x  to  get  xex  =  J)(ra"/n!);  put  x  =  p.  To  find  cr2 
differentiate  the  xex  series  again,  etc. 
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3.  In  an  alpha-particle  counting  experiment  the  number  of  alpha  particles  is  recorded 
each  minute  for  50  hours.  The  total  number  of  particles  is  6000.  In  how  many 
1-minute  intervals  would  you  expect  no  particles?  Exactly  n  particles,  for  n  =  1,2, 
3,  4,  5?  Plot  the  Poisson  distribution. 

4.  Suppose  you  receive  an  average  of  4  phone  calls  per  day.  What  is  the  probability 
that  on  a  given  day  you  receive  no  phone  calls?  Just  one  call?  Exactly  4  calls? 

5.  Suppose  that  you  have  5  exams  during  the  5  days  of  exam  week.  Find  the  probability 
that  on  a  given  day  you  have  no  exams;  just  1  exam;  2  exams;  3  exams. 

6.  If  you  receive,  on  the  average,  5  email  messages  per  day,  in  how  many  days  out 
of  a  365-day  year  would  you  expect  to  receive  exactly  5  messages?  Fewer  than  5? 
Exactly  10?  More  than  10?  Just  1?  None  at  all? 

7.  In  a  club  with  500  members,  what  is  the  probability  that  exactly  two  people  have 
birthdays  on  July  4? 

8.  If  there  are  100  misprints  in  a  magazine  of  40  pages,  on  how  many  pages  would  you 
expect  to  find  no  misprints?  Two  misprints?  Five  misprints? 

9.  If  there  are,  on  the  average,  7  defects  in  a  new  car,  what  is  the  probability  that  your 
new  car  has  only  2  defects?  That  it  has  6  or  7?  That  it  has  more  than  10? 

10.  Derive  equation  (9.9)  as  follows:  In  C(n,x),  show  that  n\/{n  —  x)\  ~  nx  for  fixed  x 
and  large  n  [write  n\/(n  —  x)\  as  a  product  of  x  factors,  divide  by  nx ,  and  show  that 
the  limit  is  1  as  n  — >  oo].  Then  write  qn~x  =  (1  —  p)n~x  as  (1  —  p)n(  1  —  p)~x  = 
(1  —  np/n)rl(  1  —  p)~x;  evaluate  the  limit  of  the  first  factor  as  n  — >  oo,  np  fixed;  the 
limit  of  the  second  factor  as  p  — >  0  is  1.  Collect  your  results  to  obtain  equation  (9.9). 

11.  Suppose  520  people  each  have  a  shuffled  deck  of  cards  and  draw  one  card  from 
the  deck.  What  is  the  probability  that  exactly  13  of  the  520  cards  will  be  aces  of 
spades?  Write  the  binomial  formula  and  approximate  it.  Which  is  best,  the  normal 
or  the  Poisson  approximation?  Although  you  only  need  values  at  one  x  to  answer 
the  question,  you  might  like  to  computer  plot  on  the  same  axes  graphs  of  the  three 
distributions  for  the  given  n  and  p. 

12.  Computer  plot  on  the  same  axes  graphs  of  the  binomial  distribution  in  Example  2 
and  the  Poisson  and  normal  approximations. 

13.  Computer  plot  on  the  same  axes  a  graph  of  the  Poisson  distribution  and  the  corre¬ 
sponding  normal  approximation  for  the  cases  p  =  1,  5,  10,  20,  30. 

10.  STATISTICS  AND  EXPERIMENTAL  MEASUREMENTS 

Statistics  uses  probability  theory  to  consider  sets  of  data  and  draw  reasonable  con¬ 
clusions  from  them.  So  far  in  this  chapter,  we  have  been  discussing  problems  for 
which  we  could  write  down  a  density  function  formula  (normal,  Poisson,  etc.). 
Suppose  that,  instead,  we  have  only  a  table  of  data,  say  a  set  of  laboratory  mea¬ 
surements  of  some  physical  quantity.  Presumably,  if  we  spent  more  time,  we  could 
enlarge  this  table  of  data  as  much  as  we  liked.  We  can  then  imagine  an  infinite 
set  of  measurements  of  which  we  have  only  a  sample.  The  infinite  set  is  called  the 
parent  population  or  universe.  What  we  would  really  like  to  know  is  the  probability 
function  for  the  parent  population,  or  at  least  the  average  value  p  (often  thought  of 
as  the  “true”  value  of  the  quantity  being  measured)  and  the  standard  deviation  cr  of 
the  parent  population.  We  must  content  ourselves  with  the  best  estimates  we  can 
make  of  these  quantities  using  our  available  sample,  that  is,  the  set  of  measurements 
which  we  have  made. 
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Estimate  of  Population  Average  As  a  quick  estimate  of  //  we  might  take  the 
median  of  our  measurements  x$  (a  value  such  that  there  are  equal  numbers  of  larger 
and  smaller  measurements),  or  the  mode  (the  measurement  we  obtained  the  most 
times,  that  is  the  most  probable  measurement).  The  most  frequently  used  estimate 
of  fx  is,  however,  the  arithmetic  mean  (or  average)  of  the  measurements,  that  is  the 
sample  mean  x  =  (1/n)  )T/!=1  x*.  Thus  we  have 


n 

(10.1)  Estimate  of  population  mean  is  [x  ~  x  =  (1  /n)  x,;. 


For  a  large  set  of  measurements  we  can  justify  this  choice  as  follows  (also  see  Prob¬ 
lem  1).  Assuming  that  the  parent  population  for  our  measurements  has  probability 
density  function  /(x)  with  expected  value  fx  and  standard  deviation  a,  it  is  easy 
to  show  (Problem  2)  that  the  expected  value  of  x  is  fx  and  the  standard  deviation 
of  x  is  cr/y/n.  Now  Chebyshev’s  inequality  (end  of  Section  7)  says  that  a  random 
variable  is  unlikely  to  differ  from  its  expected  value  by  more  than  a  few  standard 
deviations.  For  our  problem  this  says  that  x  is  unlikely  to  differ  from  /r  by  more 
than  a  few  multiples  of  a/y/n,  which  becomes  small  as  n  increases.  Thus  x  becomes 
an  increasingly  good  estimate  of  p  as  we  increase  the  number  n  of  measurements. 
Note  that  this  just  says  mathematically  what  you  would  assume  from  experience, 
that  the  average  of  a  large  number  of  measurements  is  more  likely  to  be  accurate 
than  the  average  of  a  small  number.  For  example,  two  measurements  might  both 
be  too  large,  but  it’s  unlikely  that  20  would  all  be  too  large. 

Estimate  of  Population  Variance  Our  first  guess  for  an  estimate  of  er2  might 
be  s2  =  (1/n)  X^r=i(a;i  —  x)2,  but  we  would  be  wrong.  To  see  what  is  reasonable,  we 
find  the  expected  value  of  s2  assuming  that  our  measurements  are  from  a  population 
with  mean  /i  and  variance  <j2.  The  result  is  (Problem  3),  E(s2)  =  [(n  —  l)/n]er2. 
We  conclude  that  a  reasonable  estimate  of  a2  is  -^s2. 

n—  1 


(10.2) 


Estimate  of  population  variance  is  a2 


5>-*)2- 

i=l 


( Caution :  The  term  “sample  variance”  is  used  in  various  references — texts,  refer¬ 
ence  books,  computer  programs — to  mean  either  our  s 2  or  our  estimate  of  <r2,  so 
check  the  definition  carefully  in  any  reference  you  use.  We  shall  avoid  using  the 
term.) 

The  quantity  a  which  we  have  just  estimated  is  the  standard  deviation  for  the 
parent  population  whose  probability  function  we  call  /(x).  Consider  just  a  single 
measurement  x.  The  function  /(x)  (if  we  knew  it)  would  give  us  the  probabilities  of 
the  different  possible  values  of  x,  the  population  mean  /r  would  tell  us  approximately 
the  value  we  are  apt  to  find  for  x,  and  the  standard  deviation  a  would  tell  us 
roughly  the  spread  of  x  values  about  /x.  Since  a  tells  us  something  about  a  single 
measurement,  it  is  often  called  the  standard  deviation  of  a  single  measurement. 
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Standard  Deviation  of  the  Mean;  Standard  Error  Instead  of  a  single  mea¬ 
surement,  let  us  consider  x,  the  average  (mean)  of  a  set  of  n  measurements.  (The 
mean,  x,  will  be  what  we  will  use  or  report  as  the  result  of  an  experiment.)  Just  as 
we  originally  imagined  obtaining  the  probability  function  f(x)  by  making  a  large 
number  of  single  measurements,  so  we  can  imagine  obtaining  a  probability  function 
g(x)  by  making  a  large  number  of  sets  of  n  measurements  with  each  set  giving  us 
a  value  of  x.  The  function  g(x)  (if  we  knew  it)  would  give  us  the  probability  of 
different  values  of  x.  We  have  seen  (Problem  2)  that  Var(x)  =  a 2/n,  so  the  standard 
deviation  of  the  mean  (that  is,  of  x)  is 


(10.3) 


\/Var(a;) 


a 


y/n' 


The  quantity  am  is  also  called  the  standard  error ;  it  gives  us  an  estimate  of  the 
spread  of  values  of  x  about  g.  We  see  that  the  new  probability  function  g(x)  must 
be  much  more  peaked  than  /( x)  about  the  value  g  because  the  standard  deviation 
a / yfn  is  much  smaller  than  a.  Collecting  formulas  (10.2)  and  (10.3),  we  have 


(10.4) 


e;u(^-^)2 

n(n  —  1) 


Example  1.  To  illustrate  our  discussion,  let’s  consider  the  following  set  of  measurements: 
{7.2,  7.1,  6.7,  7.0,  6.8,  7.0,  6.9,  7.4,  7.0,  6.9}.  [Note  that,  to  show  methods  but 
minimize  computation,  we  consider  unrealistically  small  sets  of  measurements.] 

1  .E  70 

From  (10.1)  we  find  u~x  =  —  >  x,  =  —  =  7.0. 
v  ;  10^  10 

7=1 

1  ^  Q  30 

From  (10.2)  we  find  a2  ~  ( Xi  —  7)2  =  — —  =  0.04,  a  ~  0.2. 

9  .  ^  9 


From  (10.4),  the  standard  error  is  cr m  —  \j  Yq- g  =  0-0632. 

Combination  of  Measurements  We  have  discussed  how  we  can  use  a  set  of 
measurements  Xi  to  estimate  g  (the  population  average)  by  x  (the  sample  average) 
and  to  estimate  the  standard  error  amx  =  -\/V ar(af)  [equation  (10.4)].  Now  suppose 
we  have  done  this  for  two  quantities,  x  and  y:  and  we  want  to  use  a  known  formula 
w  =  w(x ,  y)  to  estimate  a  value  for  w  and  the  standard  error  in  w.  First  we  consider 
the  simple  example  w  =  x  +  y.  Then,  by  Problem  6.13, 

(10.5)  E(w )  =  E{  x)  +  E(y)  =  gx  +  gy 

where  gx  and  gy  are  population  averages.  As  discussed  above,  we  estimate  gx 
and  gy  by  x  and  y  and  conclude  that  a  reasonable  estimate  of  w  is 

(10.6)  W=x  +  y. 
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Now  let  us  assume  that  x  and  y  are  independently  measured  quantities.  Then  by 
Problem  6.15, 

Var(W)  =  Var(x)  +  Var  (y)  =  a^nx  +  a?ny, 

<10J>  _  mrr 

amw  —  Y  ffrni  +  a my  • 

Next  consider  the  case  w  =  4  —  2x  +  3y.  As  in  equations  (10.5)  and  (10.6),  we 
find  w  =  4  —  2x  +  3 y.  Now  by  Problem  5.13,  we  have  Var(x  +  K)  =  Var(x),  and 
Var(/\x)  =  K2  Var(x),  where  K  is  a  constant.  Thus, 


(10.8)  Var(w)  =  Var(4  —  2x  +  3  y)  =  Var(— 2x  +  3  y) 

=  (— 2)2  Var(T)  +  (3)2  Var(y)  =  4 a2mx  +  9 a2my, 

(10.9)  (Tmw  =  + 


We  can  now  see  how  to  find  w  and  crmw  for  any  function  w(x,  y )  which  can 
be  approximated  by  the  linear  terms  of  its  Taylor  series  about  the  point  (/ j,x,yy)7 
namely  (see  Chapter  4,  Section  2) 


(10.10) 


w{x,y)  =  w(y,X:Liy)  + 


Hx)  + 


(y  -  yv) 


where  the  partial  derivatives  are  evaluated  at  x  =  y  =  and  so  are  constants. 
[Practically  speaking,  this  means  that  the  first  partial  derivatives  should  not  be 
near  zero — we  can’t  expect  good  results  near  a  maximum  or  minimum  of  w — and 
the  higher  derivatives  should  not  be  large,  that  is,  w  should  be  “smooth”  near  the 
point  (fj,x,  ny)]  Assuming  (10.10),  and  remembering  that  w(fj,x,y,y)  and  the  partial 
derivatives  are  constants,  we  find 


dw 


(10.11)  E[w(x,  y)\  S*  w(nx,Hy)  +[-Q^)  iE(x)  ~  y-x]  +  (  )  [E{y)  -  yy\ 


dw 


dy 


=  W(HX,HV). 


Since  we  have  agreed  to  estimate  yx  and  yv  by  x  and  y,  we  conclude  that  a  reason¬ 
able  estimate  of  w  is 


(10.12) 


w  =  w{ x,  y). 


(This  may  look  obvious,  but  see  Problem  7.) 

Then,  putting  x  =  x,  y  =  y  in  (10.10)  and  remembering  the  comment  iust  before 
(10.11),  we  find  as  in  (10  8) 


(10.13) 


Var  (w)  =  Var  [w(x,y)\ 


=  Var 


w{yx,yy)  + 


(£>- 


dw 

dy 


y-x)  + 


(y  -  yv) 


We  can  use  (10.12)  and  (10.13)  to  estimate  the  value  of  a  given  function  w  of  two 
measured  quantities  x  and  y  and  to  find  the  standard  error  in  w. 
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Example  2.  From  Example  1  we  have  x  =  7  and  amx  =  0.0632.  Suppose  we  have  also 
found  from  measurements  that  y  =  5  and  amy  =  0.0591.  If  w  =  x/y,  find  w  and 
< Tmw  From  (10.12)  we  have  w  =  x/y  =  7/5  =  1.4.  From  (10.13)  we  find 


/  2  2/2  2 

<7rmu  =  v  (0  + (^)  ^  =  V (s)  (0-0632)2  +  (2!)  (°-°591)2 

=  0.0208. 

Central  Limit  Theorem  So  far  we  have  not  assumed  any  special  form  (such 
as  normal,  etc.)  for  the  density  function  f(x)  of  the  parent  population,  so  that 
our  results  for  computation  of  approximate  values  of  y,  cr,  and  am  from  a  set  of 
measurements  apply  whether  or  not  the  parent  distribution  is  normal.  (And,  in 
fact,  it  may  not  be;  for  example,  Poisson  distributions  are  quite  common.)  You 
will  find,  however,  that  most  discussions  of  experimental  errors  are  based  on  an 
assumed  normal  distribution.  Let  us  discuss  the  justification  for  this.  We  have 
seen  above  that  we  can  think  of  the  sample  average  x  as  a  random  variable  with 
average  y  and  standard  deviation  a/y/n.  We  have  said  that  we  might  think  of  a 
density  function  g(x)  for  x  and  that  it  would  be  more  strongly  peaked  about  y  than 
the  density  function  f(x)  for  a  single  measurement,  but  we  have  not  said  anything 
so  far  about  the  form  of  g(x).  There  is  a  basic  theorem  in  probability  (which  we 
shall  quote  without  proof)  which  gives  us  some  information  about  the  probability 
function  for  x.  The  central  limit  theorem  says  that  no  matter  what  the  parent 
probability  function  f(x)  is  (provided  y  and  0  exist),  the  probability  function  for 
x  is  approximately  the  normal  distribution  with  standard  deviation  a  /  yfn  if  n  is 
large. 

Confidence  Intervals,  Probable  Error  If  we  assume  that  the  probability  func¬ 
tion  for  x  is  normal  (a  reasonable  assumption  if  n  is  large) ,  then  we  can  give  a  more 
specific  meaning  to  am  (standard  deviation  of  the  mean)  than  our  vague  statement 
that  it  gives  us  an  estimate  of  the  spread  of  x  values  about  y.  Since  the  probability 
for  a  normally  distributed  random  variable  to  have  values  between  y  —  a  and  y  +  cr 
is  0.6827  (see  Section  8  and  Problem  8.7),  we  can  say  that  the  probability  is  about 
68%  for  a  measurement  of  x  to  lie  between  y  —  am  and  y  +  am.  This  interval  is 
called  the  68%  confidence  interval.  Similarly  we  can  find  an  interval  y  ±  r  such 
that  the  probability  is  ^  that  a  new  measurement  would  fall  in  this  interval  (and 
so  also  the  probability  is  \  that  it  would  fall  outside!),  that  is,  a  50%  confidence 
interval.  From  Section  8,  Example  3,  this  is  r  =  0.6745crm.  The  number  r  is  called 
the  probable  error.  When  we  have  found  erm  as  in  Examples  1  and  2,  we  just  have 
to  multiply  it  by  0.6745  to  find  the  corresponding  probable  error.  Similarly  we  can 
find  the  error  corresponding  to  other  choices  of  confidence  interval  (see  Problem  4) . 

PROBLEMS,  SECTION  10 

1.  Let  mi,  m2,  ■  ■  ■  ,  mn  be  a  set  of  measurements,  and  define  the  values  of  x%  by  x\  = 
mi  —  a,x 2  =  m 2  —  a,  ■  ■  ■  ,x„  —  mn  —  a,  where  a  is  some  number  (as  yet  unspecified, 
but  the  same  for  all  Xi).  Show  that  in  order  to  minimize  X/Li  xi>  we  should  choose 
a  =  (1/n)  J27=i  mi ■  Hint:  Differentiate  X)"=i  xi  with  respect  to  a.  You  have  shown 
that  the  arithmetic  mean  is  the  “best”  average  in  the  least  squares  sense,  that  is, 
that  if  the  sum  of  the  squares  of  the  deviations  of  the  measurements  from  their 
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“average”  is  a  minimum,  the  “average”  is  the  arithmetic  mean  (rather  than,  say,  the 
median  or  mode). 

2.  Let  xi,X2,  ■  ■  ■  ,x„  be  independent  random  variables,  each  with  density  function  f(x), 
expected  value  y,  and  variance  a2.  Define  the  sample  mean  by  x  =  Xi-  Show 
that  E(x)  =  /x,  and  Var(af)  =  a2/n.  (See  Problems  5.9,  5.13,  and  6.15.) 

3.  Define  s  by  the  equation  s2  =  (1  /n)  —  x)2 .  Show  that  the  expected  value 

of  s2  is  [(n  —  l)/n]cr2.  Hints:  Write 

(xi  -  x)2  =  [{Xi  -  /x)  -  (x  -  y)f 

=  ( Xi  -  /x)2  -  2{xi  -  /x)(®  -  /x)  +  (x  -  /x)2. 

Find  the  average  value  of  the  first  term  from  the  definition  of  a2  and  the  average 
value  of  the  third  term  from  Problem  2.  To  find  the  average  value  of  the  middle 
term  write 

(x-  /x)  =  (  11  +  X2  + - — —  —  fi)  =  —[(an  —  /x)  +  (x2  -  /x)  H - +  (x„  -  /x)]. 

\  n  J  n 

Show  by  Problem  6.14  that 

E\(xi  -  [i)(xj  -  /x)]  =  E(xi  -  n)E(xj  -  /x)  =  0  for  i  ^  j, 
and  evaluate  E[(xi  —  /x)2]  (same  as  the  first  term).  Collect  terms  to  find 


4.  Assuming  a  normal  distribution,  find  the  limits  /x±  h  for  a  90%  confidence  interval; 
for  a  95%  confidence  interval;  for  a  99%  confidence  interval.  What  percent  confidence 
interval  is  n  ±  1.3cr?  Hints:  See  Section  8,  Example  3,  and  Problems  8.7,  8.22, 
and  8.23. 

5.  Show  that  if  w  =  xy  or  w  =  x/y,  then  (10.14)  gives  the  convenient  formula  for 
relative  error 


6.  By  expanding  w(x,  y,  z)  in  a  three- variable  power  series  similar  to  (10.10),  show  that 


T  w 


2 


r 


2 
z  ’ 


7.  Equation  (10.12)  is  only  an  approximation  (but  usually  satisfactory).  Show,  how¬ 
ever,  that  if  you  keep  the  second  order  terms  in  (10.10),  then 


w  =  w(x,y)  +  - 


a 


2 

y 


8.  The  following  measurements  of  x  and  y  have  been  made. 

x  :  5.1,  4.9,  5.0,  5.2,  4.9,  5.0,  4.8,  5.1 
y  :  1.03, 1.05,  0.96, 1.00, 1.02,  0.95,  0.99, 1.01, 1.00,  0.99 

Find  the  mean  value  and  the  probable  error  of  x,  y,  x  +  y,  xy,  a:3sinx/,  and  lira;. 
Hint:  See  Examples  1  and  2  and  the  last  paragraph  of  this  section. 
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9.  Given  the  measurements 

x  :  98, 101,102,100,99 
y  :  21.2,  20.8,  18.1,  20.3, 19.6,  20.4,  19.5,  20.1 

find  the  mean  value  and  probable  error  of  x  —  y,  x/y,  x2y3,  and  ylni. 
10.  Given  the  measurements 


x  :  5.8,  6.1,  6.4,  5.9,  5.7,  6.2,  5.9 
y  :  2.7,  3.0,  2.9,  3.3,  3.1 

find  the  mean  value  and  probable  error  of  2x  —  y,  y2  —  x,  ey ,  and  x/y2 . 

11.  MISCELLANEOUS  PROBLEMS 

1.  (a)  Suppose  you  have  two  quarters  and  a  dime  in  your  left  pocket  and  two  dimes 

and  three  quarters  in  your  right  pocket.  You  select  a  pocket  at  random  and 
from  it  a  coin  at  random.  What  is  the  probability  that  it  is  a  dime? 

(b)  Let  x  be  the  amount  of  money  you  select.  Find  E(x). 

(c)  Suppose  you  selected  a  dime  in  (a).  What  is  the  probability  that  it  came  from 
your  right  pocket? 

(d)  Suppose  you  do  not  replace  the  dime,  but  select  another  coin  which  is  also 
a  dime.  What  is  the  probability  that  this  second  coin  came  from  your  right 
pocket? 

2.  (a)  Suppose  that  Martian  dice  are  regular  tetrahedra  with  vertices  labeled  1  to  4. 

Two  such  dice  are  tossed  and  the  sum  of  the  numbers  showing  is  even.  Let  x 
be  this  sum.  Set  up  the  sample  space  for  x  and  the  associated  probabilities. 

(b)  Find  E{x)  and  ax. 

(c)  Find  the  probability  of  exactly  fifteen  2’s  in  48  tosses  of  a  Martian  die  using 
the  binomial  distribution. 

(d)  Approximate  (c)  using  the  normal  distribution. 

(e)  Approximate  (c)  using  the  Poisson  distribution. 

3.  There  are  3  red  and  2  white  balls  in  one  box  and  4  red  and  5  white  in  the  second 
box.  You  select  a  box  at  random  and  from  it  pick  a  ball  at  random.  If  the  ball  is 
red,  what  is  the  probability  that  it  came  from  the  second  box? 

4.  If  4  letters  are  put  at  random  into  4  envelopes,  what  is  the  probability  that  at  least 
one  letter  gets  into  the  correct  envelope? 

5.  Two  decks  of  cards  are  “matched,”  that  is,  the  order  of  the  cards  in  the  decks  is 
compared  by  turning  the  cards  over  one  by  one  from  the  two  decks  simultaneously; 
a  “match”  means  that  the  two  cards  are  identical.  Show  that  the  probability  of  at 
least  one  match  is  nearly  1  —  1/e. 

6.  Find  the  number  of  ways  of  putting  2  particles  in  5  boxes  according  to  the  different 
kinds  of  statistics. 

7.  Suppose  a  coin  is  tossed  three  times.  Let  r  be  a  random  variable  whose  value  is  1  if 
the  number  of  heads  is  divisible  by  3,  and  0  otherwise.  Set  up  the  sample  space  for 
x  and  the  associated  probabilities.  Find  x  and  a. 
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8.  (a)  A  weighted  coin  has  probability  |  of  coming  up  heads  and  probability  |  of 

coming  up  tails.  The  coin  is  tossed  twice.  Let  x  =  number  of  heads.  Set  up 
the  sample  space  for  x  and  the  associated  probabilities. 

(b)  Find  x  and  a. 

(c)  If  in  (a)  you  know  that  there  was  at  least  one  tail,  what  is  the  probability  that 
both  were  tails? 

9.  (a)  One  box  contains  one  die  and  another  box  contains  two  dice.  You  select  a  box 

at  random  and  take  out  and  toss  whatever  is  in  it  (that  is,  toss  both  dice  if  you 
have  picked  box  2).  Let  x  =  number  of  3’s  showing.  Set  up  the  sample  space 
and  associated  probabilities  for  x. 

(b)  What  is  the  probability  of  at  least  one  3? 

(c)  If  at  least  one  3  turns  up,  what  is  the  probability  that  you  picked  the  first  box? 

(d)  Find  x  and  a. 

Do  Problems  10  to  12  using  both  the  binomial  distribution  and  the  normal  approximation. 

10.  A  true  coin  is  tossed  104  times. 

(a)  Find  the  probability  of  getting  exactly  5000  heads. 

(b)  Find  the  probability  of  between  4900  and  5075  heads. 

11.  A  die  is  thrown  720  times. 

(a)  Find  the  probability  that  3  comes  up  exactly  125  times. 

(b)  Find  the  probability  that  3  comes  up  between  115  and  130  times. 

12.  Consider  a  biased  coin  with  probability  1/3  of  heads  and  2/3  of  tails  and  suppose 
it  is  tossed  450  times. 

(a)  Find  the  probability  of  getting  exactly  320  tails. 

(b)  Find  the  probability  of  getting  between  300  and  320  tails. 

13.  A  radioactive  source  emits  1800  a  particles  during  an  observation  lasting  10  hours. 
In  how  many  one  minute  intervals  do  you  expect  no  a’s?  5a’s? 

14.  Suppose  a  200-page  book  has,  on  the  average,  one  misprint  every  10  pages.  On 
about  how  many  pages  would  you  expect  to  find  2  misprints? 

In  Problems  15  and  16,  find  the  binomial  probability  for  the  given  problem,  and  then 
compare  the  normal  and  the  Poisson  approximations. 

15.  Out  of  1095  people,  what  is  the  probability  that  exactly  2  were  born  on  Jan.  1? 
Assume  365  days  in  a  year. 

16.  Find  the  probability  of  x  successes  in  100  Bernoulli  trials  with  probability  p  —  1/5 
of  success  (a)  if  x  =  25;  (b)  if  x  =  21. 

17.  Given  the  measurements 

x  :  2.3,  2.1,  1.8, 1.7,  2.1 
y  :  1.0,  1.1,  0.9 

find  the  mean  value  and  the  probable  error  for  x  —  y,  xy ,  and  x/y3. 

18.  Given  the  measurements 


x  :  5.7,  4.5,  4.8,  5.1,  4.9 
y  :  61.5,60.1,59.7,  60.3,  58.4 

find  the  mean  value  and  the  probable  error  for  x  +  y,  y/x,  and  x2. 
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1.1  0.0173  yd;  0.104  yd  (compared  to  a  total  of  5  yd) 


1.3 

5 

9 

1.5 

7 

12 

1.9 

6 

7 

1.11 

19 

28 

1.15  1 

2.1 

1 

2.4 

oo 

2.7 

e2 

2.9 

1 

4.2 
4.4 

4.6 

5.2 

5.6 


an  = 


0;  Sn  =  -  1  —  — 


5.  R  1 

4’  "  4  •  5ra_1 


SVi  —  (  1 


1 


On  = 


i{n  +  1) 


0;  Sn  =  1  - 


n  +  1 


1.  R  . 

2’  "  2  •  3^ 

l;  = 


0 


1 


n+  1 


Test  further  5.4  D  5.5  D 

Test  further  5.8  Test  further  5.9  D 


6.5  b 

D 

6.7 

D 

6.9  C 

6.10 

C 

6.14  D 

6.18 

D 

6.20 

C 

6.22  C 

6.23 

D 

6.24  D 

6.26 

C 

6.29 

D 

6.31  D 

6.32 

D 

6.35  C 

6.36 

D 

7.1 

C 

7.2 

D 

7.4  C 

7.6 

D 

7.8  C 

9.2 

D 

9.3 

C 

9.7  D 

9.8 

C 

9.9  D 

9.10 

D 

9.12 

C 

9.13  C 

9.15 

D 

9.16  C 

9.20 

C 

9.21 

C 

9.22  (b)  D 

10.1 

M  < 1 

10.3  |x|  <  1 

10.4 

M  <  V2 

10.5 

Alls 

10.9  cc  <  1 

10.10 

N  <  i 

10.11 

— 5  <  x 

<  5 

10.13  -1  <  x  <  1 

i 

10.15 

-1  <  x  <  5 

10.17 

—2<x 

<  o 

10.18  <  x  <  - 

10.20  All  x 

4 

\x\  <  i^5 

10.21 

0  <  x  <  1 

10.22  No  x 

10.24 

10.25 

7 T 

7T 

z 

niT  —  — 

<  x  <  mr  + 

6 

6 
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13.4 

13.6 

13.8 


r_1/2,  _  ,_1/2*  (-l)"(2n-l)!! 

{  o  )  \  n  )  (2n)!! 

S(T  (1n)xn+1  (see  Example  2) 

S(T  {~n2)(~x2)n  (see  Problem  13.4) 


13.11  £ 
o 


{-l)nxn 
(2n  +  1)! 


13.14  £ 
0 


X 2n+l 
2n  +  1 


oo 

13.15  £ 
0 


(-1)” 


x2n+1 
2n  + 1 


Erf" 

— 

n 


odd  n 


13.21  x2  +  2z4/3  +  17a;6/45  •  •  • 

13.22  1  +  2x  +  5z2/2  +  8x3/3  +  65z4/24  •  •  • 

13.25  1  -  x  +  x2/3  -  z4/45  •  •  • 

13.27  1  +  x  +  x2/2  -  z4/8  -  rc5/15  •  •  • 

13.28  x  -  x2/2  +  a;3/6  -  x5 /12  •  •  • 

13.29  1  +  z/2  -  3x2/8  +  17x3/48  •  •  • 

13.34  x-x2  +  x3  -  13a;4/12  +  5x5/4  •  •  • 

13.35  1  +  ie2/3!  +  7a:4/ (3  •  5!)  +  31x6/(3  •  7!)  •  •  • 

13.41  e3[l  +  (x  -  3)  +  (x  -  3)2/2!  +  (x  -  3)3/3!  •  •  •  ] 

13.44  5  +  (x  -  25) /10  -  (x  -  25)2/103  +  {x  -  25)3/(5  x  104)  •  •  • 


14.8  For  x  <  0,  error  <  0.001;  for  x  >  0,  error  <  0.002. 


15.1  — x4/24  -  a;5/30  •  •  •  9*  -3.376  x  KT16 
15.3  x5/15  —  2x7/45  •  •  •  =  6.667  x  10-17 


15.6  12 

15.8  1/2 

15.10  -1 

15.12  1/3 

.3 

15.14  t  —  y ,  error 

<  10-6 

15.17  cos(tt/2)  =  0 

15.19  s/2 

15.20  (b)  5e 

15.21  (b)  0.937548 

15.22  (b)  1.202057 

15.23  (a)  1/2 

(c)  1/3 

15.24  (a)  — 7r 

(d)  0 

(f)  0 

15.27  (a)  l~~c=  1.3  x  10"5,  or  v  =  0.999987c 


(d)  1  - 


1.3  x  10" 


15.28  me 2  +  \mv2 

15-29  <b>  w  =  f H 

1  F 

15.30  (b )T=-- 

15.31  (a)  finite 


x3  3x5 

2P  +  W" 

i  e 2  e4 

1H - b  7 - 

6  360 

(b)  infinite 


16.6  C 


16.7  D 


16.9  -1  <  x  <  1 


16.10  -4  <  a;  <  4  16.13  -5  <  a;  <  1 

16.15  —x2/6  -  a:4/180  -  a:6/2835  •  •  • 

16.16  1  -  x/2  +  3x2/8  -  llx3 /48  +  19a;4/128  • 
16.19  —(a:  —  n)  +  {x  —  7t)3/3!  —  [x  —  7t)5/5!  •  • 

x  —  8  (x  —  8)2  5(x  —  8)3 


16.20  2  + 


12 


25  ■  32 


2s  •  34 


16.26  -1/3  16.28  1 

16.31  (b)  2.66  x  1086  terms.  For  N  =  15,  1.6905  <  S  <  1.6952 
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X 

y 

r 

e 

4.1 

1 

1 

72 

7t/4 

See  Fig.  5.1 

4.2 

-1 

1 

72 

37t/4 

See  Fig.  9.6 

4.3 

1 

-V3 

2 

— 7t/3 

4.5 

0 

2 

2 

7t/2 

See  Fig.  5.2 

4.7 

-l 

0 

1 

7 r 

See  Fig.  9.2 

4.9 

-2 

2 

2^2 

3-7t/4 

4.11 

73 

1 

2 

7t/6 

See  Fig.  9.1 

4.14 

72 

V2 

2 

7t/4 

4.15 

-1 

0 

1 

—  7T  or  7T 

See  Fig.  9.2 

4.17 

1 

-1 

72 

—  7r/4 

See  Fig.  9.5 

4.20 

-2.39 

-6.58 

7 

-110° 

=  —1.92  radians 

5.2 

-1/2 

-1/2 

1/72 

— 37r/4  or  57t/4 

5.4 

0 

2 

2 

7t/2 

5.6 

-1 

0 

1 

7T 

5.8 

1.6 

-2.7 

3.14 

-59.3° 

5.10 

-25/17 

19/17 

758/17 

142.8° 

5.12 

2.65 

1.41 

3 

to 

OO 

0 

5.14 

1.27 

-2.5 

2.8 

—  1.1  radians  =  —63° 

5.16 

1.53 

-1.29 

2 

O 

O 

1 

5.17 

-7.35 

-10.9 

13.1 

-124° 

5.18 

-0.94 

-0.36 

1 

201°  or  -159° 

5.19 

(2  +  3*)/13;  ( x  -  yi)/(x2  +  y 2) 

5.21 

(1  +  i)/6;  (x  +  1  -  yi)/[(x  +  l)2  +  y2} 

5.23 

(-6  -  3i)/5;  (1  -  x2  -  y2  +  2yi)/[{l  -  x) 

2  +  y2} 

5.26 

1  5.30  3/2 

5.31  1 

5.32 

169  5.34  1 

5.35  x 

II 

1 

II 

CO 

5.36 

a:  =  —1/2,  y  =  3  5.39 

x  =  y  =  any  real  number 

5.42 

x  =  —1/7,  y  =  —10/7  5.43 

{x,  y)  =  (0,  0),  or  (1, 1),  or  (-1, 1) 

5.45 

x  =  0,  any  real  y\  or  y  =  0,  any  real  x 

5.46 

y  =  —  x  5.48 

x  =  36/13,  y  = 

2/13 

5.49 

V  =  0,  x  =  l/2 

5.53 

Circle  (Find  center  and  radius) 

5.55 

Straight  line  (What  is  its  equation?) 

5.56 

Part  of  a  straight  line  (Describe  it.) 

5.57 

Hyperbola  (What  is  its  equation?) 

5.60 

Circle  (Find  center  and  radius) 

5.62 

Ellipse  (Find  its  equation;  where  are  the  foci?) 

5.63 

Two  straight  lines  (What  lines?) 

5.68 

v  =  2,  a  =  4 

6.2 

D  6.3  C 

6.4  D 

6.5 

D  6.10  C 

6.12  C 
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7.1 

All  z 

7.3 

All  z 

7.6 

W  <1/3 

7.7 

All  2 

7.10 

M  <  1 

7.12 

\A  <4 

7.14 

\z  —  2*1  <  1 

7.16 

jz  +  (*  —  3)|  <  1/V2 

8.3 

See  Problem  17.30 

9.3 

-9* 

9.4 

— e(l  +  *\/3)/2 

9.6 

1 

9.7 

3e2 

9.8 

—  \/3  +  * 

9.10 

-2 

9.11 

—1  —  * 

9.13 

-4  +  4* 

9.14 

64 

9.17 

-(1  +  0/4 

9.19 

16 

9.20 

i 

9.21 

1 

9.24 

4i 

9.26 

(1  +  iV 3)/2 

9.29 

1 

9.32 

3e2 

9.34 

4/e 

9.35 

21 

9.38 

1/V2 

10.3 

±1,  ±  * 

10.4  ±2,  ±2* 

10.7  ±V2,  ±*V 2,  +  1  +  * 

10.9  1,  0.309  ±0.951*,  -  0.809  +  0.588* 

10.16  +*,  (±y/3±i)/2 

10.17  -1,  0.809  ±  0.588*,  -  0.309  ±  0.951* 

10.18  ±(l  +  i)/y/2  10.21  ±(v/3  +  *) 

10.22  r  =  \/2,  0  =  45°  +  120°n:  1  +  *,  -  1.366  +  0.366i,  0.366  -  1.366* 

10.24  ±(v^+*)/2,  ±  (1  —  *v/3)/2,  ±(0.259  +  0966*),  ±(0.966-0.259*) 

10.25  0.758(1  +  *),  -0.487  +  0.955*,  -1.059  -  0.168*,  -0.168  -  1.059*, 
0.955  -  0.487* 


11.3  3(1  —  *)/V2  11.5  1  +  *  11.8  -41/9  11.9  4*/3 

12.25  sin  a:  cosh  y  —  *  cos  a;  sinh  y,  \/sin  x  +  sinh  y 

12.26  cosh  2  cos  3  —  *  sinh  2  sin  3  =  —3.72  —  0.51*.  3.76 


12.28  tanhl  =0.762 
12.30  -* 

12.33  *  tanhl  =  0.762* 

14.2  — *7t/2  or  3*7t/2 
14.5  Ln  2  +  5*7t/4 

14.8  -1,  (1  ±  *v/3)/2 

14.11  cos(Ln  2)  +  *  sin(Ln  2)  =  0.769  + 

14.14  0.3198*  —  0.2657 
14.18  -1 

14.23  e7r/2  =  4.81 

15.2  7r/2  +  r*7r  +  (*  Ln  3)/2 

15.4  *(2n7r  +  7T / 6) ,  *(2r*7r  +  57r/6) 

15.8  7r/2  +  2r*7r  ±  *Ln3 

15.12  *(2n7r  ±  7r/6) 

15.15  r*7T  +  37r/8  +  (*/4)  Ln  2 


12.32 

— 4*/3 

12.35 

—  cosh 

2  =  -3.76 

14.3 

Ln  2  + 

*7t/6 

14.6 

—  *7t/4 

or  7*7t/4 

14.10 

e-^2/4 

.639* 

14.15  e"™11111  =  0.0249 
14.20  1 


15.3  *(±7r/3  +  2mr) 

15.5  ±[tt/2  +  2r*7r  -  *  Ln(3  +  V8)] 

15.9  *(7r/3  +  n7r) 

15.14  2n7r  +  *Ln2,  (2n  +  l)7r  —  *Ln2 


16.3  \z\  =  \/2;  motion  around  a  circle  of  radius  \/2,  at  constant  speed  v  =  V2, 

constant  acceleration  a  =  \/2. 
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16.5  v  =  \zi  —  Z2I;  a  =  0 

16.6  (a)  Series:  3  —  2i;  parallel:  5  +  * 

(b)  Series:  2(l  +  i\/3);  parallel:  i\J 3 

16.8  [R  —  i{uCR 2  +  lo3L2C  —  loL)\/[(u>CR)2  +  (oj2LC  —  l)2];  this  simplifies  to 
L/(RC)  at  resonance. 


16.9  (b)  u>  =  1/y/LC 

16.12 

(1  +  r4  —  2r2  cos 

17.2  (V3  +  *)/2 

17.4 

i  cosh  1  =  1.54i 

17.6  — e-’1'2  =  —5.17  x  10-5 

17.7 

gTT/2  =  4  81 

17.9  7r/2±2n7r 

17.11 

i 

17.13  x  =  0,  y  =  4 

17.15 

M  <  l/e 

17.26  1 

17.27 

(c)  e-2("-‘)2 

17.28  1  +  (a2  +  b2)2(2ab)~2  sinh2  b 

17.30  ex  cosx  =  X)^°=o  (2 n^2xn/n\)  cosnn/A 

ex  sin  a:  =  X^^°=o  (2 n^2xn  jvS)  sinn7r/4 
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2.4 

2.8 

2.9 

2.12 

2.17 


=  5(2  +  1),  v  =  1 


x  =  y  —  11,  z  =  7 


inconsistent,  no  solution 


x  =  -2,y  =  l,z  =  l 


3.1  -11  3.5  -544  3.12  16 

3.16  A  =  -(K  +  ik)/(K-ik),  \A\  =  1 


4.12 

arccos(— 1/\/2)  =  37t/4 

4.14 

(a)  arccos(l/3)  =  70.5 

4.14 

(c)  arc  cos  ^/2/3  =  35.3° 

4.15 

(b)  8i  —  4j  +  8k 

4.18 

2i  —  8j  —  3k 

4.19 

i+j  +  k 

4.22 

Law  of  cosines 

4.24 

A2B 2 

5.1  r  =  (2i  —  3j)  +  (4i  +  3j)t  [Note  that  2i  —  3j  may  be  replaced  by  any  point 
on  the  line;  4i  +  3j  may  be  replaced  by  any  vector  along  the  line.  Thus,  for 
example,  r  =  6i  —  (8i  +  6j)t  is  just  as  good  an  answer,  and  similarly  for  all 
such  problems.] 

5.4  r  =  i+(2i  +  j)f 

5.6  (x  —  1) /I  =  (y  +  1)/ (—2)  =  (z  +  5)/2,  or  r  =  i  —  j  —  5k  +  (i  —  2j  +  2k )t 

5.8  x/2>  =  (z  —  4)/(— 5),  y  =  —2;  or  r  =  — 2j  +  4k  +  (3i  —  5k)f 

5.9  x  =  —  1,  z  =  7;  or  r  =  —  i  +  7  k  +  j  t 
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5.11  (x  —  4)/l  =  (z  —  3)/(— 2),  y  =  — 1;  or  r  =  4i  —  j  +  3k  +  (i  —  2k )t 

5.12  (x  —  5)/5  =  (y  +  4)/(— 2)  =  (z  —  2)/l;  or  r  =  5i  —  4j  +  2k  +  (5i  —  2j  +  k)f 

5.14  36a;  —  3y  —  22z  —  23  5.16  5a:  —  2y  +  2  =  35 

5.18  o;  +  6y  +  7z  +  5  =  0  5.20  x  —  4y  — z  +  5  =  0 

5.21  cos  9  =  25/(7v/30 )  =  0.652,  9  =  49.3° 

5.22  cos  9  =  2/V6,  9  =  35.3° 

5.24  r  =  2i  +  j  +  (j  +  2k)f,  d=2y/6/5 

5.25  r  =  i  —  2j  +  (4i  +  9j  —  k)t,  d  =  (3 )/7 

5.29  2/V6  5.31  5/7  5.33  ^43/15 

5.34  v/TlTTO  5.36  3  5.38  arc  cos  ^21722  =  12.3° 

5.39  Intersect  at  (3,2,0);  cos 9  =  5/V60,  9  =  49.8° 

5.42  1/V5  5.43  20/^21  5.45  d  =  V2,t=-l 


6.2 


-25 

14 


A  +  B 
B2  = 


1  4  ) 

0  4  )’ 


-1 

5 


5A  = 


g  J  ,  det(5A)  =  52  det  A  for  a  2  x  2  matrix 


6.4  You  should  have  found  BA,  C2,  CB,  C3,  C2B,  and  CBA;  all  others  are 


/  32 

!2\ 

/  36 

46 

14 

-36\ 

meaningless.  C2B  =  53 

7  ’ 

CBA  =  40 

22 

1 

91 

V  — 13 

-9/ 

V— 8 

-2 

1 

-29/ 

6.13 


5/3  -3 
-1  2 


6.19  A"1  = 


1  2 
-3  1 
4 


6.22  A"1  =  —  -7  -1 

12  1  1-5 

6.30  sinfcA  =  Asinfc  =  ( 


(x,y)  =  (5,0) 

(x,y,z)  =  (1,-1,  2) 

) ,  cos  k  A  =  I  cos  k  = 


cos  k 
0 


0 

cos  k 


pkA  _  (  coshk 

sinh  k  \ 

„ikA  _  ( 

cos  k 

i  sin  k 

sinh  k 

cosh  k  J  ’ 

e 

i  sin  k 

cos  k 

) 

7.1 

Not  linear 

7.4 

Linear 

7.6 

Not  linear 

7.8 

Not  linear 

7.11 

Not  linear 

7.12 

Linear 

7.14 

Not  linear 

7.15 

Linear 

7.22 

D  =  1,  rotation  9  =  —45° 

7.24 

D  = 

— 1,  reflection  line 

7.26  D  =  —1,  reflection  line  x  =  2y 


7.30  R  = 


S  = 


R  is  a  90°  rotation  about  the  z  axis,  S  is  a  90°  rotation  about  the  x  axis. 
7.32  180°  rotation  about  i  —  k 

7.35  Reflection  through  the  (x,y)  plane  and  90°  rotation  about  the  z  axis. 
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8.1 

In  terms  of  basis  u  =  1(9, 0,7),  1 

v  =  I 

9 

(0,- 

'9, 13),  the  vectors  are: 

5u  —  2v,  2u  +  v,  3u  +  6v. 

8.3 

Basis  i,  j,  k 

8.6 

V 

=  3A  —  B 

8.19 

H 

II 

as 

II 

II 

S 

II 

o 

8.20 

X  - 

=  -z,y  =  z 

8.23 

For  A  =  3,  x  =  2y;  for  A  =  8,  x  = 

-2  V 

8.25 

For  A  =  2:  x  =  0,  y  =  —3 z;  for  A  = 

=  -3: 

X  = 

-5 y,  z  =  3 y\ 

for  A  =  4:  z  =  3 y,  x  =  2 y 

8.26 

r  =  (3,1,0)  +  (—1,1,1  )z 

/  0  i  3\ 

9.4  A*  =  -2  i  2  0  ,  A’1 

V-i  o  o/ 

9.14  CTBAT,  C^M^C,  H 

10.1  (b)  d  =  8 

10.2  The  number  of  basis  vectors  given  is  the  dimension  of  the  space.  We  list  one 
possible  basis;  other  bases  consist  of  the  same  number  of  independent  linear 
combinations  of  the  vectors  given. 

(b)  (1,0, 0,5, 0,1),  (0,1, 0,0, 6, 4),  (0,0, 1,0, -3,0) 

10.3  (a)  Label  the  vectors  A,  B,  C,  D.  Then  cos(A,B)  =  1/VT5, 
cos(A,  C)  =  y/2/3,  cos(B,  D)  =  ^17/690. 

10.4  (b)  ei  =  (0,  0, 0, 1),  e2  =  (1, 0, 0,  0),  e3  =  (0, 1, 1,  0)/y/2 

10.5  (b)  ||  A ||  =  7,  ||B ||  =  \/60,  |Inner  product  of  A  and  B|  =  y/5 


11.5  9  =  1.1  =  63.4° 


11.11 


1 

-1 


,  not  orthogonal 


In  the  following  answers,  for  each  eigenvalue,  the  components  of  a  corresponding 
eigenvector  are  listed  in  parentheses. 


11.12 


11.18 


11.22 


11.23 


11.26 


4 

-1 

4 
2 

-3 

-4 

5 

-2 

18 

9 

9 

4 

1 

1 


11.29  D  = 


(1,1) 

(3,-2) 


11.15 


11.20 


1 

-1 

5 

3 

4 

-2 


(0,0,1) 
(1,  -1,0) 
(1,1,0) 
(0,-1, 2) 
(1,2,1) 
(-5,2,1) 


(2.1.3) 

(0,  —3, 1) 

(5, -1,-3) 

(-4,1,1) 

(1,2,2) 

(0,  1,1) 

(2,  2,  —1)  The  two  eigenvectors  corresponding  to  the  eigenvalue  9 
(1,  —1, 0)  may  be  any  two  vectors  orthogonal  to  (2, 2,  —1)  and  or- 

(1. 1. 4)  thogonal  to  each  other. 

(1,1,1)  11.27  D  = 

(1,  -1,0) 

(1,1, -2) 

11  0^ 


c  =  i 


V2  (-1 


0  1 

11.41  A  =  1,  3;  U  = 


,  C  = 


-f1 

75  l2 


-2 


11.31  D  = 


C  = 


1 


( 1  -1 


V2  Vi 


1 

71 
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11.44  A  =  3,  -7;  U  = 


-3-4i 

5 


5V2  V3  -  4i 

11.52  60°  rotation  about  —iy/2  +  k  and  reflection  through  the  plane  0  =  xy/2 

11.53  180°  rotation  about  i  +  j  +  k 
11.56  45°  rotation  about  j  —  k 

11.58  M10  = 


1 


11.59  eM  =  e 


1  +  4-610  2-2  •  610 

2  —  2  •  610  4  +  610 

cosh  1  —  sinh  1 

—  sinh  1  cosh  1 


12.2  3a;'2  -  2 y'2  =  24  12.3  10x'2  =  35 

12.6  3x'2  +  V3y'2  -V 3  z'2  =  12 

12.15  y  =  2x  with  u  =  \J 3k /m ;  x  =  —2 y  with  u)  =  ■ \JSk/m 
12.17  x  =  —2 y  with  ui  =  y/ 2k /m\  3x  =  2 y  with  u>  =  yj2k/{3m) 
12.19  y  =  —x  with  u>  =  y/3k/m\  y  =  2x  with  uj  =  \J 3k/(2m ) 
12.22  y  =  — x  with  ui  =  \J2 k/m\  y  =  3x  with  u>  =  \J 2k/ (3m ) 


13.6  The  cyclic  group 

13.11  The  four  matrices  of  the  symmetry  group  of  the  rectangle  are: 


This  group  is  isomorphic  to  the  4’s  group. 

13.21  SO(2)  is  Abelian;  SO(3)  is  not  Abelian. 

14.3  x,  cosx,  xcosa;,  e^cosx  14.5  1,  x  +  x3,  x2,  x4,  x5 

14.6  Not  a  vector  space  14.8  1,  x2,  x4,  x6 


15.3  (a)  (x  -  4)/l  =  (y  +  l)/(-2)  =  (z  -  2)/(-2);  or  r  =  (4,  -1, 2)  +  (1,  -2,  -2)t 

(b)  x  —  5y  +  3z  =  0  (c)  5/7 

(d)  5-v/2/3  =  2.36  (e)  arc  sin  19/21  =  64.8° 

15.5  (a)  y=  7,  (x  -  2)/3  =  (z  +  l)/4;  or  r  =  (2, 7,  -1)  +  (3, 0, 4 )t 

(b)  x  —  Ay  —  9z  =  0  (c)  arc  sin(  ||  \/2 )  =  41.8° 

(d)  12/V98  =  1.21  (e)  V29/5  =  1.08 

15.7  You  should  have  found  all  except  ATBT,  BAT,  ABC,  ABTC,  B_1C,  and 
CBT,  which  are  meaningless. 


15.9 

15.13 


BtAC  = 


Area  =  ^ 


15.14  x"  =  —x,  y"  =  —y,  180°  rotation 

15.15  x"  =  —v.  v"  =  x:  90°  rotation  of  vectors  or  —90°  rotation  of  axes 


15.18  1  (1,1) 

-2  (0,1) 

15.22  1  (1,0,1) 

4  (0,1,0) 

5  (1,0,  -1) 

15.27  3x'2  -  y'2  -  5z'2  =  15,d=V5 


15.20  1  (1,1) 

9  (1,-1) 

15.24  2  (0,4,3) 

7  (5, -3, 4) 

-3  (5, 3, -4) 

15.29  3x/2  +  6 y'2  -  4 Y2  =  54,  d  =  3 


Chapter  4 


Answers  to  Selected  Problems  789 


Chapter  4 


1.1 

1.3 

1.4 
1.7 
1.13 
1.19 
1.8' 
1.14' 
1.20' 
1.22' 


du/dx  =  2  xy2  /  (x2 

+  y2)2,  du/dy  =  —2 x2y/(x2 

+  y 2) 

dz/du  =  u/(u2  +  v 

2  +  w2) 

At  (0,  0),  both  =  0 

;  at  (-2/3,  2/3),  both  =  -4 

2x 

1.9  2a;(l  +  2  tan2  9) 

1.11 

4r2  tan  6 

1.15  r2  sin  29 

1.17 

0 

1.21  — 4cccsc 2  9 

1.23 

—2rA/x3 

1.10'  2y  +  4y3/x2 

1.12' 

2 y2  sec2  9  tan  9 

1.16'  2r  tan2  9 

1.18' 

4x(tan  9  sec2  9)  (tan2  9  +  sec2  9) 

-8  r3/x3 

1.24'  -8 y3/x3 

2  y 
4  r 

2rsin20 
2 y  sec2  9 
— 2ry4  /  (r2 


2\2 


2/2) 


2.1  j/  +  y3  / 6  —  x2y/2  +  xAy/ 24  —  x2y3 / 12  +  y5/120  •  •  • 

2.3  x  —  x2 /2  —  xy  +  x3 /3  +  x2y/2  +  xy2  ■  ■  ■ 

2.5  1  +  xy/2  —  x2y2/8  +  x3y3/ 16  —  5x4y4/128  •  •  • 

2.8  e*  cosy  =  1  +  x  +  (x2  —  y2)/ 2  +  ( x 3  —  3xy2)/6  ■  ■  ■ 


4.2 

2.5  x  10"13 

4.4 

12.2 

4.6  9% 

4.8 

5% 

4.10 

4.28  nt 

4.11  3.95 

4.15 

8  x  1023 

5.1 

e~v  sinht  +  zsint 

5.3 

2r(g2  —  p2) 

5.7 

(1—2  b  —  e2a)  cos  (a 

- b ) 

6.2 

y'  =  1,  y"  =  0 

6.3 

y'  =  4(ln  2  —  l)/(2  In  2  —  1) 

6.5 

2x  +  lly  -  24  =  0 

6.6 

1800/113 

6.10 

x  +  y  =  0 

6.11 

y"  =  4 

7.1 

dx/dy  =  z  —  y  +  tan  (y  +  z 

),  d2x/dy2  = 

\  sec 3(y  +  z)  +  i  sec(y  +  z)  - 

7.4 

dw/du  =  —2  (rv  +  s 

)w ,  dw/dv  =  —2 (ru 

+  2s)w 

7.7 

(dy/d9)r  =  x,  ( dy/d9)x  = 

r2 /x,  ( d9/dy)x  =  x/r 2 

7.8 

dx/ds  =  —19/13,  dx/dt  = 

:  -21/13,  dy/ds  =  24/13,  dy/dt  =  6/13 

7.10  dx/ds  =  1/6,  dx/dt  =  13/6,  dy/ds  =  7/6,  dy/dt  =  —11/6 
7.13  ( dp/dq)m  =  -p/q,  ( dp/dq)a  =  l/(acosp  -  1), 

( dp/dq)b  =  1  —  6 sing,  ( db/da)p  =  (sin p)(b sing  —  1)/ cosg, 

(■ da/dq)m  =  [g  +  p(acosp  -  l)]/(gsinp) 

7.15  ( dx/du)v  =  (2yv2  —  x2)/(2yv  +  2 xu),  ( dx/du)y  =  ( x2u  +  y2v)/(y2  —  2xu 2) 

7.17  (dp/ds)t  =  -9/7,  ( dp/ds)q  =  3/2 

7.19  ( dx/dz)s  =  7/2,  ( dx/dz)r  =  4,  ( dx/dz)y  =  3 


8.3  (—1,2)  is  a  minimum  point  8.4  (—1,-2)  is  a  saddle  point 

8.8  9  =  7t/3;  bend  up  8  cm  on  each  side 

8.9  l=w  =  2h  8.11  6  =  30°,  x  =  yV3  =  z/2 

8.13  (4/3, 5/3)  8.16  m=  5/2,  6=1/3 


9.2  r:l:s  =  V 5:(l  +  \/5):3  9.4  4/v^  by  6/^3  by  10/^3 

9.6  V  =  1/3  9.8  (8/13,12/13) 

9.12  Let  legs  of  right  triangle  be  a  and  b ,  height  of  prism  =  h;  then  a  =  b, 
h=  (2-  V2)a. 
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10.2  4,  2 

10.6  d=  2 

10.10  (a)  maxT  =  i,  minT  =  —  ^ 

(b)  maxT  =  1,  minT  =  —  | 

(c)  maxT  =  1,  minT  =  —  ^ 

10.13  Largest  sum  =  3  arc  sin(l/-\/3) 

11.1  z  =  f(y+2x)  +  g(y +  3x) 

11.11  H  =  pq  —  L 


10.4  d=  1 

10.7  i\/TT 

10.12  Largest  sum  =  180° 

Smallest  sum  =  3  arccos(l/v/3) 
=  164.2° 

105.8°,  smallest  sum  =  90° 

11.6  d2y/dz2  +  dy/dz  —  5y  =  0 


12.1  ^x~x/2  smx 

12.3  dz / dx  =  —  sin(cos  x)  tan  x  —  sin(sin  x)  cot  x 

12.4  i  sin  2 

12.7  ( du/dx)y  =  — e4,  ( du/dy)x  =  e4/ In  2,  ( dy/dx)u  =  In  2 

12.10  dy/dx  =  ( ex  —  l)/x 

12.12  (2x  +  l)/ln(a:  +  x2)  —  2/ln(2ar) 

12.14  tt/(4 y3) 


13.2  (a)  and  (b)  d  =  4/\/l3 

13.4  —  esc  0  cot  0 

13.5  —  6rr,  2x2  tan  dsec2  9,  Ax  tan  0  sec2  6 

13.9  dz/dt  =  1  +  (t/z)(2  —  x  —  y),  z  ^  0 

13.10  [x  In  x  —  (y2/x)]xv  where  x  =  r  cosd,  y  =  r  sind 

13.13  -1 

13.14  ( dw/dx)v  =  {df/dx)a<t  +  2{df/ds)x<t  +  2{df/dt)x<s  =  fi  +  2 f2  +  2 f3 

13.18  a/W 3  13.21  T(2)  =  4,  T(5)  =  -5 

13.23  tcott  13.25  —ex/x 

13.29  dt  =  3.9 
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2.1 

3 

2.3  4 

2.5  je2  —  2.7 

5/3 

2.9  6 

2.11 

36 

2.13  7/4 

2.15  3/2  2.17 

5  In  2 

2.19  32 

2.21 

131/6 

2.23  9/8 

2.25  3/2  2.27 

32/5 

2.29  2 

2.31 

6 

2.33  16/3 

2.36  1/6  2.37 

7/6 

2.39  70 

2.41 

5 

2.43  9/2 

2.45  46fc/15  2.47 

16/3 

2.49  1/3 

3.2 

(b)  Ml2/ 12 

(c)  Ml2/ 3 

3.3 

(a)  M  = 

=  140 

(b)  x  =  130/21 

(c)  Im  =  6.92 M 

(d)  /  = 

150M/7 

3.5 

(a)  Ma2 / 3 

(b)  Ma2/ 12 

(c)  2Ma2/3 

3.7 

(a)  M  = 

=  9 

(b)  (x,y)  =  (2,4/3) 

(c)  Ix  = 

=  2 M,  Iy  =  9M/2 

(d)  Im  =  13M/18 

3.9 

(a)  1/6 

(b)  (1/4, 1/4, 1/4) 

(c)  M  =  1/24,  z 

3.11 

(a)  M  = 

=  (5^5  -  l)/6  =  1.7 

(b)  x  =  0,  y  =  (313  +  15^)7620  =  0.56 

3.14  V  =  2ir2a2b,  A  =  An2ab ,  where  a  =  radius  of  revolving  circle,  and  b  = 
distance  to  axis  from  center  of  this  circle. 

For  area,  (x,y)  =  (0,  |r/7r),  for  arc,  (x,y)  =  (0,2r/n) 


3.15 


Chapter  6 


Answers  to  Selected  Problems  791 


3.18  s=  [3v/2  +  ln(l  +  v/2)]/2 

3.20  13tt/3 

3.21  sx  =  [  51v/2-ln(l  +  v/2)]/32,  sy  =  13/6,  s  as  in  Problem  3.18 
3.23  (149/130,0,0) 

3.25  I /M  has  the  same  numerical  value  as  x  in  Problem  3.21 

3.26  2M/3  3.27  149M/130  3.29  2  3.30  32/5 

4.1  (b)  x  =  y  =  |a/7r  (c)  I  =  Ma2/ 4  (e)  x  =  y  =  2a/i r 

4.2  (c)y=|a/7r 

(d)  4  =  Ma2/A ,  Iv  =  hMa2/\,  Iz  =  2>Ma2 /2 

(e)  y  =  2a/ tt 

(f)  x  =  6a/5,  Ix  =  48Ma2/175,  Iv  =  288Ma2/175,  Iz  =  48Ma2/25 

(g)  A  =  il'K~  5^3 )a2 

4.4  (b)  (0,0,  a/2)  (c)  2Ma2/3  (e)  (0, 0, 3a/8) 

4.5  7tt/3 

4.11  12tt 

4.12  (c)  M=  (16p/9)(37r-4)  =  9.64p 

/  =  (128p/152)(157r  -  26)  =  12.02p  =  1.25M 
4.14  7r(l  —  e_1)/4  4.16  u2  +  v2  4.19  7t/4 

4.22  12(1  +  367T2)1/2  4.24  pGna/2  4.26  (a)  f  Mo2 

4.27  2-7T ah  (where  h  =  distance  between  parallel  planes) 


5.1 

§ttv/30 

5.3 

7r(373/2  -  l)/6 

5.5 

87t  for  each  nappe 

5.6 

4 

5.8 

\/6  +  ln(-\/2  +  V3 ) 

5.9 

5.12 

M  =  0,  ( x,y,z )  =  (i,  |) 

5.14 

5.16 

x  =  0,y=l,z  =  [32/(9^^275 

=  0.716 

6.2 

45(2  +  v/2)/112 

6.3 

1571-/8 

6.4 

(a)  \MR 2  (b)  §Mi?2 

6.6 

(a)  (4tt-3V3)/6 

6.7 

(8? r  -  3V3)(4t r  -  3 

6.8 

(b)  27/20 

6.10 

(a)  (x,y)  =  (tt/2,  tt/8) 

6.10 

(c)  3M/8 

6.12 

(abc)2/ 6 

6.14 

16a3/3 

6.15 

4  =  |Ma2,  Iy  =  |Ma2 

6.16 

x  =  y  =  2a/5 

6.18 

(0, 0, 54/6) 

6.19 

4  =  4  =  20Mh2/21,  4  =  10M/i2/21,  /m  =  65M/i2/252 

6.21 

TrGph(2  -  V2) 

6.24 

(0,0,2c/3) 

6.26 

i  sinh  1 

6.27 

e2  —  e  —  1 

Chapter  6 

3.1  (A  •  B)C  =  6C,  (A  x  B)  •  C  =  A  •  (B  x  C)  =  -8, 
A  x  (B  x  C)  =  -4(i  +  2k) 

3.3  -5 

3.6  v=  (2/v/6)(A  x  B)  =  (2/v/6)(i-7j -3k), 
r  X  F  =  (A  -  C)  X  B  =  3i  +  3j  -  k, 

n  •  (r  X  F)  =  [(A  -  C)  X  B]  •  C/|C|  =  8/^26 

3.7  (a)  Hi  +  3j  -  13k,  (b)  3,  (c)  17 
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3.9  -9i-23j  +  k,  l/v7^ 

3.15  Ui  •  u  =  — u3  •  u,  mui  X  u  =  n2u2  X  u 

3.17  a  =  (u  ■  r)u  —  lu2 r;  for  r  _L  ur,  a  =  —oj2 r,  |a|  =  v2/r. 

3.19  (a)  16i  -  2j  -  5k  (b)  8/^6 

3.20  (b)  12 

4.2  (a)  1=2 

(b)  v  =  4i  —  2j  +  6k.  |v|  =  2-\/l4 

(c)  {x  —  4)/4  =  (y  +  4)/(— 2)  =  (xr  —  8)/6,  2*  -  y  +  3z  =  36 

4.5  \dr / dt\  =  \[2\  |d2r/rit2|  =  1;  path  is  a  helix. 

4.8  dr/dt  =  er(dr/dt)  +  eg(r  d9/dt)\ 

d2r/dt2  =  er[d2r / dt2  —  r(dd/dt)2]  +  eg[r  d29/dt2  +  2(dr  /  dt){dO  /  dt)\ 

6.2  -i 

6.4  716/(31/5) 

6.6  6x  +  8y-z  =  25,  {x  —  3)/6  =  (y  -  4)/8  =  (z  -  25)/(— 1) 

6.9  (a)  2i  —  2j  —  k  (b)  5/-y/6 

(c)  r  =  (1,1,1)  +  (2,  —2,  — l)t 

6.12  (a)  2-^/5,  -2i  +  j  (b)3i  +  2j  (c) 

6.14  (b)  Down,  at  the  rate  llv^ 

6.17  er  6.19  j 

7.1  V-r  =  3,  Vxr  =  0 

7.2  V-r  =  2,  Vxr  =  0 

7.4  V  •  V  =  0,  V  x  V  =  -(i  +  j  +  k) 


7.6 

V  •  V  =  5 xy,  V  X  V  =  ixz  —  j yz  +  k(y2 

—  X2) 

7.7 

V-V  =  0,  VxV  =  ix- 

j  y  —  karcosy 

7.10 

0 

7.11 

—(x2  +  y2)/{x2  —  y2)3/2 

7.13 

2  xy 

7.14 

0 

7.16 

2{x2  +  y2  +  z2) 

1 

7.19 

2/r 

8.1 

-11/3 

8.2 

(a)  — 47t  (b)  —16  (c) 

8.3 

(a)  5/3  (b)  1 

(c)  2/3 

8.4 

(a)  3  (b)  8/3 

8.7 

(b)  0 

(d)  27T 

8.8 

yz  —  x 

8.9 

3  xy  - 

q  9 

x  yz  —  z 

8.11 

— y  sin2  x 

8.14 

—  arc  sin  xy 

8.18 

(a)  7r  +  7t2/2  (b)  7t2/2 

9.2 

40 

9.4 

—3/2 

9.7  nab 

9.8 

24t r 

9.10 

-2D 

9.11  2 

10.2 

3 

10.4 

367T 

10.5  4tt  -52 

10.7  48tt  10.9  1 

'0, 

r  <  i?i; 

10.12 

II 

-e- 

(fc/27re0)  ln(i?i/r), 

VI 

c/ 

r  <  R2] 

/fc/27re0)ln(i?i/i?2), 

r>  R2. 

11.2 

2  ab2 

11.3 

0 

11.4  -12 

11.5 

36 

11.6 

457T 

11.7  0 

11.10 

—  67T 

11.12 

187T 

11.15  -2nV2 

11.18 

A  = 

xz  — yz2  - 

y2/2)i  +  {x2 /2  — 

x2z  +  yz2 / 2  —  yz) j  +  Vit,  any  u 

11.20  A  =  i  sin  zx  +  j  cos  zx  +  \tezy  +  Vu,  any  u 
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12.1  (sin(7cos0)C 

12.7  (a)  9i  +  5j  —  3k  (b)  29/3  12.9 

12.11  (a)  grad  (f>  =  —  3yi  —  3xj  +  2zk 

(c)  2x  +  y  —  2z  +  2  =  0 ,  r  =  (1, 2,  3)  +  (2, 1,  -2)7 

12.13  (a)  6i  —  j  —  4k  (b)  53-V2(6i  -  j  -  4k) 

(d)  531/2  (e)  531/2 


12.18  Not  conservative 
12.21  4 
12.27  4 


(a)  1/2 
12.23  192tt 
12.29  10 


12.25 

12.31 


24 

(b)  -y/3 

(c)  same  as  (a) 

(b)  4/3 
—  18-7T 
29/3 
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Amplitude 

Period 

Frequency 

Velocity 

Amplitude 

2.2 

2 

7t/2 

2/t: 

8 

2.3 

1/2 

2 

1/2 

7t/2 

2.6  s  =  6  cos(7t/8)  sin(27) 

6cos(7t/8) 

=  5.54 

7 r 

1/7T 

12cos(7t/8) 

=  11.1 

2.8 

2 

47T 

1/(4tt) 

1 

2.10 

4 

7 r 

1  j  7T 

8 

2.11  q 

3 

1/60 

60 

I 

360tt 

1/60 

60 

2.13 

A  =  maximum  value  of  9,  ui  =  \/g/l 

2.16 

t  =  4.91  =  281° 

2.19 

II 

h 

T— 1 

II 

=  1/4,  V  =  1/4,  A  =  1 

2.21 

y  =  20  sin  - 

-  67),  dy/dt  =  —  607tcos  hn(x 

—  67) 

2.23 

y  =  sin8807r((x/350)  —  t) 

2.25 

y  =  10  sin[7r(x  — 

3  •  1087)/250] 

3.6 

sin(2x  +  ^7 r) 

4.5 

TT-'  +  i 

4.6  2/t r 

4.8 

0 

4.11 

1/2 

4.14  (a)  2n/3  (b)  7 r 

4.15 

(a)  3/2 

X  — > 

—27 r 

—7 r 

—  7t/2 

0 

tt/2 

7 r 

27 r 

6.2 

1/2 

0 

0 

1/2 

1/2 

0 

1/2 

6.4 

-1 

0 

-1 

-1 

0 

0 

-1 

6.6 

1/2 

1/2 

1/2 

1/2 

1/2 

1/2 

1/2 

6.8 

1 

1 

1  —  |tt 

1 

1+  i7T 

1 

1 

6.10 

7 r 

0 

7t/2 

7 r 

tt/2 

0 

7 r 
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7.1 

7.2 


7.7 

7.11 


8.2 


8.6 

8.11 


8.14 


8.19 

8.20 


9.2 

9.5 


1  OO  1 

m=~  +  1  £  1 

z  7 r  z '  n 

—  OO 

odd  n 


(1  -  i)eto  +  (1  +  i)e-fa  -  i(e2ix  -  e~2ix ) 


/0*0 


-  E 

—  OO 

odd  n 


1  elx  -  e 

f(x)  =  L  +  e— e- 

n  4z 


1  +  i 
3 

1  i 
n2TT  2  n 

—ix  2 
7 r 


^ Six  _|_  ^ 


E 

—  OO 

even  n^O 


2n 


E 


—  OO 

even  n^O 


n2  —  1 


1  +  i 


—Six 


l 


7TX 


3to  1 


5to 


f(x)  =  -  H —  I  cos  — - - -  cos  — —  +  -  cos  —— 


1 

—  |  sin  ■ 


•TO 


l 


l 

2  2to 
2Sm  — 


l 

1  .  3to 
3Sm  — 


1  .  5to 

rm— 


a*)  =  5  +  ;E 
2  00 
(a)  f{x)  =  to  +  4^] 


—  sin  (n  =  2,  6,  10,  •••) 

n  L 

‘  (-i) 


■  cos  nx 


n* 


.  O  OO  / 

(b)  /(®)  =  £T  +  2E  To  +  — 


emx,  0 


,  ,  ,,  ,  8  ^  n(— l)n+1  .  o 

(a)  fix)  =  -  E  4n2  _  x  sm  2™7ra 

,,,,,,  2  4  ^  cos2 riTO  2  ^  1 

(b)  /(*)  =  ---E  4^23T  =  --E4^T 

1  — OO 

,,  .11^1  n  1  ^ 

/(x)  =  E  £2cos2^  +  ^E 


odd  n—1 


,  2  9 


2to  1 


4to 


cos  ■ 


2^C0S  — 


-  1 

(-l)n+1 

n 

1  87HE 

4^C°S  — 


2  .  6to 

6Sm  — 


'3y/3 


1  \  .  2to 

„  ,  —  sin - 

87T2  7T  /  3 


3^3 


sin  2mrx 


4TO 


_  -  sm - 

32tt2  2tt  /  3 


1  .  6to  /  3>/3  1  \  .  8to 

—  sm - - -  H - sm - •  •  • 

y  1287T2  4ti -J  3 


- sm 

3tt  3 


(a)  \  In  |1  -  x2\  +  \  In  |(1  -  ar)/(l  +  x) 

4  ~  1  . 


/(*)  =  -  E  1 

-7T  z 


sm  nx 


7r  z — '  n 

odd  n—1 
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9,9  if; 


2 

fs(x)  =  —  (sin  x  +  sin  3x  +  |  sin  5x  +  |  sin  lx  +  |  sin  9x  +  A  sin  lire  •  •  • ) 

(-l)r 


9-20  fc(x)  -  „  +  2  E 


■  COS  T17TX 


2  ~  (_l)n+l 

jsix)  =  —  >  - smriTrx 

t r  ^  n 


O  V — "V  1 

- T  /  — O  St 

7T°  ^ 


sin  n7rx 


7r°  z '  nJ 
odd  n— 1 


p  /  \  J-  1  1  1  v — >  1  . 

/ o(x)  =  —  H - ^  >  — r  cos2n7rx - >  —  sm2mra; 

3  7T2  ^  n2  71-^71 


20 


7TX 


3nx 


5nx 


9.22  /c(x)  =  15 - cos  —  —  -  cos  — —  +  -  cos  — — 


20 


20 


20 


20 


TTX 


2nx 


3  nx 


5nx 


6irx 


fs(x)  =  —  3  sin  —  —  —  sin  — —  +  —  sin  — —  +  —  sin  - —  sin  — — 


fp(x)  =  15- 


8  h 


20 


20  2 

OO 


V  Ei 

^  n 


20  3 

mrx 


20 


20  6 


20 


odd  n=  1 


sin - 

n  10 


9.23  f(x,  0)  =  (  sin  —  - 


3nx 


32 


sin  ■ 


1  .  57ra; 

52  Sln  ~T 


10.1  Relative  intensities  =  1:0:0:0:^:0:^:0:0:0 
10.3  Relative  intensities  =  l:25:|:0:^:^:^:0:gj:l 


10.5  I(t) 


1-2  E 


even  n— 2 


n2  —  1 


■  cos  120mri 


■  sin  1207rf 


10.6  V(i)  =  50  —  V  -Eosl20n7rf 
'  7 t2  ^  n2 

odd  n— 1 


20  (-l)r 


10.7  /(*)  =  --E 

-7T  Z ' 


sin  120n7r7 


10.10  V(i)  =  75  - 


l 

200 

7r2  ^ '  n* 

odd  n— 1 


oo 

E  ^ 


cos  120n7rt  — 


100 


OO 

E1 

^  n 


sin  120mrf 


7r  * — '  n 

l 


Relative  intensities  =  1.4  :  0.25  :  0.12  :  0.06  :  0.04 


11.5 

tt2/8 

11.7  tt2/6 

12.2 

/s(a:)  = 

2  1  -  cos  a  . 

—  /  - sin  ax  da 

ft  Jo  a 

12.4 

/(z)  = 

I"00  sin  an  -  sin(an/2 )  ^  ^ 

7  _oo 

12.6 

/fa)  = 

/'°°  sin  a  — a  cos  a  , 

/  - ; — ^ - elax  da 

J- OO  ?7rQ! 

12.8 

/fa)  = 

ziytr1-- 

11.9  — 
16 


1 

2 
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12.10  f{x)  =  2  aa  Sm  00  'lax 


da 


—  OO 

oo 


12.11 


m=U. 

2  rc 

12.13  fc{x)  =  -  / 

7t  J  0 

2  c 

12.16  /C(rc)  =  -  / 

^  Jo 
2  /'c 

12.18  /s(x)  =  - 


12.19  fs{x)  =  - 


'I'KOi 

cos("/2)e<«.rft, 

1  —  or 

sina7r  —  sin(a7r/2) 


cos  ax  da 


a 


cos(a7r/2) 

1  —  a2 
sin  a  —  a  cos  a 


cr 

aa  —  sin  aa 


cos  arc  da 
sin  ax  da 


■  sin  ax  da 


a * 


12.21  g(a)  =  a(2n)  1/2e  “  a  /2 

1  C00  1  p-ia-n- 

12.25  (a)  /(*)  =  —J  + 


4  C 

12.28  (a)  /c(ar)  =  -  / 
Jo 


,  1  —  a2 

cos  3  a  sin  a 


elax  da 
cos  ax  da 


....  4  [°°  sin  3a  sin  a  . 

(b)  js{x)  =  —  I  - smaxaa 


12.30  (a)  fc(x)  =  -  f 
77  JO 


0  a 
1  —  cos  2a 


cos  ax  da 


(b)  fs{x)  =  - 

7 r 


2a  —  sin  2a 


a^ 


sin  ax  da 


r  4  00  1 

13.7  f(x)  =  — - V  —cosnx 

2  7r 

odd  n 

13.10  (d)  -1,  -1/2,  -2,  -1 

13.8  (b)  1 

1  A  00 

-  «  -  .  /  N  *  /  N  1  COS  n7rX 

13.14  (a)  f(x)  =  3  +  £  n2 

13.15  -tt/4 

13.23 

(b)  7r4/90 
tt2/8 

Chapter  8 

1.5  x  =  —Alo~2  sin  cot  +  vq t  +  Xq 

1.7 

x  =  ( c/F)[(m2c 2  +  F2t2)xl2  —  me] 

2.2  (1  -  x2)1/2  +  (1  -  y2)1/2  =  C,  C  = 

2.3  lny  =  H(csca;  —  cot  a;),  A  =  \/3 

2.6  2y2  +  1  =  H(a;2  -  l)2,  A  =  1  2.7 

2.9  yey  =  aex,  a  =  l  2.13 

2.19  (a)  I/I0  =  e~0  5  =  0.6  for  s  =  50  ft 

y2  =  8  +  eK~x2,  K  =  1 
y  =  1,  y  =  -1,  x  =  1,  x  =  -1 

Half  value  thickness  =  (In  2)///  =  69.3  ft 
(b)  Half  life  T  =  (In  2)/A 

2.20  (c)  t  =  RC,  t  =  L/R.  Corresponding  quantities  are  a,  A  =  (In2)/T,  y,  1/t. 

2.22  N  =  NoeKt  —  (R/ K)(eKt  —  1)  where  Nq  =  number  of  bacteria  at  t  =  0, 
KN  =  rate  of  increase,  R  =  removal  rate. 

T  =  100[1-  (lnr)/(ln2)] 


2.23 
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2.26  (a)  k  =  weight  divided  by  terminal  speed 

(b)  t  =  g~x  ■  (terminal  speed)  •  (In  100);  typical  terminal  speeds  are  0.02  to 
0.1  cm/sec,  so  t  is  of  the  order  of  10-4  sec. 


2.27 

t  =  10(ln  j^)/(ln  ^h)  =  6.6  min  2.29 

t  =  100  In  |  =  81.1  min 

2.31 

ay  =  bx 

2.33 

x2  +  ny 2  =  C 

2.35 

x(y  - 1)  =  c 

3.1 

y  =  ±ex  +  Ce~x 

3.3 

y=  (\x2  +  C)e~x2 

3.6 

y  =  (x  +  C)/(x+  'Jx1  +  1 ) 

3.8 

y  —  ^lna;  +  C/lna; 

3.9 

2/(1  —  at2)1/2  =  x2  +  C 

3.11 

y  =  2(sina;  —  1)  +  Ce~sinx 

3.13 

x  =  \ey  +  Ce~v 

3.14 

x  =  y2/3  +  Cy~Y!3 

3.15 

S  =  (107/2)[(1  +  3f/104)  +  (1  +  3t/104)' 

-1/3],  where  S  =  number  of  pounds 

of  salt,  and  t  is  in  hours. 

3.17 

/  =  -‘/(«0  _  VowCfsinwt 

—  ujRC  cos  cat) /(I  +  uj2R2C2) 

3.21 

Nn  =  cie_Alt  +  c2e~X2t  +  ■  ■  ■ 

where 

A1A2  •  •  •  Xn-iNo 

A1A2  ■  ■  ■  A„i7Vo 

(A2  —  Ai) (A3  —  Ai) ...  (A 

„-Ar)’C2 

(Ai  —  A2)(A3  —  A2)  ■  •  •  (A „  —  A2) 

etc.  (all  A’s  different) 

3.22 

y  =  x  +  1  +  Kex 

4.1 

a/1/3  =  x  -  3  +  Ce-*/3 

4.4 

x2e3y  +  ex  __  ly3  =  C 

4.5 

x2  —  y2  +  2x(y  +  1)  =  C 

4.7 

x  =  t/(ln  x  +  C) 

4.9 

y2  =  Ce~x2/y2 

4.11 

tan  1  (a;  +  y)  =  x  +  C 

4.13 

y2  =  —  sin2  x  +  C  sin4  x 

4.16 

y2  =  C(C  ±  2x) 

4.18 

x2  +  (2/ —  ^)2  =  k2 

4.19 

r  =  Ae~e ,  r  =  Be6 

5.1 

y  =  Aex  +  Be~2x 

5.3 

y  =  Ae3lx  +  Be~3lx  or  other  forms 

as  in  (5.24) 

5.5 

y  =  (Ax  +  B)ex 

5.7 

y  =  Ae3x  +  Be2x 

5.9 

y  =  Ae2x  sin(3a;  +  7) 

5.11 

y=  (A  +  Bx)e~3x/2 

5.20 

y  =  Ae~x  +  Belx 

5.22 

y  =  Aex  +  Be~3x  +  Ce~5x 

5.24 

y  =  Ae  x  +  Bex!2  sin(la;v/3  +  7)  5.26 

y  =  Ae5x  +  (Bx  +  C)e~x 

5.28 

y  =  ex(Asm.x  +  B  cos  a;)  +  e 

x(Csmx  +  D  cos  x) 

5.29  y  =  (A  +  Bx)e~x  +  Ce2x  +  De~2x  +  Esin(2x  +  7) 
5.35  T  =  2 ty^J R/ g  =  85  min. 


6.1  y  =  Ae2x  +  Be~2x  -  §  6.3  y  =  Aex  +  Be~2x  +  \e2x 

6.5  y  =  Aeix  +  Be~ix  +  ex  6.7  y  =  Ae~x  +  Be2x  +  xe2x 

6.9  y  =  (Ax  +  B  +  x2)e~x 

6.11  y  =  e~x(As\n?>x  +  B  cos  3a; )  +  8  sin  Ax  —  6  cos  4a; 

6.13  y  =  (Ax  +  B)  ex  —  sin  x 

6.15  y  =  e_6x/5[Asin(8a;/5)  +  £?cos(8x/5)]  —  5  cos  2a; 

6.17  y  =  A  sin  4a;  +  B  cos  Ax  +  2a;  sin  4a; 

6.18  y  =  e-x(Asin4a;  +  Bcos4a;)  +  2e~4x  cos 5a; 

6.20  y  =  Ae~2x  sin(2a;  +  7)  +  4e_;C//2  sin(5x/2) 

6.22  y  =  A  +  Be~xl2  +  a;2  -  4a;  6.24  y  =  (A  +  Bx  +  2x3)e3x 

6.26  y  =  A  sin  x  +  B  cos  a;  —  2a;2  cos  a;  +  2a;  sin  x 

6.33  y  =  A  sin(a;  +  7)  +  x3  —  6a;  —  1  +  x  sin  x  +  (3  —  2x)ex 
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6.34  y  =  Ae3x  +  Be2x  +  ex  +  x 

6.37  y  =  (A  +  Bx)ex  +  2x2ex  +  (3  —  x)e2x  +  x  +  1 

6.41  y  =  e~x(Acosx  +  Ssinx)  + 

OO 

+  ^  [4(n2  —  2)  cosnx  —  8nsinnx\/[Trn2(n2  +  4)] 

odd  n—  1 


7.1 

(a) 

III 

Oi 

(c) 

y  =  tan((|  —  |)  =  sec  a;  —  tana; 

7.4 

X 2 

+  (y  —  b)2  =  a2,  or  y  =  C 

7.11 

7.12 

t  = 

f*  u2(  1  —  w4)-1/2  du 

7.16 

(a) 

y  =  Ax  +  Bx~3 

7.16 

7.18 

y  = 

-  Ax  +  Bx~3  +  \  (x  +  a;-1)  In  a: 

7.20 

y  = 

=  x2(A  +  Blnx)  +  a;2  (In  a;)3 

7.22 

7.25 

x~ 

1  -  1  7.27  x3e°° 

(b)  y  =  2/(x  +  l) 

(d)  y  =  2  tanh  x 
x  =  (1  -  3 t)1'3 

(c)  y  =  ( A  +  i?  In  x)/x3 

y  =  A  cos  In  x  +  B  sin  In  x  +  x 
7.29  xe 1/x 


8.8  e_2t  -  te~2t 

8.12  3  cosh  5t  +  2  sinh  5f 

8.23  y  =  fe_2t  (cost  —  sin  t) 


8.10  ^e*sin3t  +  2e4cos3t 
8.21  2  b(p  + a)/[(p  + a)2 +  b2]2 

8.25  e_p7r/2/ (p2  +  1) 


9.3 

9.7 

9.11 

9.13 

9.19 

9.23 

9.27 


y  —  6  2t(4 1+  \t2) 
y  =  1  —  e2t 
y  =  te2t 
y  =  sinh  2 1 
y  =  e2t 

r/  =  sin  t  +  2  cos  t  —  2e-t  cos  2f 
fy  =  t+±(l-e4t) 

1^=i  +  e4t 


9.4 

y  =  cos  t  +  \  (sin  t  — 

t  cost) 

9.9 

y  =  (t  +  2)  sin  At 

9.12 

y=  i(t2e-*  +  3e*- 

e-‘) 

9.17 

y  =  2 

9.21 

y  =  e3t  +  2e  2t  sin  t 

9.25 

9.28 

y  =  (3  +  t)e~2t  sint 

1  y  =  t  cos  t  —  1 

2  =  cos  t  +  t  sin  t 


9.30 


y  =  t  —  sin  2 1 
z  =  cos  2 1 


9.32 


y  =  sin  2 1 
z  =  cos  2t  —  1 


9.36  arctan(2/3)  9.38  4/5 

9.40  1  9.42  tt/4 


10.3  \t  sinh  t 

10.5  [6(6  -  a)te~bt  +  a(e~bt  -  e~at)]/(b  -  a)2 

10.7  (acoshbt  —  bsinhbt  —  ae~at)/(a 2  —  b2) 

10.9  (2t2  -2t  +  l-  e~2t)/A 

10.12  (b2  —  a2)-1(6-2  cos  bt  —  a-2  cos  at)  +  a~2b~ 2 

10.13  ^(e_t  +  sin t  —  cost) 

10.15  ±e3b  +  ±e~^-±eb 

iniv  j  (cosh  at  - 1)/ a2,  t>  0 
10.17  y  =  < 

1  0,  t  <  0 


11.7 


y  = 


(t  -  t0)e  *o),  t>  tQ 
0,  t  <tQ 
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11.9  y  = 


11.11  y  = 


ie  (t  t0)  gin 3(£  _  ^0)j  t  >  to 

0,  t  <  t0 

|[sinh(t  —  t0)  —  sin(t  —  t0)],  t  >  t0 

0,  t  <  to 


11.13  (b)  35(x  +  5)  -  45(x  -  10) 

11.15  (b)  0  (d)  cosh  1 

11.21  (b)  0(|a|)/(2|a|)  (c)  1/2 

11.23  (a)  5(x  +  5 )5(y  —  5)5(z),  5(r  —  5\/2)5(6  —  )S(z)/r , 

6{r-5V2)6(6-%)5(<l>-3£)/{rBm0) 

(c)  5(x  +  2 )5(y)5(z  —  2 a/3  ) ,  5(r  —  2)5(0  —  7 t)5(z  —  2/3  )/r, 
5(i —  4)5(0  —  f  )5((j>  —  n)/(rsin0) 

11.25  (c)  G"(x)  =  5(x)  +  5 5'(x) 


12.7  y  =  [a (cosh at  —  e  ')  — sinha/ 
12.11  y  =  — |  sin2x 

J  x  -  y/2  sin  x,  x  < 

12.13  5  =  <  ,  r 

I  ^7T  —  x  —  v  2  cos  x,  x  > 

-  x  —  x{hvx)2 /2 


12.16  y  : 

12.18  j/  = 


— *  In  1  „ 

a;2/2  +  x4/6 


1)] 


13.1  y  =  —  \x  2  +  Ccc 

13.5  x2  +  y2  —  y  sin2  x  =  C 

13.8  y  =  x(A  +  B  In  a;)  +  ^/ln  x)2 

13.13  y  =  Ae~2x  sin(x  +  7)  +  e3x 

13.18  x=  (y  +  C)e~ siny 

13.20  y  =  Aex  sin(2rc  +  7)  +  x  +  |  +  ex(l 

13.22  y  =  (A  +  Bx)e2x  +  C sin(3ir  +  7) 

13.24  y 2  =  ax2  +  b 

13.28  y2  +  4(x  —  l)2  =  9 


13.3  y  =  A  +  Be  xsin(a:  +  7) 

13.7  3  x2y3  +  1  =  Ax3 
13.10  u  —  In  u  +  In  v  +  v~x  =  C 
13.15  y  =  (A  +  Bx)e2x  +  3x2e2x 

—  x  cos  2x) 

13.26  y  =  x2  +  x 

13.30  y  =  5/3,  v  =  7g/12 ,  a  =  5ff/12 


13.32  1:23  p.m. 

13.33  In  both  (a)  and  (b),  the  temperature  of  the  mixture  at  time  t  is  given  by  the 
formula  Ta(  1  -  e~kt)  +  (n  +  n')_1(nT0  +  n%)e~kt. 

13.38  i  ln[(a2  +p2)/p2]  13.41  i(tanh  1  —  sech2  1)  =  0.0854 


13.43  (sin  at  +  at  cos  at) /(2a) 

13.46  For  e~x:  gs(ce)  =  (2/n)1/2a/(l  +  a2),  gc(&)  =  (2/n)1^2 /(I  +  a2) 

13.47  y  =  Asint  +  B cost  +  sintln(sect  +  tant)  —  1 
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2.1 

Parabola 

2.2 

Circle 

2.3 

ax  =  sinh(ay  +  b) 

2.6 

x  +  a  =  |  (51/2  —  26)  (6  +  51/2)1/2 

3.1 

dx/dy  =  C/y/y3  -  C2 

3.3 

xAy'2  =  C2(l  +  x2y'2)3 

3.6 

x  =  ay3!2  —  \y2  +  b 

3.7 

y  =  K  sinh(a;  +  C) 

3.9 

cot  0  =  Acos((j)  —  a) 

3.12 

(x  —  a)2  +  y2  =  C2 
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3.15  r cos(0  +  a)  =  C  or,  in  rectangular  coordinates, 
the  straight  line  x  cos  a  —  y  sin  a  =  C 
3.18  See  Problem  3.9 


4.6 

5.2 


5.4 

5.8 

5.12 

5.14 

5.16 

5.19 

5.21 

5.23 

6.1 


Cycloid 


{ra(f  —  r92)  =  —dV/dr 
m(rQ  +  2  rd)  =  — (1  /r)(dV/dQ) 
mz  =  —dV/dz 

19  +  g  sin0  =  0 


Comment:  These  equations  are  in  the 
form  m&  =  F;  recall  from  Chapter  6, 
equation  (6.7),  the  polar  coordinate 
form  for  F  =  -VP. 

5.6  j  aO  —  a  sin  0  cos  0  (j)2  —  gsin9  =  0 


((d/df)(sin2  9  4>)  =  0 

L  =  |m(  2r2  +  r29 2)  —  mgr  5.11  L  =  i(m  +  Ia~2)i2  —  mgz 

2 r  -  r92  +  g  =  0,  ( d/dt)(r29 )  =  0  (m«2  +  I)z  +  mga 2  =  0 

L  =  \m(r2  +  r292)  —  [\k{r  —  ro)2  —  mgr  cos  9] 
r  —  r92  +  ^(r  —  ro)  —  geos  9  =  0,  ( d/dt){r29 )  +  gr  sin  9  =  0 
L  =  Mx2  +  Mgx  sina,  2Mx  —  Mgsina  =  0 
L  =  i m(l  +  a9)29 2  —  mg[a  sin  9  —  (l  +  a9)  cos  9] 

( l  +  a6)9  +  a9 2  +  g  sin  9  =  0 

x  =  y  with  u)  =  \J g/l ;  x  =  —y  with  a ;  =  'JSg/l 
29  +  <j>  cos (9  —  (j>)  +  cj)2  sin(0  —  <j>)  +  sin  0=0 

<j>  +  9cos(9  —  <j>)  —  92  sin(0  —  <j>)  +  |  sin^>  =  0 


<f>  =  29  with  ui  =  \j2g/ (31) ;  <j)  =  —29  with  u>  =  yj2g/l 

Catenary  6.3  Circular  cylinder  6.5  Circle 


8.4  dr/d9  =  Kr\Jr 4  —  K 2 
8.6  (re  -  a)2  +  (y  +  l)2  =  C2 

8.8  Intersection  of  r  =  1  +  cos 9  with  z  =  a  +  6sin(0/2) 

8.10  Intersection  of  y  =  x2  with  az  =  b[2x\/Ax2  +  1  +  sinlC1  2x]  +  c 
8.12  ey  cos(x  —  a)  =  K 

8.16  Hyperbola:  r2  cos(20  +  a)  =  K  or  (x2  —  y2)  cos  a  —  2xys\na  =  iP 

8.17  K  In  r  =  cosh(AT0  +  C) 

8.18  Parabola:  (x  —  y  —  C)2  =  AK2(x  +  y  —  K2) 

8.20  m{r  —  rO2)  +  Kr~ 2  =  0,  r29  =  const. 

8.22  r~1m(r29  +  2 rr9  —  r2  sin  9  cos  9  (j)2)  =  —r~1(dV/d9)  =  Fg  =  mag, 
ag  —  r9  T  2r9  —  r  sin  9  cos  9  <j)2 
8.27  dr/d9  =  r^K2(  1  +  Ar)2  -  1 
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/  9  o -3\ 

4.6  /  =  ^  o  6  oj;  principal  moments:  (6,6,12);  principal  axes  along  the  vec¬ 

tors  (1, 0,  —1)  and  any  two  orthogonal  vectors  in  the  plane  z  =  x,  say  (0, 1, 0) 
and  (1,  0, 1). 


5.6  (a)  3 


(c)2 


(e)  -1 
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6.15  (c)  vector 

8.1  hr  =  1,  hg  =  r,  h,/,  =  rsind 

ds  =  er  dr  +  eg  r  dd  +  r  sin  9  dcj) 
dV  =  r2  sin  9  dr  d9  d</> 

ar  =  i  sin  9  cos  (/>  +  j  sin  9  sin  <f>  +  k  cos  9  =  er 
a  g  =  ir  cos  9  cos  4>  +  jr  cos  9  sin  </>  —  kr  sin  9  =  reg 
a$  =  — ir  sin  9  sin  (f>  +  jr  sin  9  cos  4>  =  r  sin  9  e $ 

8.3  ds/dt  =  err  +  eg  r6  +  e<f,r  sin  0  <j> 
d2s/dt2  =  er(r  —  r92  —  rsin2  9  <p2) 

+  eg (r9  +  2 r9  —  r  sin  9  cos  9  (j)2) 

+  e$  (r  sin  9  <j>  +  2r  cos  9  9cf>  +  2  sin  9  f<j>) 

8.5  V  =  er  cos  9  —  eg  sin  9  —  e^r  sin  9 

8.6  hu  =  hv  =  (u2  +  v2)1/2,  hz  =  1 

ds  =  ( u 2  +  v2y/2(eu  du  +  ev  dv)  +  ez  dz 
dV  =  ( u 2  +v2)du  dv  dz 
au  =  hr  +  jv  =  ( u 2  +  v2)1/2eu 
av  =  —it;  +  ju  =  ( u 2  +  v2)1t2ev 
az  =  k  =  ez 

8.9  hu  =  hv  =  a(coshtt  +  cost;)-1 

ds  =  a(coshtt  +  cosi;)-1(e„  du  +  ev  dv) 
dA  =  a2 (cosh w  +  cost;)-2  dudv 

au  =  (A2/a)[i(l  +  cost;  cosh  u)  —  j  sin  v  sinh  u]  =  hueu 
av  =  (h2 /a)  [i  sinh  u  sin  v  +j(l  +  cost;  cosh  tt)]  =  hvev 
8.11  ds/dt  =  ( u 2  +  v2)1^2(euii  +  evv)  +  ezz 

d2s/dt2  =  eu(u2  +  t;2)-1/2[(tr2  +  v2)u  +  u{ii2  —  v2)  +  2vuv] 

+  ev(u2  +  v2)-1^2[(it2  +  v2)v  +  v{v2  —  tt2)  +  2uuv]  +  ezz 
8.14  ds/dt  =  a(coshtt  +  cost>)-1(euti  +  evv) 

d2s/dt2  =  e„a(coshtt  +  cos  t;)-2 [(cosh u  +  cosr)ti  +  ( ir  —  tt2)  sinhtt  +  2m;  sin  v] 

+  e„a(coshu  +  cos  t;)-  [(cosh  u  +  cosv)v  +  {v2  —  tt2)  sint;  —  2uv  sinhtt] 

9.10  Let  h  =  hu  =  hv  =  (tt2  +  v2)1'2  represent  the  u  and  v  scale  factors. 


-l 


VU  =  h 


V  •  V  =  h~ 


V2U  =  h~ 


V  x  v=  U 


dU 
1  du 


dU_ 
’  dv 


+  k 


dU 

dz 


l(kV„)  + 


dVz 

dz 


' d2U  d2U ' 


d2  U 


\  du2 
_r  d_\A 
dv 


dv2  J 
dVv 
dz 


dz 2 


dVu  f  — !  dV z 

dz  du 


ev  +  h 


l(hV„)  -  l(hV„) 
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9.13  Same  as  9.10  if  h  =  a(cosh  u+cos  u)_1  and  terms  involving  either  z  derivatives 
or  Vz  are  omitted.  Note,  however,  that  V  X  V  has  only  a  2  component  if 
V  =  euVu  +  evVv  where  Vu  and  Vv  are  functions  of  u  and  v. 

9.15  hu  =  1,  hv  =  u/\/l  —  v 2 

eu  =  iu  +  j\/l  -  v 2,  ev  =  \yjl  -  v 2  -  ju 
m[ii  —  mu2/ (1  —  u2)]  =  —dV/du  =  Fu 

m[(uv  +  2  mu) /(I  —  u2)1/2  +  muu2/ (1  —  u2)3/2]  =  —h^dV/dv  =  Fv 


9.16 

r  \  0, 

0,  r  4ez 

9.19 

2e^,  er  cos  9 

—  eg  sin  9,  3 

9.21 

2r-\  6, 

2r"4,  —k2eikr 

cos  6 
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3.3 

9/10 

3.7 

8 

3.9 

r(5/4) 

3.11 

1 

3.14 

-r(4/3) 

3.17 

r(p) 

7.1 

\B(  §,±) 

=  3tt/16 

7.3 

HU) 

7.5 

B{  3,3)  = 

1/30 

7.7 

HU) 

7.11 

7.13 

II 

00 

A)/B(U) 

8.1 

B(bl)VWd=  7.4163^ 

8.3 

t  =  Tt^fafg 

(Compare  2ny/l/g) 

10.2 

T(p,x)  ~ 

xp  ie  *[i  -p  [p  - 

-  l)x 

_1  +  (p  - 

l)(p~2)x~2- 

10.5 

(a)  Ei  (a:) 

=  r(o,x) 

10.6 

(b)  Ei(x) 

11.5 

1 

12.1 

AT  =  E(tt 

■/2,fc)  =  (tt/2){1 

+  (1 

)2fc2  +  [(1 

•  3) / (2  •  4)]2fc4 

...} 

E  =  E(-7t 

/2,/c)  =  (tt/2){1 

“(5 

)2fc2  —  [1/(2  •  4)]2  •  3fc4 

~  [(1  ‘  3)/ (2  ■  4  •  6)]2  •  hk6  •  ■  ■  } 

Caution:  For  the  following  answers,  see  the  warning  about  elliptic  integral  notation 
just  after  equations  (12.3)  and  in  Example  1. 


12.6  |E(|,  |)  =  0.355 
12.10  ±E( f,±)  =  0.402 


12.5  E(l/3)  =  1.526 
12.7  5E(^,  i)  =  19.46 

1211  F(f^)  +  ^(^)  =  4-097 

12.16  2y/2E{l/y/2)  =  3.820 

12.23  T  =  8.  /#-  E(l/\/5  );  for  small  vibrations,  T  =  2tt 


12.13  3E(f,f)  +  3E(arcsinf,f)^3.96 


13.8  ^v/7rerf(l) 

13.11  |E(arcsin  |,  |)  =  0.1834 
13.15  T(7/2)  =  15V^/8 
13.19  7}^nerfc5 
13.24  —  255v/tt/109!! 


13.10  y/2K{l/y/2)  =  2.622 
13.13  —  snudnM 
13.17  iS(|,|)=37rV2/64 
13.21  54E(§,f)  =  (§)5(±f) 
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1.2 

V  =  a0ex 

1.7 

y  =  Ax  +  Bx3 

2.4 

Qo  =  Qi  =  f  In 

1+x  -1 

1—x  L 

3.3 

(30  —  a:2)  sin  a:  +  12a:  cos  x 

5.3 

Po(x)  =  1 

P4(x) 

Pi  (x)  =  X 

P5(x) 

P2(x)  =  (3x2  -  l)/2 

Pe(x) 

P3(x)  =  (5x3  —  3x)/2 

5.9 

2  P2  +  Pi 

5.12 

f  Pa  +  4P2  -  3 Pi  +  f  P0 

8.2 

yjl  P2(x) 

9.1 

9.4 

^(3Pi  +  ^P3  +  iiP5... 

) 

9.6 

Po  +  |Pi  -  f  P2  ■  ■  ■ 

9.8 

|(1  —  a)Po  +  f  (1  —  a2)P\ 

+  f  a(l 

9.11 

f  P4  +  4P2  -  3Pi  +  f  P0 

9.12 

|  (Pi-Ps) 

9.14 

^Po  +  |P2  =  ^(5x2  +  l) 

10.5 

\  (sin  0)  (35  cos3  0—15  cos  9) 

11.2 

y  =  Ax  3  +  Bx3 

1.3  y  =  a\X 

1.9  y  =  ao(l  —  x 2)  +  a\x 

3.5  (x2  -  200x  +  9900)e~x 

=  (35x4  —  30a;2  +  3)/8 
=  (63a:5  —  70x3  +  15x)/8 
=  (231x6  -  315a:4  +  105a:2  -  5)/16 

5.11  |(Pi-P3) 

8.4  N  =  tt1/4,  7r-1/4e"x2/2 

-  a2)P2  +  ^(1  -  a2) (5a2  -  1)P3  •  •  • 


11.4  y  =  Ax  2  +  Bx 3 


11.6  y  =  Ae~x  +  Px2/3[  1  -  3x/5  +  (3x)2/(5  ■  8)  -  (3x)3/(5  •  8  •  11)  +  •  •  ■  ] 

11.8  y  =  A(x~x  -  1)  +  Px2(l  -  x  +  3x2/5  -  4x3/15  +  2x4/21  +  •  •  • ) 

11.10  y  =  A[ l  +  2x-  (2x)2/2!  +  (2x)3/(3  ■  3!)  -  (2x)4/(3  •  5  •  4!)  +  •  •  •  ] 

+  Px3/2[1  -  2x/5  +  (2x)2/(5  •  7  •  2!)  -  (2x)3/(5  •  7  •  9  •  3!)  +  •  •  •  ] 

11.11  y  =  Ax1/Q[l  +  3x2/25  +  32x4/(5  •  210)  +  •  •  •] 

+  Bx~1/6[x  +  3x3/2e  +  32x5/(7  ■  211)  +  •  ■  ■  ] 


16.1  y  =  x  3/2Z1/2{ x) 
16.5  y  =  xZo(2x) 

16.9  y  =  x1/3Z2/3(4 y/Z) 
16.15  y  =  Z2(5x) 


16.3  y  =  x  1/2Z1(4x1/2) 

16.7  y  =  x~~lZ1/2{x2 /2) 

16.11  y  =  x~2Z2(x) 

16.17  y  =  Z0( 3x) 


17.7  (a)  y  =  xl^2I\(2x1^2).  Note  that  the  factor  i  does  not  need  to  be  included, 

since  any  multiple  of  y  is  a  solution. 


18.11  1.7  m  for  steel. 


20.1  1/6  20.3  4/v r  20.5  1/2 

20.7  hn\x)  ~  a;-1ei[a:-(Tl+1)7r/2l  20.9  h£\ix) - i-nx~xe~x 
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21.1  y  =  Ax  +  B  [x sinh  1  x  —  Vx2  +  l) 

21.2  y  =  A(1  +  x)  +  Bxe 1/x 

21.5  y  =  A(x  —  1)  +  B[(x  —  1)  lna;  —  4] 

21.7  y  =  Aj^  +  B[j^  lns+i±2] 

21.8  y  =  A( x2  +  2x)  +  B[(x2  +  2x)  lna;  +  1  +  5a;  —  a;3/6  +  tc4/72  +  •  ■  •  ] 

22.4  H0(x)  =  1  H3(x)  =  8x3  -  12x 

H\(x)  =  2x  H4(x)  =  16a;4  —  48a;2  +  12 

H2{x)  =  Ax2  -  2  H5(x)  =  32a;5  -  160a;3  +  120a; 

22.13  L0(x)  =  1 

L  i(x)  =  1  —  x 


L2(x)  =  |(2  -  4a;  +  a;2) 

L3(x)  =  ^(6  —  18a;  +  9a;2  —  x3) 

L4(x)  =  ^(24  -  96a;  +  72a;2  -  16a;3  +  a;4) 

L5(x)  =  yip  (120  -  600a;  +  600a;2  -  200a;3  +  25a;4  -  x5) 

Note:  The  factor  1/n!  is  omitted  in  most  quantum  mechanics  books  but  is 
included  as  here  in  most  reference  books. 
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2.12 


2.14 


3.9 

3.11 

4.8 


4.9 


T=  E 


400 


odd  n 


rnr  sinh  3n7r 


•  .  n7T  \  .  n nx 

smh  —(30  —  y)  sm  ■ 


10 


10 


+  £ 


400 


odd  ' 


.  ,  ™  x  .  nny 

- —■ - 7— -  smh  ——(10  —  x)  sm  — — 

n7rsinh(n7r/3)  30 v  30 


For  f(x)  =x-5 :T=~  V  Ajcos  —  e~n”v/w 

7 rz  '  nz 

odd  n 


1 


10 


For  f(x)  =  x:  add  5  to  the  answer  just  given. 


=  100  -  —  V  EE  e-KWWft  cos 

7T 


7 r  z — '  2n  +  1 

n— 0 


2n  +  1 


i5„=nV/(2m);  *{x,t)  =  -  V  e~iE^n 

/tt  <  ^  n 


odd  1 


4/ 

y=  — 

ir^v 


1  7ra;  nvt  7r  2nx  2n  vt 

—  sm  — —  sm  — — I - sm  — —  sm  — - — 

3  l  l  16  l  l 


Problem  2 
Problem  3 
Problem  5 


00  •  /O 

sm  nn  /  2 
n(n2-4) 
n  =  2,  v  =  v/l 
n  =  3,  v  =  | v/l  and  n 
n  =  1,  v  =  \vjl 


.  nnx  .  nnvt 
sm  — ; —  sm  — ; — 


=  4,  v  =  2 v/l  have  nearly  equal  intensity. 


5.1  (a)  u  =  9.76 

OO  1 

5.4  u  =  200  V  - — —— — r  Jo(fcmf/a)e_('fcma^a')  *,  km  =  zeros  of  Jo 

kmJi  (rCm  ) 

m.=  1  '  7 
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5.10 

5.11 

5.13 

5.14 

6.5 

6.8 


v  v  y  .inis' si 

7T3  Z_^  nmp  l  l  l 

odd  n  odd  m  odd  p 

R  =  rn,  r~n,  n  ^  0;  R  =  In  r,  const.,  n  =  0. 

R  =  rl ,  r_z_1. 

400  1  /  r  \  4"  . 

-  /  -  Ty  sm  4n0 

7r  '  n  V 10/ 

odd  n 


U  = 


In  2 
64Z4 


ZUU  x  - 

7T  ‘ 


7r  z-~i  n( 2n  —  2  n) 

odd  n  y  ' 


sin  nQ 


z  = 


Ev-^  1  n'nx  mny  m Am2  +  n2)l'2t 

>  — = — r  sm  — —  sm  — - —  cos - ; - 

Z—s  nim-S  1  1  1 


odd  m  odd  n 


= 


j  (k  r)  I  s*nn^  1  e -iEmnt/h  £ 
■>n{Krnnr)  j  ^  ^  j  e  , 


2  7-2 


2  m«2 


it  =  |rPi(cos0)  —  |r3P3(cos0) 

u  =  iPo(cos0)  +  |r2P2(cos0)  —  ^r4P4(cos0)  •  •  • 

u  =  |7r[3rPi(cos0)  +  ^rr3P3(cos0)  +  ||r5P5(cos0)  •  ■  ■] 


7.2 

7.5 

7.6 

7.8  u  =  25[Po(cos0)  +  |rPi(cos0)  +  ^ r2P2(cos0 )  +  ||r3P3(cos0)  •  ■  •] 
7.10 

7.12 

7.13 


u  =  ^r3P|(cos0)  cos20  —  rPi(cos0) 
u  =  |rPi(cos  0)  +  ^r3P3(cos0)  -  ^r5P5(cos0)  ■ 
u  =  E0(i —  a3/r2)Pi(cos0) 

200a  g  (-1)"  B.n  wrr  p_(cm7r/a)2t 


7.15  u  =  100  + 


■  sm - e 

n  a 


7.19 


7.20 

7.21 

7.22 

8.4 


8.5 

9.2 

9.7 


=  100  +  200  J2(-1)njo(mrr/a)e-(°‘n*/a)2t 
tf(r,0,0)  =  ji(J3r)Pr( cosd)e±im*e-iEt/R,  where 

/ -  ?j2 

0  =  y  2 ME/n2  ,  Pa  =  zeros  of ju  E  =  2(  zeros  of  jz)2 


ipn(x)  =  e  a  x  /2Hn(ax),  a  =  y/muJJh 

Degree  of  degeneracy  of  En  is  C(n  +  2 ,n)  =  (n  +  2 )(n  +  l)/2,  n  =  0  to  oo. 

=  R(r)Yr(9,4>),  R(r)  =  rl  e~R  ^  L2^  (£) ,  En  = 

Let  K  =  line  charge  per  unit  length.  Then 

V  =  —  PTln(r2  +  a2  —  2 ra  cos  0)  +  K  In  a2  —  K  In  R2 
+  K  ln[r2  +  (P2/a)2  -  2(P2/a)rcos6>] 

P  at  (a,  0),  ~K  at  (P2/a,0) 

u  =  2007T-1  f 0°°  fc_2(l  —  cos  2 k)e~ky  cos  fccc  dk 

u(x,t)  =  100erf[x/(2at1/2)]  —  50  erf  [(a;  —  l)/(2at1/2)]  —  50  erf  [(a;  +  l)/(2at1/2)] 


7r2  ■2-—'  n2  sinh  2n7r 
odd  n 


10.3 


sinh  n7r(2  —  y )  cos  n7ra; 
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10.4 


T  = 


odd  n 


l 

nsinh(3n7r/5) 


mry  nnx 

sinh - sin - 

5  5 


10.6 

10.8 

10.10 

10.16 

10.18 

10.20 

10.22 

10.26 


40 


E 


.  _  rar(5  —  x)  .  niry 
sinh - - - sin  • 


7 r  '  nsinh(5n7r/3)  3  3 

odd  n  \  I  J 


=  20--E 

■7T  •  ^ 


u  =  z  u - >  - - -e 

7 r  ^  2n  +  1 


HT  p-[(2n+l)W(20]2tcos  ( 2U+  1 


- 


2/ 


-7TX 


.  =  20-®--  V  - 

7T  ^ 


1  p-(n7ra/10)2t  gin 


7r  z — '  n 

even  n 


U  = 


10 

1  /  mirr 

tt*  * — '  z '  nm/n(3m7r/20)  n  \  20 

odd  n odd  m 


1600 


E  E 


sin  nO  sin  ■ 


20 


va/E/ (27 r) 

vmn,  n  =  3,  6,  •  ■  • ;  the  lowest  frequencies  are: 

rq3  =  2.65i/io,  r/23  =  4.06  vw,  v16  =  4.13 1/10,  1/33  =  5.4i/10 


1/  =  v\i/(2na)  where  A/  =  zeros  of  ji,  a=  radius  of  sphere, 
v  =  speed  of  sound 

u  =  1  —  irPi(cos0)  +  |r3P3(cos0)  —  iir5Ps(cos 9)  ■  ■  ■ 
v  =  [v/(2n)][(kmn/a)2  +  A2]1/2  where  kmn  is  a  zero  of  Jn 


Chapter  14 

1.1  u  =  x3  —  3xy2,  v  =  3x2y  —  y3  1.3  u  =  x,  v  =  —y 

1.4  u  =  {x2  +  y2)1!2,  v  =  0  1.7  u  =  cos  y  cosh x,  v  =  sin  y  sinh  x 

1.9  u  =  x/(x2  +  y2),  v  =  —  y/{x2  +  y2) 

1.11  u  =  3x/[x2  +  (y  —  2)2],  v  =  (—2x2  —  2 y2  +  5 y  —  2)/[x2  +  (y  —  2)2] 

1.13  u  =  ln(x2  +  r/2)1/2,  v  =  0  1.17  u  =  cos x cosh y,  v  =  sin x sinh y 

1.18  u  =  ±2-1/2[(x2  +  y2)1/2  +  a;]1/2,  v  =  ±2_1/2[(a;2  +  y 2)1/2  —  x J1/2, 
where  the  ±  signs  are  chosen  so  that  uv  has  the  sign  of  y. 

1.19  u  =  ln(a:2  +  r/2)1/2,  v  =  arc  tan (y/x) 

[The  angle  is  in  the  quadrant  of  the  point  (x,y).} 


In  2.1 

-2.23,  A  = 

analytic,  N  =  not  analytic 

2.1 

A 

2.3 

N  2.4 

N 

2.7 

A 

2.9 

A,  z  ^  0 

2.11 

A,  2  ^  2i  2.13 

N 

2.17 

N 

2.18 

A,  z  ^  0 

2.19 

A,  z/0  2.23 

A, 

z/  0 

2.34 

1  r3 

3  ~ 

•,  |z|  <  1 

2.38 

-\i+\z+\iz2  - 

-  M  <  2 

2.42 

z  +  z3/3\  + 

z5/ 5!- 

•  • ,  all  z 

2.48 

Yes,  z  ^  0 

2.52  No 

2.53 

Yes,  . 

z/0 

2.54 

— iz 

2.56  —iz2 12 

2.59 

ez 

2.60 

2  In  z 

2.63  -i/{l-z) 

3.1 

\  +  i 

3.3  0 

3.5 

-1 

3.7 

7r(l  —  i)/  8 

3.9  1 

3.12 

(a)  1 

(1  +  2  i) 

3.17 

(a)  0  (b) 

in 

3.19  16*7t 

3.23 

72z7r 

4.4 

For  0  <  \z\ 

<  1:  - 

1--1  1  11  ~  13, 
A4,  2  164,  16  ^ 

:2  •  •  • 

;  R(0)  = 

1 

O  4o 

For  1  <  \z\ 

<2: 

■  +  z"3  +  z“2  +  fz-1 

+  \ 

+  Tez  + 

—z^  •  • 
16 

For  \z\  >  2: 

z-4  +  5z-5  +  17z-6  +  49z~ 

7  .  .  . 
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4.6  For  0  <  |z|  <  1:  z"2  -  2z~x  +  3  -  4z  +  5z2  ■  ■  ■ ;  i?(0)  =  -2 
For  |z|  >  1:  z-4  -  2z"5  +  3z"6  •  •  • 

4.8  For  |z|  <  1:  -5+ f  z- ^z2---;  R(0)  =  0 

For  1  <  |z|  <  2:  — 5( - h  z-3  -  z-2  +  z_1  +  \z  +  ^z2  +  jj- gZ3  •  •  • ) 

For  2  <  |zj  <  3:  •  •  •  +  3z"3  +  9z"2  -  3Z"1  +  1  -  \z  +  \z2  -  ^z3  •  •  • 
For  | z |  >  3:  30(z"3  -  2z"4  +  9z"5  •  •  • ) 

4.9  (a)  Regular  (b)  Pole  of  order  3 

4.10  (b)  Pole  of  order  2  (d)  Essential  singularity 


4.11 

(c)  Simple  pole 

(d) 

Pole  of  order  3 

4.12 

(b)  Pole  of  order  2 

(d) 

Pole  of  order  1 

6.1 

z_1  -  1  +  z  -  z2  •  •  • ;  R 

:=  l 

6.3 

z~3  —  z~1/3!  +  z/5!  •  •  • 

;  R  =  - 

1 

'  6 

6.5 

HO  -  H1  +  3  +  ?0 

-l)... 

■];  R  =  he 

6.7 

iKz-iH-i  +  O- 

*72)0 

6.9 

—  [(z  —  2)_1  +  1  +  (z  — 

2)  +  (z 

R  =  - 1 

6.14 

R(- 2/3)  =  1/8,  R{ 2)  = 

=  -1/8 

6.16 

R(  o)  =  - 

2,  R(l)  =  1 

6.18 

R(3i)  =  5  — 

6.19 

R(n/2)  = 

1/2 

6.21 

R[V 2{l  +  i)]=V2{l- 

-0/16 

6.22 

R(in)  =  - 

-1 

6.27 

R(n/6)  =  -1/2 

6.28 

R(3i)  =  - 

16  ^  24  L 

6.31 

R(0)  =  9/2 

6.33 

R(tt)  =  - 

1/2 

6.35 

R(i)  =  0 

6.14' 

ni/4 

6.16' 

—2m 

6.18' 

0 

6.19' 

0 

6.27' 

—m 

6.28' 

iri/4 

6.31' 

97 Tl 

6.33' 

0 

6.35' 

0 

7.1 

7t/6 

7.3 

2tt/3 

7.5 

7r/(l  —  r2) 

7.7 

7t/6 

7.9 

2-k l\  sin  a\ 

7.11 

3tt/32 

7.13 

7r/10 

7.15 

7re_4/3/12 

7.17 

(7r/e)(cos2  +  2  sin  2) 

7.19 

7re^3/54 

7.23 

71-/8 

7.24 

7T 

7.26 

—  7t/2 

7.28 

7r/4 

7.30 

tt/(2>/2) 

7.32 

T67rv/2 

7.33 

ttV2/2 

7.36 

-^272 

7.39 

2 

7.41 

(2tt)1/2/4 

7.45 

One  negative  real,  one 

each  in 

quadrants  I  and  IV 

7.48  Two  each  in  quadrants  I  and  IV 
7.50  Two  each  in  quadrants  II  and  III 


7.52 

7T7 

7.54 

87TZ 

7.55 

cosh  t  cos  t 

7.57 

1  +  sin  t  —  cos  t 

7.60 

t  +  e_t  —  1 

7.61 

(cosh  2t  +  2  cosh  t  cos  t\J 3  )  /3 

7.63 

(cosht  —  cost)/  2 

7.65 

(cos  2 1  +  2sin2t  —  e“*)/5 

8.3 

Regular,  i?  =  —  1 

8.5 

Regular,  R  =  —  1 

8.7 

Simple  pole,  R  =  —2 

8.9 

Regular,  R  =  0 

8.11 

Regular,  i?  =  —  1 

8.14 

—2m 
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9.3  u  =  x/(x2  +  y2),  v  =  —  y/(x2  +  y2) 

9.4  u  =  ex  cos  y,  v  =  ex  sin  y 

9.7  u  =  sin  x  cosh  y,  v  =  cos  x  sinh  y 


10.6  T  =  100 y/(x2  +  y2)-  isothermals  y/(x2  +  y2)  =  const.; 

flow  lines  x/(x 2  +  y2)  =  const. 

10.9  Streamlines  y  —  y/ (x2  +  y2)  =  const. 

10.12  T  =  (20/7t)  arc  tan[2y/(l  —  x2  —  y2)],  arc  tan  between  7r/2  and  37r/2 
10.14  <F  =  ^Vo ln{[(x  +  l)2  +  y2]/[(x  —  l)2  +  y2}} 

'P  =  Vo  arc  tan{2y/[l  —  x2  —  y2]},  arc  tan  between  7t/2  and  37r/2 
Vx  =  2Vo(l  -  x2  +  y2)/[(  1  -  x2  +  y2)2  +  4 x2y2], 

Vy  =  -AV0xy/[(l  -  x2  +  y2)2  +  4 x2y2} 


11.2  —  i\n(l  +  z) 

11.8  i?(l/2)  =  1/2 
11.14  (a)  2  (b)  —  sin  5 

11.18  We~n/2 


11.5  i?(i)  =  |(l-n/3),  R(-i)  =  -\ 
11.10  -1  11.12  1/2 
(c)  1/16  (d)  — 2t r  11.16  — tt/6 

11.20  ^7r(e_1  +  sin  1) 


11.29  7r3/8.  Caution:  — 7t3/8  is  wrong. 

11.32  One  negative  real,  one  each  in  quadrants  II  and  III 
11.34  Two  each  in  quadrants  I  and  IV,  one  each  in  II  and  III 
11.41  tt2/8 
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1.2  3/8,  1/8,  1/4 

1.6  27/52,  16/52,  15/52 


1.5  1/4,  3/4,  1/3,  1/2 

1.8  9/100,  1/10,  3/100,  1/10 


2.12 

(a)  3/4 

(b)  1/5 

(c)  2/3  (d)  3/4 

2.14 

(a)  3/4 

(b)  25/36 

(c)  37,  38,  39,  40 

2.17 

(a)  3  to  9  with  p( 5)  =  p( 7) 

=  2/9;  others,  p  =  1/9. 

(e)  3/7 
(c)  1/3 


3.4 

(a)  8/9,  1/2  (b)  3/5,  1/11,  2/3,  2/3,  6/13 

3.5  1/33,  2/9 

3.12 

(a)  1/49  (b)  68/441 

(c)  25/169 

(d)  15  times 

(e)  44/147 

3.14 

n  >  3.3,  so  4  tries  are  needed. 

3.16  9/23 

3.17 

(a)  39/80,  5/16,  1/5,  11/16 

(b)  374/819 

(c)  185/374 

3.20 

5/7,  2/7,  11/14  3.21 

2/3,  1/3 

4.1 

(a)  P(10,8)  (b)  C(10, 8) 

(c)  1/45 

4.4 

1.98  x  10"3,  4.95  x  10"4,  3.05  x  10"4,  1.39  x  10 

"5  4.7 

1/26 

4.8 

1/221,  1/33,  1/17  4.11 

0.097,  0.37,  0.67;  13 

4.17 

MB:  16,  FD:  6,  BE:  10 

5.1 

fx  =  0,  a  =  -\/3 

5.3  n  =  2, 

cr  = 

5.5 

A*  =  1,  cr  =  y/TJ& 

5.7  p,  =  3(2p  —  1),  a  =  2v/3p(l  —  p) 

6.1 

(c)  x  =  0,  a  =  2 _1/2a 

6.4  x  =  0, 

<7=  ( 21/2a)-1 

6.5 

/(*)  =  Ae-At,  F(t)  =  1  -  e~xt,  i 

=  1/A,  half  life 

=  tin  2 
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Answers  to  Selected  Problems  809 


6.7  (a)  F(s)  =  2[1  -  cos (s/P)],  f(s)  =  (2 /R)  sin {s/R) 

(b)  F(s)  =  [1  —  cos(s/i?)]/[l  —  cos(l/i?)]  =  s2, 
f{s)  =  i?_1[  1  —  cos(l/l?)]_1  sin  {s/R)  =  2s 


n 

Exactly 
7  h 

At  most 

7  h 

At  least 

7  h 

Most  probable 
number  of  h 

Expected 
number  of  h 

7.1 

7 

0.0078 

1 

0.0078 

3  or  4 

7/2 

7.2 

12 

0.193 

0.806 

0.387 

6 

6 

In  the  following  answers,  the  first  number  is  the  binomial  result  and  the  second 
number  is  the  normal  approximation  using  whole  steps  at  the  ends  as  in  Example  2. 
8.12  0.03987,  0.03989  8.14  0.9546,  0.9546 

8.17  0.0770,  0.0782  8.18  0.372,  0.376 

8.20  0.462,  0.455 

9.3  Number  of  particles  0  1  2  3  4  5 

Number  of  intervals  406  812  812  541  271  108 

9.5  P0  =  0.37,  Pi  =  0.37,  P2  =  0.18,  P3  =  0.06 

9.8  3,10,3 

9.11  Normal:  0.08,  Poisson:  0.0729,  (binomial:  0.0732) 

10.8  5  =  5,  y  =  1,  sx  =  0.122,  sy  =  0.029, 

(t x  =  0.131,  <jy  =  0.030,  <jmx  =  0.046,  amy  =  0.0095, 
rx  =  0.031,  ry  =  0.0064, 
x  +  y  =  6  with  r  =  0.03,  Ty  =  5  with  r  =  0.04, 
a;3  sin  y  =  105  with  r  =  2.00,  In  x  =  1.61  with  r  =  0.006 
10.10  5  =  6  with  r  =  0.062,  y  =  3  with  r  =  0.067, 

ev  =  20  with  r  =  1.3,  x/y 2  =  0.67  with  r  =  0.03 

11.3  20/47  11.7  x  =  1/4,  a  =  -s/3/4 

11.9  (d)  x  =  1/4,  a  =  V3T/12  11.13  30,60 

11.17  5  =  2  with  r  =  0.073,  y  =  1  with  r  =  0.039,  x  —  y  =  1  with  r  =  0.08, 
xy  =  2  with  r  =  0.11,  x/y3  =  2  with  r  =  0.25 
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Abelian  group,  175 
absolutely  convergent  series,  10,  56 
absolute  value  of  complex  number,  49,  53 
acceleration,  286 
of  car,  33 

centripetal,  83,  487 
in  circular  motion,  283 
complex,  56 

in  any  coordinate  system,  523 
Coriolis,  487 

in  cylindrical  coordinates,  523 
of  gravity,  392 
in  polar  coordinates,  487 
a-c  circuit,  See  electric  circuits 
active  transformation,  127 
adding  (subtracting) 
complex  numbers,  51 
matrices,  115 
mod  N,  174,  178 

rows  (columns)  of  a  determinant,  91 
series,  19,  23 
tensors,  504 
vectors,  97-98 
adjoint,  137 

air  pressure  in  a  sound  wave,  345,  372 
Airy  functions,  596 

alternating  current.  See  electric  current 
alternating  series,  17,  34 
Ampere’s  law,  329 
amplitude: 

of  a  complex  number,  49 
of  an  elliptic  integral,  555 
of  a  sinusoidal  function,  341 
of  vibration,  341 
AM  radio  signal,  347 
analytic  at  a  point,  667 
analytic  at  infinity,  703-704 
analytic  geometry  using  vectors,  106 
angle: 

of  a  complex  number,  49 


in  n  dimensions,  143,  145 
between  planes,  112 
solid,  320 

between  vectors,  102 
angular  frequency,  80 
angular  momentum,  289  (Problem  9) 
angular  velocity,  278,  515 
annihilation  operators,  607 
annular  ring,  679 
antisymmetric  matrix,  138 
antisymmetric  tensor,  503,  509 
applications  to  physics  problems.  See 
individual  topics 
approximation(s) : 

by  asymptotic  series,  549  ff,  552 
for  Bessel  functions,  604 
using  differentials,  196-199 
of  integral,  198 

“least  squares”,  214  (Problem  16), 
581,  774  (Problem  1) 
of  In  AH,  537 

by  normal  distribution,  762,  769 
using  series,  33,  38  (Problems  40  ff) 
arc  length,  250,  266,  521 

in  cylindrical  coordinates,  260,  521 
of  ellipse,  556 
metric  tensor,  524 
in  orthogonal  systems,  521 
in  polar  coordinates,  259,  266,  521 
in  rectangular  coordinates,  250 
in  spherical  coordinates,  261.  See 
also  523-524 
vector  element,  319 
arc  tan  series,  30 
area,  249.  See  also  surface 
under  a  curve,  249 
element  on  a  sphere,  261 
element  on  a  surface,  271 
as  measure  of  probability,  751 
of  parallelogram,  279 
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in  polar  coordinates,  258 
sec  7  method,  271 
of  surface  of  revolution,  254 
Argand  diagram,  48 
argument  of  a  complex  number,  49 
argument  principle,  694 
finding  zeros,  695 
arithmetic  mean,  771 
arrow  representing  a  rotation,  497 
arrow  representing  a  vector,  96 
associated  Legendre  functions,  583 
associated  tensors,  533 
associative  law 

for  matrix  multiplication,  139 
for  vector  addition,  97 
asymptotic  series,  549  ff 
audio  frequency,  347 
augmented  matrix,  85 
auxiliary  equation,  409  ff 
average  value.  See  also  mean  value 
of  a  function,  347  ff 
of  products  of  sines  and  cosines,  351 
of  sin2  x  and  cos2  x,  349 
axes 

along  eigenvector  directions,  151 
principal,  162 
real  and  imaginary,  48 
rotation  of,  126,  127,  498 
axial  vector,  515 

left-handed  coordinate  system,  515 
in  physics  formulas,  517 
right-hand  rule,  515 

bacteria,  growth  of,  1,  399 
balls  in  boxes,  739  ff 

distinguishable  and  indistinguishable, 
741 

identical,  740 

basic  interval  in  Fourier  series,  350,  360 
basis,  143  ff 
change  of,  127,  708 
orthonormal,  145 
basis  functions,  357 

completeness  relation,  375 
complete  set  of,  375-377 
basis  vectors.  See  also  basis  functions; 
unit  basis  vectors 
complete  set  of,  576 
covariant  and  contravariant,  530 


in  curvilinear  coordinates,  522 
in  general  coordinates,  533 
i,  j,  k,  100 
Bauer’s  formula,  618 
Bayes’  formula,  732 
beam  carrying  load,  496 
Bernoulli  equation,  404,  431 
Bernoulli  trials,  756 
Bessel  coefficients,  617 
Bessel  functions,  587  ff 
Airy  functions,  596 
applications,  587 
approximate  formulas,  604 
asymptotic  formulas,  604 
ber,  bei,  ker,  kei,  596 
differential  equations,  588,  593,  594 
first  kind,  589,  595 
generating  function,  616 
graphs,  591 
Hankel  functions,  595 
hyperbolic,  595 
integrals  involving,  617 
Kelvin  functions,  596 
modified,  595 
Neumann  function,  591 
normalization  of,  602-603 
order,  589 

orthogonality,  601  ff 
recursion  relations,  592 
second  kind,  591,  595 
second  solution,  590 
series  of,  640  ff 
spherical,  596 
third  kind,  595 
Weber  function,  591 
zeros,  591 

Bessel’s  inequality,  376 
beta  functions,  542  ff 

in  terms  of  gamma  functions,  543 
integrals,  542,  543 

binomial  coefficients,  28  ff,  192,  567, 

609,  739 

binomial  distribution,  756  ff 

mean  value  and  standard  deviation,  763 
normal  approximation,  762 
Poisson  approximation,  768 
binomial  series,  28  ff.  See  also  bino¬ 
mial  coefficients 
binomial  theorem,  28  ff 
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bipolar  coordinates,  525  ff 
birthdays,  743 
books,  pile  of,  44 
Bose-Einstein  statistics,  742 
bouncing  ball,  1 

boundary  conditions,  393,  625,  631, 
634,  711 

boundary  point  problems,  223  ff 
boundary  value  problems,  625,  711 
bounded  function,  356 
brachistochrone,  473,  482,  484 
branch,  branch  cut,  branch  point,  693 
Bromwich  integral,  696 
butterfly  net,  328 

calculator.  See  also  computer 
degree  and  radian  mode,  49,  52 
calculus  of  variations,  Chapter  9 
brachistochrone,  473,  482,  484 
Euler  equation,  474,  477 
Fermat’s  principle,  473,  484 
geodesics,  472  ff 
isoperimetric  problems,  491 
Lagrange’s  equations,  485  ff 
soap  film,  473,  480 
variational  notation,  493 
Caley-Hamilton  theorem,  161,  187 
capacitor,  323 

fringing  at  edge  of,  714-716 
car,  acceleration  of,  33 
carrier  wave,  347 
Cartesian  tensors,  498,  500 
Cartesian  vectors,  499 
catenary,  480 

Cauchy  differential  equation,  434 
Cauchy  principal  value,  692 
Cauchy- Riemann  conditions,  669  ff, 
673-674,  720 

Cauchy’s  integral  formula,  675 
Cauchy’s  theorem,  674 
center  of  mass,  251  ff 
central  limit  theorem,  774 
centripetal  acceleration,  487 
centroid,  251  ff 
chain  rule,  199  ff,  203  ff 
in  matrix  form,  205 
chain  suspended,  467,  473 
change  of  basis,  127,  708.  See  also 
change  of  variables 


linear  transformations,  125 
in  n  dimensions,  143 
orthogonal  transformations,  126 
change  of  variables,  228  ff.  See  also 
change  of  basis 

in  a  differential  equation,  228,  406, 
431,  434 

general  transformations,  529 
in  integrals,  258  ff 
rectangular  to  cylindrical,  260 
rectangular  to  polar,  258 
rectangular  to  spherical,  261 
rotation,  127,  498 
character  (group),  175 
characteristic  equation  for  a  differen¬ 
tial  equation,  409 

characteristic  equation  of  a  matrix,  148 
characteristic  frequencies,  165-166,  636, 
644 

characteristic  functions,  566,  625.  See 
also  eigenvalue  problems 
characteristic  modes  of  vibration,  166, 
636,  645 

characteristic  values,  148  ff.  See  also 
eigenvalue  problems 
characteristic  vectors,  148  ff 
Chebyshev’s  inequality,  759 
circuit.  See  electric  circuits 
circular  membrane,  644  ff 
circulation,  325  ff 
class  (group)  175 
cnu,  558 
coefficients 

in  Bessel  series,  641-642 
binomial,  28  ff,  192,  567,  609,  739 
determinant  of,  93,  148 
in  Fourier  series,  350 
in  Legendre  series,  580-581 
in  Maclaurin  series,  24 
matrix  of,  84 
in  Taylor  series,  24 
in  two-variable  power  series,  191 
undetermined,  421 
cofactor,  90 
coin,  weighted,  727 
coldest  points  of  a  plate,  224 
collectively  exhaustive,  723 
color,  346,  378 
column  matrix,  114 
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columns.  See  determinants  or  matrices 
column  vector,  114 
combinations  of  measurements,  772  ff 
combinations  or  selections,  737  ff 
commutative  law 

for  scalar  multiplication,  101 
for  vector  addition,  97 
not  for  vector  multiplication,  103 
commutator,  117,  122,  141 
comparison  test,  10 
special,  15 

complementary  error  function,  547 
complementary  function,  417 
complete  elliptic  integrals,  555 
complete  set  of  basis  functions,  375-377 
complete  set  of  basis  vectors,  576 
complex  acceleration,  56 
complex  conjugate,  50,  52 
of  a  matrix,  137 
roots  of  auxiliary  equation,  411 
complex  eigenvalues,  156 
complex  equations,  54  ff 
complex  Euclidean  space,  146 
complex  exponentials,  60  ff 
in  differential  equations,  420 
in  evaluating  integrals,  351 
complex  Fourier  series,  358  ff 
complex  impedance,  78 
complex  infinite  series,  56  ff,  678  ff. 

See  also  series;  power  series 
complex  numbers,  Chapter  2 
absolute  value,  49,  53 
angle,  49 

combining,  51,  60,  63 
conjugate,  50,  52,  53 
definition,  46 

elementary  functions  of,  60  ff 
imaginary  part,  47 
laws  of  exponents  for,  60 
modulus,  49 
plotting,  48,  666 
polar  form,  48,  50  ff 
real  part,  47,  49 
rectangular  form,  48,  51 
complex  plane,  47  ff,  666  ff 
complex  potential,  713 
complex  powers  and  roots,  73 
complex  power  series,  58,  678 


complex  variable,  functions  of, 

Chapter  14 

complex  velocity,  56,  717 
components,  82,  96 
scalar  and  vector,  100 
of  a  tensor,  496  ff,  499  ff 
of  a  vector,  100  ff,  497 
compound  interest,  3 
computer,  See  also  calculator 

comments  about  use,  xii,  3,  36,  63, 
93, 196,  241,  402,  408,  537,  562, 
699  (see  also  instructions  and 
comments  in  problem  sets) 
examples  of  use,  31,  66,  249,  357, 
393-394,  397,  605,  624,  646,  762, 
768  (see  also  individual  prob¬ 
lem  sets) 

conditionally  convergent  series,  18 
conditional  probability,  732 
confidence  interval,  774 
conformable  matrices,  116 
conformal  mapping,  708  ff 
applications,  710  ff 
conic  section,  principal  axes,  162 
conjugate  element,  175 
conjugate  harmonic  functions,  672 
conjugate  of  a  complex  number,  50, 
52,  53 

conjugate  of  a  matrix,  137 
conservation  of  energy,  433 
conservative  field,  conservative  force, 
302  ff,  325,  330  ff 
constraints,  214,  488 
continuity,  equation  of,  317 
continuous  distributions,  750-751 
contour  integrals,  674,  676-678 
for  inverse  Laplace  transform,  696 
problems,  699  ff 
by  residues,  687  ff 
contour  lines,  290 
contour  map,  295 
contraction  of  a  tensor,  502  ff 
contravariant  vector  or  tensor,  530  ff 
convergence,  3,  6-7 
absolute,  10 
conditional,  18 
disk  of,  58 

of  Fourier  series,  356 
of  infinite  series,  2,  7,  9  ff 
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interval  of,  20 
of  Legendre  series,  580 
of  power  series,  2,  7,  9  ff,  20 
radius  of,  58 

of  a  series  to  a  function,  23 
tests  for,  10  ff 

convolution,  444,  446,  447,  471 
cooling,  Newton’s  law  of,  400 
coordinate  lines,  522 
coordinates 
bipolar,  525  ff 
curvilinear,  521  ff 
cylindrical,  260,  521,  525  ff  ( see  also 
cylindrical  coordinates) 
elliptic  cylinder,  524  ff 
general,  523,  525,  529,  531 
orthogonal,  525,  533 
in  n  dimensions,  143 
non-orthogonal,  533 
parabolic,  525  ff 
parabolic  cylinder,  524  ff 
polar,  258  ff  (see  also  polar  coordi¬ 
nates) 

spherical,  261  ff,  529  (see  also  spher¬ 
ical  coordinates) 
and  variables,  258 
coordinate  surfaces,  522 
coordinate  transformations,  Chapters 
3  and  10.  See  also  transforma¬ 
tions;  linear  transformations; 
orthogonal  transformation; 
rotation 

Coriolis  acceleration,  487 
correspondence  principle,  197 
cosh  2,  70 
cosine 

of  angle  between  vectors,  102 
of  angles  between  axes,  164 
of  complex  number,  67 
Fourier  series,  366 
Fourier  transform,  382 
hyperbolic,  70 
inverse,  74 

power  series  for,  26,  69 
cos  n6  as  a  determinant,  96 
Coulomb’s  law,  304,  320 
counting,  fundamental  principle  of, 
737 

coupled  pendulums,  490 


covariant  vector  or  tensor,  530  ff 
Cramer’s  rule,  93-94 
creation  operators,  607 
critically  damped  motion,  413 
cross  product,  103  ff,  276 
in  tensor  form,  516 
cumulative  distribution  function,  748 
curl,  296 

and  angular  velocity,  325 
and  conservative  fields,  302  ff,  324, 
331  ff 

in  orthogonal  coordinates,  527 
and  Stokes’  theorem,  313,  324,  327  ff 
in  tensor  notation,  511 
curvature,  435 
curve 

arc  length  of,  250,  266,  521  ( see  also 
arc  length) 
integral  along,  299 
simple,  330,  674 
smooth,  674 
tangent  line  to,  203 
curve  fitting,  214,  581 
curvilinear  coordinates,  521.  See  also 
orthogonal  curvilinear  coordi¬ 
nates 

cyclic  group,  172 
cycloid,  482,  483,  546 
cylinder  functions,  587.  See  also  Bessel 
functions 

cylindrical  coordinates,  260 
arc  length  element,  266 
curl,  527 
div,  298,  526 
grad,  294,  525 
Laplace’s  equation,  639 
Laplacian,  298,  527 
scale  factors  and  unit  basis  vectors, 
522 

velocity  and  acceleration,  523 
volume  element,  260,  261,  524 

damped  motion,  413-414 
damping,  critical,  413 
d-c  generator  voltage,  374 
decay  constant,  39,  395,  402 
definite  integrals.  See  integrals 
deformation  of  elastic  membrane,  148 
degeneracy,  152,  647 
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degenerate  subspace,  153 
degree  mode,  49 
degrees  and  radians,  49 
V  (del),  296 
S  (delta,  variation),  493 
S  function  (Dirac  delta  function) ,  449  ff 
definition,  452 
derivatives  of,  454 
distribution  (generalized  function), 
450 

formulas  involving,  455 
Fourier  transform  of,  454 
generalized  function  identities,  455 
generalized  functions,  450 
impulse,  449 

Laplace  transform  of,  453 
operator  equations,  456,  457 
physical  applications,  450,  454,  457 
in  2  or  3  dimensions,  456 
unit  impulse,  449 

S  (Kronecker  delta),  138-139,  508  ff 
use  in  formulas,  510  ff 
in  general  coordinates,  531 
inertia  tensor,  505,  519 
isotropic  tensor,  509 
DeMoivre’s  theorem,  64 
density 

mass,  246  ff,  250 

probability,  753  (see  also  probabil¬ 
ity  functions) 
source  and  sink,  714 
dependent  equations,  87 
dependent  functions,  133 
dependent  variable 
differential  of,  193 
in  Euler  equation,  479 
missing  in  differential  equation,  430 
derivatives.  See  also  differentiation 
using  differentials,  201 
directional,  290  ff,  710  (Problem  15) 
of  functions  of  2,  668 
of  integrals  (Leibniz’  rule),  233  ff 
normal,  293 

partial,  189  (see  also  partial  differ¬ 
entiation) 

of  products  (Leibniz’  rule),  567 
of  vectors,  285 
determinant  (s),  89  ff 
of  coefficients,  93,  148 


cofactors,  90 
evaluating,  89-90 
formula  using  e,;^,  509 
Laplace  development,  90  ff 
of  a  matrix,  89 
minor,  89 
order  of,  80 
product  of,  118 
secular,  148 
useful  facts  about,  91 
Wronskian,  133,  136  (Problem  16) 
diagonalization  of  matrices,  148  ff 
applications,  162  ff 
Hermitian,  153 
simultaneous,  158 
symmetric,  154 

differential  equations.  See  ordinary 
differential  equations;  partial  dif¬ 
ferential  equations 

differential  operators,  409,  415-416, 
566 

differentials,  193 

approximate  calculations  using, 

196  ff 

change  of  variables  using,  228  ff 
exact,  305,  331 
finding  derivatives  using,  200 
of  independent  and  dependent  vari¬ 
ables,  193 
principal  part,  194 
relation  between,  in  two  coordinate 
systems,  529 

tangent  approximation,  193 
total,  193,  194 

differentiation.  See  also  derivatives 
chain  rule  for,  199  ff,  203  ff 
of  Fourier  series,  369 
implicit,  201,  202  ff,  215 
of  integrals,  233  ff 
partial,  188  ff  (see  also  partial  dif¬ 
ferentiation) 
of  power  series,  23 
of  vectors,  285  ff 
diffraction,  Fresnel,  37 
diffusion  equation,  297,  619,  628  ff. 

See  also  heat  flow 
diffusivity,  619 

dimension  of  a  vector  space,  144 
dipole  moment,  574 


INDEX  817 


Dirac  delta  function.  See  5  function 
directional  derivative,  290  ff,  710 
direction  cosines,  164,  498 
direction  field,  393,  394 
direct  product,  501 
Dirichlet  conditions,  356  ff 
Fourier  integrals,  379 
Fourier  series,  356,  362 
Legendre  series,  581 
Dirichlet  problem,  631 
discriminant,  46 
disk  of  convergence,  58 
dispersion  relations,  698 
dispersion,  variance,  747 
displacement  vector,  286,  498 
distance 

minimum,  214,  216,  221-222,  225 
between  points,  143 
point  to  line,  110-111 
point  to  plane,  110 
between  skew  lines,  111 
distinguishable  arrangements,  741 
distribution  functions,  748 
distribution  (s) 
binomial,  756 
continuous,  750,  751 
discrete,  750,  751 
generalized  function,  450 
joint,  754 
normal,  761 
Poisson,  767 

distributive  law,  102,  104 
divergence,  296 

in  cylindrical  coordinates,  298,  526 
in  orthogonal  coordinates,  526 
physical  meaning,  316 
in  tensor  notation,  533 
divergence  theorem,  318  ff 
and  Gauss’s  law,  318,  320 
use  of,  319 
divergent  series,  6  ff 
dividing 

complex  numbers,  51,  63 
series,  23,  27-28,  59 
dnit,  558 

dot  meaning  d/dt ,  165 
dot  product,  101,  276 
double  factorial,  29 
double  integrals,  242  ff 


double  pendulum,  490 
drumhead,  vibrating,  644  ff 
dual  tensors,  512 
dummy  index,  502 
dummy  variable,  380,  541 

earth,  gravitational  potential  of,  484 
eccentricity,  557 
effective  value  of  current,  348 
eigenfunctions.  See  eigenvalue  prob¬ 
lems 

eigenplane,  153 

eigenvalue  problems,  135,  148  ff 
boundary  value  problems,  625 
differential  equations,  566 
linear  equations,  135 
matrices,  148  ff 
partial  differential  equations, 
625-626,  630,  632,  636 
eigenvalues,  148  ff.  See  also  eigen¬ 
value  problems 
complex,  156 
real,  154 

eigenvectors,  148  ff.  See  also  eigen¬ 
value  problems 

Einstein  summation  convention,  502  ff 
elastic  medium: 

deformed  membrane,  148 
string  in,  665 

electric  charge,  18,  304,  317,  320-322 
problems,  289,  323,  336 
electric  circuits,  77,  390-391 
complex  impedance  of,  77 
differential  equation  for,  391,  414,  444 
impulse  applied  to,  450  ff 
electric  current,  329-330,  344.  See 
also  electric  circuits 
distribution  in  wire  (skin  effect) ,  596 
root-mean-square,  348 
electric-dipole  moment,  573-575 
electric  field,  304-305,  317,  320-322, 
712-713,  714  ff 
problems,  295,  308,  323,  716 
electric  oscillations,  340,  426  ff,  444 
electric  quadrupole  moment,  573-575 
electric  resistance,  197 
electromagnetic  force,  285,  289,  336 
electromagnetic  theory,  329 
electron,  42,  394,  467 
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electrostatic  potential,  304 

of  charge  distribution,  492,  573  ff, 
652-655 

by  conformal  mapping,  711  ff 
Legendre  expansion  of,  571  ff, 

574  (Problem  15) 
maximum  on  boundary,  720 
by  method  of  images,  657 
multipole  expansion  of,  571-573 
of  point  charge  outside  grounded  sphere, 
655  ff,  657-658 
Poisson’s  equation  for,  652  ff 
zero  at  infinity,  655 
elementary  functions,  60 
of  complex  numbers,  60 
derivatives  of,  60,  61 
power  series  for,  26 
elementary  row  operations,  86 
elements 

arc  length,  250,  259,  266-267,  521  ff 

area,  244  ff,  250,  258,  267,  271 

cylindrical  coordinates,  260-261,  264,  266 

of  a  determinant,  89 

of  a  group,  172 

mass,  251 

of  a  matrix,  89 

polar  coordinates,  258-259,  266,  521 
spherical  coordinates,  261,  264-266,  524 
surface  area,  255,  270  ff,  317 
volume,  243,  246,  253,  260  ff,  524 
ellipse,  163 
arc  length,  556 
principal  axes  of,  163 
ellipsoid,  box  inscribed  in,  218  ff 
elliptic  cylinder  coordinates,  524  ff 
elliptic  functions,  558 
elliptic  integrals,  555 
endpoint  and  boundary  point  prob¬ 
lems,  223  ff 

energy 

conservation  of,  433 
of  harmonic  oscillator,  342,  348,  545 
kinetic,  342,  348,  433,  485,  545 
problems,  488-491,  545 
of  a  particle,  433 
potential,  303,  433,  485 
in  a  sound  wave,  373 
equally  likely,  723 


equation(s).  See  also  individual  equa¬ 
tions 

auxiliary,  409  ff 
characteristic 

for  differential  equation,  409 
for  matrix,  148 
complex,  54  ff 
of  continuity,  317 
differential,  Chapters  8,  12,  13 
homogeneous  algebraic,  134  ff 
inconsistent,  86,  87 
indicial,  586 
linear  algebraic,  82  ff 
of  lines  and  planes,  106  ff 
of  normal  line,  109,  293 
partial  differential,  Chapter  13 
of  surfaces  of  revolution,  257 (Prob¬ 
lem  16) 

of  tangent  line,  203 
of  tangent  plane,  293 
equator,  239 
equipotentials,  290 
erfc,  549  ff 
erfi,  548 
error 

in  alternating  series,  34 
in  asymptotic  series,  550-551 
probable,  774 
relative,  197,  775 
in  series  computation,  34 
standard,  722 
error  function,  547 
errors,  combination  of,  772  ff 
escape  velocity,  435 
essential  singularity,  680 
Euclidean  space,  142  ff 
complex,  146  ff 

Euler  (Cauchy)  differential  equation, 
434 

Euler  (Lagrange)  equation,  474,  477  ff 
Euler’s  formula,  61 
Euler’s  theorem  on  homogeneous 
functions,  238 

evaluating  integrals,  243  ff,  687  ff 
even  functions,  364 
events,  723 
compound,  730 
independent,  731 
mutually  exclusive,  723 
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exact  differential,  305,  331 
exact  differential  equations,  405 
expectation,  expected  value,  747 
experimental  measurements,  770  ff 
exponential  Fourier  series,  358  ff 
exponential  function,  60,  419 
complex,  67,  420  ff,  428-429 
exponential  integral,  552 
exponential  series,  26,  61 
exponential  shift,  424 
exponents,  laws  for  complex  numbers,  73 
extremal,  474,  475 

factorial,  4,  538 
factor,  integrating,  401,  405 
double,  29 
function,  4,  538 
gamma  function,  538 
Stirling’s  formula,  552 
zero  factorial,  4,  538 
falling  body,  392,  400 
falling  book,  506 
Faltung,  446 

family  of  curves,  395-396 
Fermat’s  principle,  473,  484 
Fermi-Dirac  statistics,  742 
fields,  290 

conservative,  302,  330,  332 
electric  (see)  electric  fields 
gravitational  (see)  gravitational  force 
irrotational,  332 
lamellar,  332 
magnetic,  329,  335,  714 
scalar,  290,  303  ff,  332 
solenoidal,  332 
tensor,  520 

vector,  290,  325,  332,  520 
velocity,  290,  314 
filter,  430 
finite  groups,  172 
first  integral 

of  a  differential  equation,  433 
of  the  Euler  equation  479 
Florida,  highest  point,  223 
flow  of  water,  317,  325 
circulation,  326 
by  conformal  mapping,  713  ff 
equation  of  continuity,  317 
irrotational,  713 


out  of  channel,  714  ff 
sources  and  sinks,  714  ff 
fluid  flow,  325.  See  also  flow  of  water 
fluid,  incompressible,  324 
flux,  316 
force 

conservative,  303  ff,  325 
electromagnetic,  329 
electrostatic,  304 
friction,  302 

gravitational  (see)  gravitational  force 
impulsive,  449 

inverse  square,  304,  320,  468,  571,  652 
moment  of,  277 
periodic,  427 

forced  vibrations,  417  ff,  426ff 
forcing  function.  See  forced  vibrations 
Fourier-Bessel  series,  641 
Fourier  coefficients,  350  ff 
Fourier  integrals,  378  ff.  See  also  Fourier 
transforms 

Fourier  integral  theorem,  379 
Fourier  series,  Chapter  7 

applications  in  physics,  345,  372  ff, 
428  ff,  623  ff,  629-638 
complex  form,  358  ff 
convergence,  356 
cosine  series,  366 
differentiating,  369 
Dirichlet  conditions,  355-358 
double,  643 

even  and  odd  functions,  364  ff 
exponential,  358  ff 
in  ordinary  differential  equations,  428 
Parseval’s  theorem,  375 
in  partial  differential  equations,  Chap¬ 
ter  13,  Sections  1-4 
period  2 1,  360  ff 
period  27r,  350  ff 
sine-cosine  form,  350  ff 
sine  series,  366 
triple,  643 

Fourier  transform(s),  378  ff 
applications  of,  454 
convolution,  447 
cosine,  382 
of  delta  function,  454 
Dirichlet  conditions,  379 
exponential,  379 


820  INDEX 


factor  1/(27t),  381 
and  Fourier  series,  378 
Parseval’s  theorem,  383 
partial  differential  equations,  solu¬ 
tions  to,  660-662 
sine,  381 
4’s  group,  172 
free  vibrations,  80,  417 
frequencies,  characteristic,  165,  636, 
644 

frequency,  343 
angular,  80,  344 
continuous,  378 

of  damped  vibrations,  413,  414 
of  light,  346,  378 
modulation,  617 
natural  undamped,  427 
resonance,  427 
in  a  sound  wave,  345,  373 
frequency  function,  746.  See  also  prob¬ 
ability  function 

Fresnel  integrals,  37,  549  (Problem  6), 
701  (Problem  41) 
friction,  302,  412 

fringing  at  edge  of  capacitor,  714-716 
Frobenius  method,  585  ff,  605 
Fuch’s  theorem,  605 
functions 

analytic,  667  ff 
associated  Legendre,  583,  648 
average  value  of,  347  ff 
Bessel,  587  ff  (see  also  Bessel  func¬ 
tions) 

beta,  542  ff 
bounded,  356 

characteristic,  566,  625  (see  also  eigen¬ 
value  problems) 
complementary,  417 
of  a  complex  variable,  Chapters  2 
and  14 

conjugate  harmonic,  672 
cumulative  distribution,  748 
delta,  449  ff 

dependent  and  independent,  133 
discontinuous,  224,  346 
elementary,  60,  668 
elliptic,  558 
error,  547,  763 

complementary,  547 


even,  364 

exponential,  60,  419 
factorial,  4,  538 
forcing,  417,  426,  444,  449 
gamma,  538  ff,  694 
generalized,  450 

harmonic,  671  ff  (see  also  Laplace’s 
equation) 

Hermite,  607  ff 
holomorphic,  667 

homogeneous,  238,  406,  407  (Prob¬ 
lem  21) 
hyperbolic,  70 
Kelvin,  596 
Laguerre,  609 
Legendre,  564  ff 
In,  72,  667 
of  a  matrix,  121 
meromorphic,  681 
monogenic,  667 
multiple- valued,  667 
Neumann,  591 
normalized,  578  ff,  584 
not  having  a  derivative,  669 
odd,  364 

orthogonal,  575  ff 
complete  sets  of,  575 
orthonormal,  579 
periodic,  343 
probability,  745  ff 
rational,  60 
regular,  667 

of  several  variables,  Chapter  4  (see 
also  partial  differentiation) 
sinusoidal,  341 
transfer,  444 
fundamental,  345 

fundamental  principle  of  counting,  737 
fundamental  theorem  of  algebra,  701 

9 ,  9ij,  gij,  533 
gamma  function,  538ff 
asymptotic  series  for,  553 
in  Bessel  function  formulas,  589 
definition:  for  p  >  0,  538; 
for  p  <  0,  540 

incomplete,  551  (Problem  2) 
recursion  relation,  538,  539 
Stirling’s  formula,  552 
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Gaussian  distribution,  547 
Gauss’s  law,  318,  320 
Gauss’s  theorem,  318 
gdw,  559 

general  coordinate  systems,  529  ff 
generalized  function,  450 
generalized  power  series,  585 
general  solution  of  a  differential  equa¬ 
tion,  392,  402,  410,  411,  418 
generating  function: 

Bessel  functions,  616 
Hermite  polynomials,  609 
Laguerre  polynomials,  610 
Legendre  polynomials,  569 
geodesics,  472  ff 
on  a  cone,  481 
in  a  plane,  477 
geometrical  meaning  of: 
complex  equations,  54,  55 
covariant  and  contravariant  vectors, 
530 

differentials,  193 
linear  transformations,  126  ff 
real  equations,  188 
total  differentials,  194 
geometric  progression,  geometric  se¬ 
ries,  1 

Gibb’s  phenomenon,  357 
gradient,  290  ff,  294,  525 
normal  to  surface,  293 
in  orthogonal  coordinates,  525 
in  polar  coordinates,  294,  525 
in  tensor  notation,  533 
grading  “on  a  curve”,  766 
Gram-Schmidt  method,  145-146, 182- 
183 

graphical  representation  of 
complex  equations,  55 
complex  numbers,  45 
functions,  188 

roots  of  a  complex  number,  65  ff 
gravitational  field.  See  gravitational 
force 

gravitational  force,  302  ff,  392,  571  ff 
problems,  308,  416,  435,  467, 
489-90,  494 

gravitational  potential,  290,  330.  See 
also  potential 
inside  earth,  308,  484 


outside  earth,  308 
Green  functions,  461,  657 
response  to  unit  impulse,  461 
solution  of  ordinary  differential  equa¬ 
tions,  462  ff 

solution  of  partial  differential  equa¬ 
tions,  657-658 

variation  of  parameters,  464,  465 
Green’s  theorem  in  the  plane,  675  ff 
Green’s  theorems,  324 
grid  of  curves  or  surfaces,  258,  261 
group(s),  172  ff 
Abelian,  175 
character,  175 
class,  175 
closure,  175 
conjugate  element,  175 
cyclic,  172 
definition,  172 
element,  172 
finite,  172  ff 
4’s  group,  172 
infinite,  176 
isomorphic,  174 
multiplication  table,  173  ff 
order,  173 
product,  172,  173 
representation,  172 
dimension  of,  176 
irreducible,  176 
matrix,  175 
subgroup, 173 
symmetry  group: 

of  equilateral  triangle,  174 
of  rectangle,  178  (Problem  11) 
of  square,  178  (Problem  10) 
unit  element,  172 
growth  of  bacteria,  1,  399 
Gudermannian,  559 

half-life,  399 
half-value  thickness,  399 
Hamiltonian,  233 
Hamilton’s  principle,  485 
hanging  chain,  473 
Hankel  functions,  595 
harmonic  analysis,  345 
harmonic  functions,  671,  720.  See  also 
Laplace’s  equation 
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harmonic  oscillator.  See  simple  har¬ 
monic  motion 
harmonics,  373 
harmonic  series,  11,  12 
alternating,  17  ff 

heat  flow,  291,  619,  621  (Problem  4), 

628  ff 

in  a  bar  or  slab,  629,  659 
across  a  boundary,  627-631 
conformal  mapping  solution,  711  ff 
differential  equation,  390  (see  also 
partial  differential  equations) 
Laplace  transform  solution,  659-660 
partial  differential  equation,  619,  628  ff 
separation  of  variables  solution,  628 
temperature  gradient,  293,  295 
Helmholtz  equation,  620,  629,  645 
Hermite  functions,  607  ff 
differential  equation,  607 
ladder  (raising  and  lowering)  oper¬ 
ators,  607 

Hermite  polynomials,  608 
differential  equation,  608 
generating  function,  609 
normalization,  608 
orthogonality,  608 
table  of,  804 
Hermitian  adjoint,  137 
Hermitian  conjugate,  137 
Hermitian  matrix,  138 
higher  derivatives  by  implicit  differen¬ 
tiation,  202 

highest  point  in  Florida,  223 
high  fidelity,  346 
Hilbert  transform,  698 
histogram,  751 
holomorphic  function,  667 
homogeneous  algebraic  equations,  134 
homogeneous  differential  equations,  406, 
408  ff 

homogeneous  functions,  238,  406, 

407  (Problem  21) 

Hooke’s  law,  519 
hoop,  490 

horizon,  distance  to,  43 
hottest  points  of  a  plate,  224 
hydrodynamics,  713.  See  also  flow  of 
water 


hydrogen  atom,  614  (Problems  27,  28), 
652  (Problem  22) 
hyperbolic  Bessel  functions,  595 
hyperbolic  functions,  70 
identities,  71 
inverse,  74 

integrals  leading  to,  75 
i,  46  ff 

identities.  See  also  recursion  relations 
Bessel,  592 
Legendre,  570 
trigonometric,  69,  71 
vector,  339 
identity  matrix,  118 
images  in  conformal  mapping,  705 
images,  method  of,  657 
imaginary  axis,  48 
imaginary  numbers,  Chapter  2 
imaginary  part  of  a  complex  number, 

47 

impedance,  complex,  78 
implicit  differentiation,  201,  202  ff,  215 
improper  rotation,  514 
impulse,  unit  impulse,  449 
incomplete  gamma  function,  551 
incompressible  fluid,  324 
inconsistent  equations,  86,  87 
indefinite  integrals,  241  ff 
independent  events,  731 
independent  random  variables,  756 
independent  variable 
change  of,  431,  434 
missing,  in  differential  equation,  431 
differential  of,  193 
in  Euler  equation,  480 
in  maximum,  minimum  problems,  212 
in  partial  differentiation,  207 
independent  vectors,  132 
indeterminate  forms,  38 
index,  dummy,  502 
index  notation,  138 
index  of  refraction,  474,  478 
indicial  equation,  586 
inertia,  moment  of,  252  ff,  505.  See 
also  moment  of  inertia 
inertia  tensor,  505  ff,  519 
extended  body,  506 
matrix,  507 
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point  mass,  506 
principal  axes,  507 
infinite  group,  176 
infinite  series,  Chapter  1.  See  also 
series;  power  series;  Fourier 
series;  Legendre  series 
complex,  56 
infinity,  702  ff 

asymptotic  series  about,  549-551 
residues  at,  703 

inhomogeneous  linear  differential  equa¬ 
tion,  417 

initial  conditions,  393 
inner  product,  100 
insulated  boundary,  631 
integral  representations  of  functions 
analytic,  676 
Bessel,  617 
beta,  542 
error,  549 
gamma,  538 
integrals 

arc  length,  250 
area,  242,  249 
around  infinity,  703 
beta  function,  542 
Bromwich,  696 
change  of  variables  in,  258  ff 
contour,  674  ff,  676-678,  687  ff,  696 
in  cylindrical  coordinates,  260  ff 
differentiating,  233 
divergence  theorem,  318 
double,  242  ff,  249  ff,  262,  270 
elliptic,  555 
error  function,  547 
evaluated  by  contour 
integration,  687  ff 
evaluated  in  terms  of  inverse  hyper¬ 
bolic  functions,  75 
evaluated  using  complex  exponen¬ 
tials,  69,  351 

evaluated  using  series,  37,  548 
evaluating,  243  ff,  687  ff 
Fourier,  378 

Fresnel,  37,  549  (Problem  6),  701 
(Problem  41) 
gamma  function,  538 
improper,  539 
indefinite,  241  ff 


iterated,  243  ff 
as  “limit  of  sum” ,  249 
limits  on,  243  ff 
line,  299 
mass,  246,  250 

moment  of  inertia,  252,  259,  264  ff 
in  polar  coordinates,  258  ff 
of  power  series,  23,  37 
principal  value  of,  692 
by  residues,  687  ff 
setting  up,  242,  253 
in  spherical  coordinates,  260  ff 
Stokes’  theorem,  328 
surface,  253,  270  ff 
through  a  simple  pole,  691 
triple,  242  ff 
volume,  242  ff 

volume  and  surface  of  revolution, 
252  ff 

integral  test,  11  ff 
integral  transforms,  378  ff,  437  ff 
Fourier,  378  ( see  also  Fourier  trans¬ 
forms) 

Hilbert,  698 

Laplace,  437  ff  ( also  see  Laplace  trans¬ 
forms) 

integrating  factor,  401,  405 
integration.  See  integrals 
intensity  of  a  sound  wave,  376 
interior  maxima  and  minima,  223  ff 
interval  of  convergence,  20 
invariant,  500 

inverse  exponential  shift,  424 
inverse  Fourier  transform  (Bromwich 
integral),  696 

inverse  hyperbolic  functions,  74 
integrals  expressed  in  terms  of,  75 
inverse  Laplace  transforms,  440 
by  contour  integration  (Bromwich 
integral),  696 

of  a  product  (convolution),  445  ff 
using  the  table,  440  ff 
inverse  matrix,  119,  137 
inverse  of  a  product  of  matrices,  140 
inverse  operators,  424 
inverse  square  force,  304,  320,  571, 
652 

inverse  trigonometric  functions,  74 
invertible  matrix,  119 
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irrotational  field,  325 
isolated  singular  point,  670 
isoperimetric  problems,  491 
isothermals,  290,  295,  712 
isotropic  tensors,  509 
Kronecker  delta,  509 
Levi-Civita  tensor,  510 
product  of,  510 

3,  49 

Jacobian,  261  ff,  283,  536  (Problem 
12),  710  (Problems  12,  13) 
Jacobi  forms  of  elliptic  integrals,  555 
Jacobi  identity,  141,  284,  514 
joint  probability,  754 

Kelvin  functions,  596 
Kepler’s  second  law,  495 
kinetic  energy,  342,  348,  433,  485,  545. 
See  also  energy 

kinetic  theory  applications,  237,  239 
kite,  116 

Klein-Gordon  equation,  665 
Kronecker  S,  138-139,  508  ff 
in  general  coordinates,  531 

ladder  (raising  and  lowering)  opera¬ 
tors,  607,  614  (Problems  29,  30) 
Lagrange  multipliers,  214,  216  ff,  491 
Lagrange’s  equations,  485  ff 
in  polar  coordinates,  486 
in  spherical  coordinates,  487 
Lagrange’s  identity,  106,  284,  514 
Lagrangian,  485  ff,  545 
Laguerre  functions,  Laguerre  polyno¬ 
mials,  609  ff 
associated,  610 
generating  function,  610 
list  of,  804 

recursion  relations,  610,  611 
Rodrigues’  formula,  609, 
lamellar  field,  332 
lamina,  248,  256 
Laplace  development,  90  ff 
Laplace’s  equation,  297,  619,  621  ff, 
638  ff,  648  ff,  711 

conformal  mapping  solutions,  711  ff 
in  cylindrical  coordinates,  527,  639 
electrostatic  potential  problems,  712 
fluid  flow  problems,  713 


Fourier  transform  solution,  660  ff 
gravitational  potential,  652 
in  one  dimension,  629 
in  orthogonal  coordinates,  527 
in  polar  coordinates,  229 
separation  of  variables,  622 
in  spherical  coordinates,  298,  527, 
648 

temperature  problems,  621  ff,  711, 
712 

in  tensor  notation,  533 
in  two  dimensions,  671 
Laplace  transforms,  437  ff 
convolution,  444  ff 
of  delta  functions,  453 
of  derivatives,  440 
evaluating  integrals  using,  442 
finding,  437  ff 

inverse,  440  ff,  444,  696  ( see  also 
inverse  Laplace  transform) 
solving  ordinary  differential  equa¬ 
tions  by,  440  ff 

solving  partial  differential  equations 
by,  659 

table  of,  469-471 

Laplacian,  297.  See  also  Laplace’s  equa¬ 
tion 

Laurent  series,  678  ff 
Laurent’s  theorem,  678 
law  of  large  numbers,  759 
law  of  reflection,  474 
“least  squares,”  214  (Problem  16),  581, 
774  (Problem  1) 
left-handed  system,  515 
Legendre  equation,  564.  See  also  Leg¬ 
endre  polynomials 

Legendre  forms  of  elliptic  integrals,  555 
Legendre  functions,  564  ff.  See  also 
Legendre  polynomials 
associated,  583 
first  and  second  kinds,  567 
Legendre  polynomials,  566  ff 

applications,  571  ff,  581,  648  ff,  656- 
657 

generating  function,  569 
by  Gram  Schmidt  method,  182-183 
list  of,  803 
normalization,  578  ff 
orthogonality,  577 
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raising  and  lowering  operators,  607, 
618 

recursion  relations,  570 
Rodrigues’  formula,  568 
Legendre  series,  580  ff,  649 
Legendre  transformation,  231, 

233  (Problems  10  to  13) 
Leibniz’  rule: 

for  differentiating  integrals,  233  ff 
for  differentiating  products,  567 
lengthening  pendulum,  598 
length  of  arc.  See  arc  length 
length  of  a  vector,  96 
lenses,  198 
level  lines,  290 
level  surfaces,  290 
lever  problem,  277 
Levi-Civita  symbol  508  ff,  515 
in  determinant  formula,  509 
isotropic  tensor,  510 
products  of,  510 
L’Hopital’s  rule,  38,  685 
light.  See  also  optics  applications 
absorption,  399 
color,  346,  378 
continuous  spectrum,  378 
frequency,  346,  378 
parabolic  mirror,  407 
pulse,  382 
scattering,  389 
waves,  346,  378 
limits  on  an  integral,  243  ff 
line(s) 

distance  between  skew,  111 
distance  from  point  to,  110-111 
equations  of,  106  ff 
parametric,  108 
symmetric,  107-108 
normal,  109,  293 
perpendicular  to  plane,  109 
tangent  to  curve,  203 
linear  absorption  coefficient,  399 
linear  algebra,  Chapter  3 
linear  algebraic  equations,  82  ff.  See 
also  determinants;  matrices; 
matrix 

Cramer’s  rule,  93-94 
homogeneous,  134 
matrix  solution  of,  119-120 


solution  in  vector  form,  134 
standard  form,  84 
linear  combination,  124 
linear  dependence,  132 
linear  differential  equations,  401,  408, 
417.  See  also  differential 
equations 

linear  functions,  124 
linearly  independent  functions,  133 
linearly  independent  vectors,  144 
linear  operators,  124,  425,  438 
linear  space.  See  linear  vector  space 
linear  transformations,  125.  See  also 
transformations;  orthogonal 
transformations 

deformation  of  membrane,  148  ff 
eigenvectors  of,  148  ff 
matrix  of,  125,  148 
orthogonal,  126 

linear  triatomic  molecule,  167  ff 
linear  vector  space,  143  ff,  179  ff 
basis,  143  ff  ( also  see)  basis 
complex  Euclidean,  146  ff 
definition,  142,  179 
dimension,  144 
Euclidean,  142  ff 
examples,  143  ff,  180-181 
function  space,  180,  414-415  (Prob¬ 
lems  13  to  18) 
general,  179 

Gram-Schmidt  process,  145, 182-183 
infinite  dimensional,  183 
inner  product,  144,  146,  181 
inner  product  space,  181 
norm,  144,  146,  181 
orthogonality,  144,  146,  181 
orthonormal  basis,  145,  182 
Schwarz  inequality,  145,  146,  182 
span,  143 
subspace,  143 

line  element,  521.  See  also  arc  length 
line  integrals,  299  ff 
lines  of  force,  396 
Liouville’s  theorem,  718 
In,  72.  See  also  Logarithm 
Logarithm(s): 
to  base  e,  72 
branch  of,  693 
of  complex  numbers,  72 


826  INDEX 


and  inverse  hyperbolic  functions,  74 
of  negative  numbers,  72 
series  for,  26 
Lorenz  equations,  96 
loudness,  373 

lowering  and  raising  operators  (ladder 
operators),  607,  614,  618 
lowering  and  raising  tensor  indices,  532 
loxodrome,  269 

Maclaurin  series,  24  ff.  See  also  series; 

power  series 
magnetic  field,  329,  714 
magnitude  of  a  vector,  96 
main  diagonal,  118 
mapping 

functions  of  2,  705  ff,  710 
of  the  plane,  125  ff 
mass,  center  of,  251  ff 
matrices,  Chapter  3.  See  also  matrix 
addition  of,  115 
commutator  of,  117 
conformable,  116 
and  determinants  compared,  115 
elementary  row  operations,  86 
equal,  86,  114 
inverse  of  product  of,  140 
multiplication  of,  115,  116 
operations  on,  114  ff 
row  reduction,  83  ff 
similar,  150 

table  of  special,  137-138 
transpose  of  product  of,  139 
and  vectors  or  tensors,  114,  503 
matrix,  Chapter  3,  83ff.  See  also  matrices 
adjoint,  137 
anti-Hermitian,  138 
anti-symmetric,  138 
augmented,  85 
block  diagonalized,  176 
characteristic  equation  of,  148 
of  coefficients,  84 
column,  114 
dagger,  137 
determinant  of,  89 
diagonalizing,  148  ff 
eigenvalues  of,  148  ff 
of  eigenvectors,  150 
equations,  114 


functions  of,  121 
Hermitian,  137,  138 
Hermitian  conjugate,  137 
identity,  118 

index  notation,  84,  116,  138  ff 
inverse,  119,  137 
invertible,  119 

multiplication  by  a  number,  114 

normal,  138 

null,  117 

operators,  125 

orthogonal,  126,  127,  138 

power  of,  121 

pure  imaginary,  138 

rank  of,  86,  94 

real,  138 

reflection,  128  ff,  155  ff 
rotation,  120,  127,  129,  155  ff 
row,  114 
singular,  119 
skew-symmetric,  138 
special,  137 
symmetric,  138 
trace  of,  140 

of  a  transformation,  125,  148 
transpose  conjugate  of,  137 
transpose  of,  84,  137 
unit,  118 
unitary,  138,  154 
zero,  117 

maxima  and  minima,  211  ff 

boundary  point  or  endpoint  prob¬ 
lems,  223  ff 

in  calculus  of  variations,  472  ff 
with  constraints,  214 
of  harmonic  functions,  720 
interior,  224  ff 

using  Lagrange  multipliers,  214  ff 
second  derivative  tests,  213  (prob¬ 
lems  1,  2) 

Maxwell-Boltzmann  statistics,  742 
Maxwell  equations,  330,  621  (Problem  3) 
mean  free  path,  754 
mean  value,  of  See  also  average  value 
a  product  of  random  variables, 

756  (Problem  14) 

a  random  variable,  746,752,763,768 
a  set  of  measurements,  771  ff 
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a  sum  of  random  variables,  749  (Prob¬ 
lem  9) 

mean  value  theorem,  195 
measurements,  770  ff 
mechanics  applications.  See  accelera¬ 
tion;  energy;  motion  of  a  par¬ 
ticle;  velocity 
median,  771 
medians  of  triangle,  98 
membrane,  deformation  of,  148 
membrane,  vibration  of,  644  ff 
meromorphic  function,  681 
method  of  Frobenius,  585  ff,  605 
method  of  images,  657 
method  of  undetermined  coefficients,  421 
methods  of  counting,  736 
metric  tensor,  524,  532 

in  orthogonal  coordinates,  533 
raising  and  lowering  indices,  532 
minimum  distance: 

calculus  methods,  214,  216 
calculus  of  variations,  472  ff 
Lagrange  multipliers,  217-218,  221-222 
vector  methods,  109-111,  238  (Problem  2) 
minimum  surface  of  revolution,  479 
minor  of  a  determinant,  89 
mirror,  407 

mixed  second  partial  derivatives,  189 
not  necessarily  equal,  190 
reciprocity  relations,  190,  231,  306 
mixed  tensor,  531 
mobile,  44 
mode,  771 

mode  of  vibration,  165,  636,  645-646 
modern  physics.  See  also  quantum  mechanics 
charge  distribution  in  atoms,  573 
hydrogen  atom,  614  (Problems  27, 

28),  652  (Problem  22) 

Millikan  oil  drop  experiment,  400 
radioactive  decay,  39,  395,  399, 

402-403,  755,  768 
relativity,  42,  394 
waves,  342  (see  also  waves) 
modified  Bessel  functions,  595 
mod  N,  174,  178 

modulus  of  a  complex  number,  49 
modulus  of  an  elliptic  integral,  555 
Moebius  strip,  327 
moment,  277 


of  a  charge  or  mass  distribution,  573 
electric  dipole,  573-575 
first,  277 
of  a  force,  277 

of  inertia,  252,  265,  277,  505  (see 
also  inertia  tensor) 
problems,  255-257,  267-270, 

273-275,  508,  520 
quadrupole,  573-575 
second,  277 

monogenic  function,  667 
mothball,  394 

motion  of  a  particle,  4  (Problem  16), 

286.  See  also  acceleration;  force; 
Lagrange’s  equations 
in  a  circle,  283 

complex  notation,  56,  76,  80,  340-341 
damped,  413 
forced,  426 
under  gravity,  392 
problems,  308,  400,  435,  486 
in  polar  coordinates,  487 
probability  function  for,  750-753 
reduced  mass,  197 
simple  harmonic,  80 

problems,  344,  416,  754,  489-490 
multiple  integrals,  Chapter  5 
multiple  pole,  680 
multiple-valued  function,  667 
multiplication  (mod  TV ),  178 
multiplication  tables  for  groups,  173  ff 
multiplying.  See  product 
music.  See  sound  waves 
mutually  exclusive,  723 

nappe  of  a  cone,  263 

natural  frequency,  414,  427 

natural  logarithm,  72.  See  also  Logarithm 

n-dimensional  space,  143  ff 

negative  of  a  vector,  97 

Neumann  function,  591 

Neumann  problem,  631 

Newton’s  law  of  cooling,  400 

Newton’s  second  law,  80,  289,  390, 

394,  485 

Newton’s  third  law,  285 
nodal  line,  646 
nonanalytic  function,  669 
non-Cartesian  tensors,  529  ff,  531 
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noncommutative  operations,  103, 117, 132 
nonconservative  field,  302,  306 
nonlinear  differential  equation,  392,  396 
non-uniform  sample  space,  725 
norm,  96 

normal  cumulative  distribution,  547 
normal  derivative,  293 
normal  distribution,  547,  761,  769 
normal  error  curve,  761 
normal  frequencies,  166 
normalization  of 

Bessel  functions,  602-603 
Hermite  polynomials,  608 
Laguerre  polynomials,  610,  611 
Legendre  polynomials,  578  ff 
normal  line,  109,  293 
normal  matrix,  138 
normal  modes  of  vibration,  165 
normal  (perpendicular),  109 
normal  vector  to  a  plane,  109 
normal  vector  to  a  surface,  220,  271,  293 
null  matrix,  117 
number  of  poles  and  zeros,  694 

octopole  moment,  573,  575 
odd  functions,  364 
Ohm’s  law,  78 
one-sided  surface,  327 
operators  See  differential  operators;  lin¬ 
ear  operators;  vector  operators 
optics  applications,  79.  See  also  light 
combining  light  waves,  79 
Fermat’s  principle,  473,  484 
Fresnel  integrals,  37,  549  (Problem  6), 
701  (Problem  41) 
law  of  reflection,  474 
lenses,  198 

multiple  reflections,  79 
scattering,  389 
Snell’s  law,  474 
orbits,  468 
order: 

of  a  Bessel  function,  589 

of  a  determinant,  80 

of  a  differential  equation,  391 

of  eigenvalues  in  a  diagonal  matrix, 

152 

of  a  group,  173 

of  placing  electric  charges,  18-19 


of  a  pole,  680 
of  a  tensor,  496 
of  terms  in  a  series,  18 
ordinary  differential  equations,  Chap¬ 
ters  8  and  12.  See  also  partial 
differential  equations 
associated  Legendre,  583 
auxiliary  equation,  409  ff 
Bernoulli,  404,  431 
Bessel,  588,  593,  594 
boundary  conditions,  393 
change  of  variables  in,  406 
complementary  function,  417 
damped  motion,  413 
dependent  variable  missing,  430 
Euler  (Cauchy),  434 
Euler  (Lagrange),  485  ff 
exact,  405 

exponential  right-hand  side,  419 
family  of  solutions,  395-396 
first  integrals,  433 
forced  vibrations,  417  ff,  426  ff 
Fourier  series  solutions,  428 
free  vibrations,  412,  417 
Fuchs’s  theorem,  605 
generalized  power  series  solution,  585 
general  solution  of,  392,  395,  401, 
410,  411,  418 
Green  functions  for,  461 
Hermite,  607,  608 
homogeneous,  406,  408  ff 
independent  variable  missing,  431 
inhomogeneous,  408 
initial  conditions,  393 
integrating  factors,  401,  405 
Laguerre,  609  ff 

Laplace  transform  solution,  440  ff 
Legendre,  564 
linear,  391 

linear  first-order,  401  ff 
linear  second-order,  408  ff,  417  ff 
method  of  undetermined  coefficients, 
421 

nonhomogeneous,  408 
nonlinear,  391,  392,  396  ff, 
order,  391 

particular  solution,  392  ff,  397,  415  ff 
(see  also  particular  solution) 
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reduction  of  order,  434,  567  (Prob¬ 
lem  4),  606  (Problems  1-4) 
regular,  605 
second  solution,  606 
separable,  395  ff 
series  solutions,  Chapter  12 
simultaneous,  Laplace  transform  so¬ 
lution,  441 
slope  field,  393,  394 
solution  of,  391, 

Sturm-Liouville,  617 
variation  of  parameters,  464 
y'  missing,  430 

orthogonal  curvilinear  coordinates,  521  ff, 
708 

arc  length  element,  521 
basis  vectors,  522 
scale  factors,  522 
vector  operators  in,  525 
velocity,  524 
volume  element,  524 
orthogonal  functions,  575 
orthogonality 

of  Bessel  functions,  601  ff 
of  functions  in  Fourier  series,  351, 

601 

of  Hermite  polynomials,  608 
of  Laguerre  polynomials,  610,  611 
of  Legendre  polynomials,  577 
proof  using  differential  equation,  577 
of  solutions  of  a  Sturm-Liouville  equa¬ 
tion,  617 

of  vectors  in  3  dimensional  space, 

102 

of  vectors  in  n  dimensional  space, 

144 

with  respect  to  weight  function,  602 
orthogonal  matrix,  126,  127,  138 
orthogonal  similarity  transformation. 

See  orthogonal  transformation 
orthogonal  trajectories,  396 
orthogonal  transformation,  126  ff  See 
also  rotation;  reflection 
by  an  analytic  function,  708 
diagonalizing  a  symmetric  matrix 
by,  149  ff 
matrix,  127 
in  n  dimensions,  143 
to  normal  modes  of  vibration,  165 


to  principal  axes,  162 
similarity,  150 
in  3  dimensions,  155 
orthogonal  (perpendicular)  vectors, 

105,  144 
orthonormal 
basis,  145 
functions,  579 
oscillating  series,  7 
oscillator,  oscillatory  motion.  See  vi¬ 
brations 

oscillatory  (underdamped)  motion,  414 
outer  product,  501 
out  of  phase,  79 
overdamped  motion,  413 
overtones,  345,  373 

paddle  wheel  probe,  325 
parabolic  coordinates,  525 
parabolic  cylinder  coordinates,  524 
parabolic  mirror,  407 
parallel  axis  theorem,  255 
parallelogram,  186 
parallelogram  law,  97 
parallel  vectors,  102 
parameters,  variation  of,  464 
parametric  equations 
circle,  301 
curve,  301 
cycloid,  483  ff 
ellipse,  556 
line,  108 

parent  population,  770  ff 
Parseval’s  theorem,  375 
partial  derivatives,  Chapter  4.  See 
also  partial  differentiation 
cross  partials,  190 
of  functions  of  (x,y)  or  (r,  6),  189 
in  matrix  notation,  200 
in  thermodynamics,  190,  231 
partial  differential  equations,  Chapter  13 
change  of  variables  in,  228  ff 
derivations  of  620-621 
Fourier  series  solutions,  Chapter  13, 
Sections  1-4 

Green  function  solutions,  657-658 
heat  flow  or  diffusion  equation,  619, 
628  ff,  659 

Helmholtz  equation,  620,  629,  645 
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Laplace’s  equation,  619,  621,  638, 

648  ( see  also  Laplace’s  equation) 
Poisson’s  equation,  619,  652  ff,  657 
Schrodinger  equation,  620,  628,  631- 
632,  651  (Problems  18,  20-22) 
solution  by  Fourier  transforms,  660  ff 
solution  by  Laplace  transforms,  659  ff 
solution  by  separation  of  variables, 

622,  625,  628,  639,  645,  648 
wave  equation,  297,  620,  633  ff,  644  ff 
partial  differentiation,  Chapter  4.  See 
also  partial  derivatives 
chain  rule,  199  ff,  203  ff 
change  of  variables,  228  ff 
derivatives  not  reciprocals,  208 
differentials,  193 
implicit,  202 
of  integrals,  233  ff 
Lagrange  multipliers,  216  ff 
maximum  and  minimum  problems,  211  ff 
mixed  second  derivatives,  189-190 
notation,  189-190 
total  differential,  193  ff 
two-variable  power  series,  191  ff 
partial  fractions,  451 
partial  sum  of  a  series,  7 
particular  solution,  392ff,  397 
complex  exponentials,  420 
exponential  right-hand  side,  419  ff 
by  inspection,  418 
principle  of  superposition,  425,  428 
undetermined  coefficients,  421 
passive  transformation,  127 
Pauli  spin  matrices,  122 
pendulum(s),  38,  344,  545 
coupled,  490 
double,  491 
energy  of,  545 
large  vibrations  of,  557 
lengthening,  598 
period  of,  343,  545-546,  557 
seconds,  343 
shortening,  600 
simple,  38,  344,  545 
small  vibrations  of,  38,  344,  545 
spherical,  489 
period 

of  a  Fourier  series,  350  ff,  360  ff 
of  a  function,  343 


of  lengthening  pendulum,  599 
of  simple  harmonic  motion,  341 
of  simple  pendulum,  38,  344,  545 
periodic  function,  340,  343,  345 
permutation,  even,  odd,  509 
permutations,  737 
permutation  symbol  (Levi-Civita), 

508  ff 

perpendicular  axis  theorem,  252 
perpendicular  (orthogonal)  vectors,  105 
phase  of  a  complex  number,  49 
phase  space,  143 
plane 

complex,  47  ff 
distance  from  point  to,  110 
equations  of,  108  ff 
perpendicular  to  a  vector,  109 
tangent  to  a  surface,  293 
through  4  points,  136  (Problem  21) 
through  3  points,  109 
planet,  468 

plate,  conducting,  322 

plate,  hottest  and  coldest  points  of,  224 

plate,  temperature  in,  621,  661 

plotting 

complex  numbers,  48  ff,  62 
graphs  of  complex  equations,  54  ff 
roots  of  complex  numbers,  64  ff 
point 

boundary,  223  ff 
at  infinity,  702 
of  inflection,  212,  472 
maximum,  minimum,  211  ff 
in  n-dimensional  space,  143 
saddle,  212 

of  sample  space,  724  ff 
and  vector  r,  142 
Poisson  distribution,  767  ff 

approximated  by  normal  distribu¬ 
tion,  769 

mean  and  standard  deviation,  768 
Poisson’s  equation,  619,  652  ff,  657 
Poisson’s  summation  formula,  389 
polar  coordinates.  See  also  orthogo¬ 
nal  coordinates 
acceleration  in,  487 
arc  length  element,  259,  266,  521 
area  element,  258 
change  of  variables  to,  229,  258 
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complex  numbers  in,  48,  50  ff 
div,  298 

Euler  equation,  478 
grad,  294,  525 
integrals  in,  258  ff 
Lagrange’s  equations  in,  486 
Laplacian,  298 
partial  derivatives,  189,  208 
scale  factors,  522 
unit  basis  vectors,  288,  522 
polar  form  of  a  complex  number,  48,  61 
polar  vector,  515 

in  physics  formulas,  517 
pole(s) 

on  contour,  691 
at  infinity,  703 
number  in  a  region,  694 
of  order  2,  681 
of  order  n,  680 
residues  at,  684  ff 
simple,  680  ff 
Polya’s  urn  model,  741 
polynomial  approximation,  581 
polynomials 
Hermite,  608 
Laguerre,  609  ff 

Legendre,  566  ff  ( see  also  Legendre 
polynomials) 

Legendre  polynomials,  as  combina¬ 
tions  of,  574 

population  average  and  variance,  771 
position  vector,  286 
potential 
complex,  713 

by  conformal  mapping,  712-  713 
electrostatic,  304,  712,  713  ( see  also 
electrostatic  potential) 
gravitational,  290,  303  ff,  484, 
Legendre  expansion  of,  571  ff 
multipole  expansion  of,  571-573 
Poisson’s  equation  for,  652  ff 
scalar,  303  ff 
vector,  332 
velocity,  303,  713 
zero  at  infinity,  304-305 
potential  energy,  303,  433,  485 
power  of  a  matrix,  121 
power  series.  See  also  series 

absolutely  convergent,  10,  19,  58 


adding,  19,  23,  59 

alternating  series  test,  17 

in  annular  ring,  679 

binomial,  28 

combining,  23,  26  ff,  59 

comparison  test,  10 

complex,  56  ff 

computation,  3,  36  ff,  41  ff 

using  computer,  31 

conditionally  convergent,  18 

convergence  of,  6  ff,  20  ff,  58 

differentiating,  23,  59 

disk  of  convergence,  58,  671 

divergence  of,  6  ff 

dividing,  27  ff,  59 

expanding  functions  in,  23  ff 

functions  having  no  power  series,  33 

generalized,  585  ff 

general  term,  4 

integral  test,  11  ff 

integrating,  23,  30,  59 

interval  of  convergence,  20 

Maclaurin,  24  ff 

multiplying,  23,  26,  59 

numerical  computation  using,  36 

preliminary  test,  9 

ratio  test,  13,  14 

rearranging  terms,  18 

remainder  of,  7,  13 

sequence,  1,  5 

solution  of  differential  equations,  Chap¬ 
ter  12.  See  also  differential  equa¬ 
tions 

substitution  of  one  in  another,  23,  29 
Taylor,  24,  30,  213,  671 
theorems  about,  23 
two- variable,  191 
uniqueness  of,  23,  59,  191 
powers  of  a  matrix,  121 
powers  of  complex  numbers,  63 
Poynting  vector,  535 
preliminary  test,  9 

principal  angle  of  a  complex  number,  50 
principal  axes  and  moments  of 
inertia,  506  ff 

principal  axes  of  a  conic  section,  162 
principal  part  of  A z,  195 
principal  part  of  Laurent  series,  678 
principal  value  of 
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arc  tan,  50 
an  integral,  692 
In  z,  72 

principle  of  superposition,  425,  428 
probability,  Chapter  15 
applications,  722 
of  compound  events,  730 
conditional,  732 
cumulative  distribution,  748 
definition,  723,  725 
density,  753  ff  (see  also  probability 
functions) 
distribution,  748 
distribution  function,  748 
experimental  measurements,  770  ff 
functions,  745,  748 
binomial,  756  ff 
continuous,  750  ff 
normal  (Gaussian),  761  ff 
Poisson,  767  ff 
of  independent  events,  731 
mathematical,  727 
mean  value,  746,752,763,768 
natural  or  intuitive,  727 
random  variables,  744,  747 
sample  space,  724  ff 
standard  deviation,  747,753,763,768 
theorems,  729 

variance,  dispersion,  747,  753 
probable  error,  774 
product 

of  complex  numbers,  62  ff 
cross,  103  ff,  276 
derivative  of,  567 
of  determinants,  118 
dot,  101,  102,  276 
inner,  100 

of  matrices,  115,  116 
scalar,  101,  276 
of  series,  23,  26,  191 
vector,  103,  276 
of  vectors,  276 
progression,  geometric,  1 
projections  of  a  vector,  101 
proper  rotation,  514 
p-series  test,  13 

pseudovector,  pseudotensor,  514 
axial  vector,  515 
cross  product,  516 


pup  tent,  212 
pure  imaginary,  47 
pure  tone,  345 

quadratic  formula,  46 
quadratures,  433 

quadric  surface,  principal  axes  of,  163-164 
quadrilateral,  100 
quadrupole  moment,  573 
quantum  mechanics  applications 
complex  numbers,  81 
correspondence  principle,  197 
Dirac  delta  function,  454 
Fourier  transforms,  386 
Hermite  and  Laguerre  functions,  607-614 
hydrogen  atom,  614  (Problems  27, 

28),  652  (Problem  22) 

Klein-Gordon  equation,  665 
ladder  operators,  607  ff,  614  (Prob¬ 
lems  29,  30) 

Pauli  spin  matrices,  122 
Schrodinger  equation,  620,  628,  631- 
632,  651  (Problems  18,  20-22) 
simple  harmonic  oscillator,  651 
(Problem  21) 

spherical  harmonics,  649,  651 
(Problem  16) 
statistics,  739  ff 
problems,  744 

sums  of  integers,  651  (Problem  21), 

744  (Problem  21) 
quotient.  See  dividing 
quotient  rule  for  tensors,  504,  532 

radian  mode,  49 
radians  and  degrees,  49,  52 
radioactive  decay,  39,  395,  399,  402- 
403,  755 

and  Poisson  distribution,  768 
radio-frequency,  347 
radio  waves,  343,  347 
radius  of  convergence,  58 
raindrop,  467 

raising  and  lowering  operators,  ladder 
operators,  607,  614,  618 
raising  and  lowering  tensor  indices,  532 
ramp  function,  374 
random  variables,  744,  747 
random  walk,  757 
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rank  of  a  matrix,  86,  94 
rank  (order)  of  a  tensor,  496 
rational  functions,  60 
ratio  test,  13-14 
real  axis,  48 

real  part  of  a  complex  number,  47,  49 
rearranging  terms  of  a  series,  18 
reciprocals  in  differentiation,  208 
reciprocity  relations,  190,  231,  306 
rectangular  form  of  a  complex 
number,  48,  51 
rectified  half-wave,  374 
recurrence.  See  recursion  relations 
recursion  relations: 

Bessel  functions,  592 
gamma  function,  538-539 
Hermite  polynomials,  609 
Legendre  polynomials,  570 
reduced  mass,  197 
reduction  of  order  in  a  differential 
equation,  434,  606 
reflection  matrix,  128  ff,  155  ff 
reflection  of  axes,  126,  128  ff,  156  ff 
reflection  of  light,  285,  474 
refraction,  474 
region,  667 

simply  connected,  330 
regular  differential  equation,  605 
regular  function,  667 
regular  point,  680 
relative  error,  197,  775 
relative  intensity  of  harmonics,  373 
relativity,  42,  394 
relaxation  oscillator,  387 
remainder,  7,  13 
residues 

at  infinity,  703 
using  Laurent  series,  683 
at  a  multiple  pole,  685 
at  a  simple  pole,  684 
residue  theorem,  682 

evaluating  integrals  using,  687  ff 
resistance  of  a  wire,  197 
resonance,  427 

response  to  unit  impulse,  449,  459 

resultant,  446 

rhombus,  106 

rhumb  line,  269 

Riccati,  408 


Riemann  surface,  707,  708 
right-handed  system,  515 
right-hand  rule,  515 
RLC  circuit.  See  electric  circuits 
rocket,  467 

Rodrigues’  formula,  568 
Hermite  polynomials,  608 
Laguerre  polynomials,  609 
Legendre  polynomials,  568 
root-mean-square,  348 
roots  of  auxiliary  equation,  409-414 
roots  of  complex  numbers,  65,  66 
rotation(s) 
angle,  127,  129,  151 
axes  rotated,  127 

in  the  complex  plane,  77  (Problem  1), 
131  (Problem  19) 
equations,  127,  151 
improper,  514 

matrix,  120,  127,  129,  155  ff 
in  n  dimensions,  143 
non-commuting,  132 
not  a  vector,  497 
to  principal  axes,  162-164 
proper,  514 

of  a  rigid  body,  497,  505  ff 
tensor,  535  (problem  9) 
in  three  dimensions,  129,  155  ff 
in  two  dimensions,  127,  151 
of  a  vector,  127 
vector  rotated,  127 
rot  v,  rotation  v,  325 
row  matrix,  114 
row  operations,  86 
row  reduction,  83  ff 
row  vector,  114 

saddle  point,  212 
sample  average,  771 
sample  space,  724  ff 
sawtooth  voltage,  346,  374 
scalar,  82,  496 
field,  290 
operator,  296 

potential,  303  ff,  332  (see  also 
potential) 
product,  101,  276 
projection,  101 
triple  product,  278 
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scale  factors  and  basis  vectors,  522 
Schrddinger  equation,  620,  628,  631- 

632,  651  (Problems  18,  20-22) 
particle  in  a  box,  632 

time  independent,  631 
Schwarz  inequality 

in  complex  Euclidean  space,  146 
in  Euclidean  space,  145 
secy  method,  271 

second  derivative  tests  for  maximum, 
minimum,  213  (Problem  2) 
second-rank  tensor: 

Cartesian,  500 
contravariant,  532 
covariant,  531 
mixed,  531 
skew-symmetric,  503 
symmetric,  503 
seconds  pendulum,  343 
secular  determinant  148 
seesaw,  277 

selections  (combinations),  737 
separable  differential  equations,  395 
separation  constant,  622 

integral  valued,  639,  645,  648 
sign  of,  624,  629,  634,  639,  648 
zero  value  of,  628,  644 
separation  of  variables: 

ordinary  differential  equations,  395 
partial  differential  equations,  622, 

633,  639 
sequence  1,  5,  8 

series,  Chapters  1,  2,  7,  11,  12.  See 
also  individual  series 
absolutely  convergent,  10,  19,  58 
adding,  19,  23,  59 
alternating,  17,  34 
alternating  harmonic,  17,  18,  21,  37 
approximations,  33  ff,  38  ff,  545,  548 
asymptotic,  549  ff,  552 
Bessel,  640  ff,  645  ff  ( see  also  Bessel 
functions) 
binomial,  28 

complex,  56  ff,  673,  678  ff 
computation  with,  3,  36,  41  ff,  548- 
549,  624  ff 

conditionally  convergent,  18 
convergent,  6,  20,  58  (see  also 
convergence) 


divergent,  6  ff 
dividing,  23,  27,  59 
Fourier,  Chapter  7  (see  also  Fourier 
series) 

geometric,  1  ff 
harmonic,  11,  12 
infinite,  Chapter  1 
Laurent,  678  ff 
Legendre,  580,  649  (see  also 
Legendre  series) 

Maclaurin,  24  ff 
multiplying,  23,  26,  191 
oscillating,  7 
partial  sum  of,  7 
of  positive  terms,  10 
power,  Chapter  1  (see  also 
power  series) 
rearranging  terms  in,  18 
remainder  of,  7,  13,  33 
for  alternating  series,  34 
solutions  of  differential  equations, 
428,  562  ff 
summing,  37 
sum  of,  2,  7,  18 
Taylor,  24,  30,  671 
useful  facts  about,  19 
series  solutions  of  differential  equations, 
Chapter  12 
Bessel  588,  593,  594 
Hermite,  607  ff 
Laguerre,  609  ff 
Legendre,  564  ff 
sgn  x,  460 

shear  forces  in  stress  tensor,  496 
sheet  of  Riemann  surface,  708 
shifting  theorems,  439,  468 
SHM.  See  simple  harmonic  motion, 
340  ff 

shortening  pendulum,  600 
shortest  distance  along  a  surface,  see 
geodesics 

shortest  time.  See  brachistochrone 
side  bands,  347 
Sierpinski  gasket,  3 
similarity  transformation,  150  ff 
orthogonal,  150  ff,  154 
unitary,  154 
similar  matrices,  150 
simple  curve,  674 
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simple  harmonic  motion,  38,  80,  340  ff, 

412.  See  also  pendulum 
problems,  344,  416,  489-490,  754 
simple  pendulum.  See  pendulum 
simple  pole,  680 
simply  connected  region,  330 
simultaneous  diagonalization  of  ma¬ 
trices,  158 

simultaneous  differential  equations,  441 
sine 

of  complex  number,  68 
Fourier  series,  366 
Fourier  transform,  381 
hyperbolic,  70 
inverse,  74  ff 
power  series  for,  26,  69 
single-valued  function,  356,  667 
singularity,  670 
singular  matrix,  119 
singular  point,  670 
singular  solution,  397 
sinhz,  70 

sink,  sink  density,  316,  714 
sinusoidal  functions,  341 
skew  lines,  111 
skew-symmetric  matrix,  138 
skew-symmetric  tensor,  503 
skin  effect,  596 
slant  height,  255 
slope  of  a  curve,  202-203 
Snell’s  law,  474 
sntt,  558 

soap  film,  473,  480 
solenoidal  field,  332 
solid  angle,  320 
solid  of  revolution,  253 
sound  waves,  345,  372,  378 
air  pressure  in,  345,  372 
energy  in,  373 

frequency:  continuous,  378;  funda¬ 
mental,  345,  373;  overtones,  345,  373 
harmonics,  345,  373 
intensity  of,  373 
pure  tone,  345 
vibrating  drum,  644 
vibrating  string,  633  ff 
source,  source  density,  714 
space,  142  ff.  See  linear  vector  space 
span,  143 


special  comparison  test,  15 
specific  heats,  211  (Problem  28) 
spectrum,  continuous,  378 
sphere  and  complex  plane,  703 
spherical  Bessel  functions,  596 
spherical  coordinates,  261.  See  also 
orthogonal  coordinates 
acceleration,  523 
arc  length  element,  266 
curl,  527 
div,  298,  526 
grad,  294,  525 
Laplace’s  equation,  648 
Laplacian,  298 

scale  factors  and  unit  basis  vectors, 
523 

velocity,  266,  523 
volume  element,  261,  263,  524 
spherical  harmonics,  649 
spherical  pendulum,  489 
spring  constant,  80 
square  of  a  vector,  101 
square  wave,  353 
stability  of  a  vertical  wire,  600 
standard  deviation 

for  binomial  and  normal  distribu¬ 
tions,  763 
of  the  mean,  772 
for  Poisson  distribution,  768 
of  a  random  variable,  747,  753 
of  a  single  measurement,  771 
of  sum,  product,  etc.,  772  ff 
standard  error,  722 
standard  normal  distribution,  763,  764 
star  *  meaning  complex  conjugate,  50 
star  *  meaning  convolution,  446 
statics  problems,  42-43 
stationary  point,  472  ff 
statistical  mechanics  applications,  554, 
739  ff 

Bose-Einstein,  Fermi-Dirac,  Maxwell- 
Boltzmann  statistics,  742 
problems,  743-744 
statistics,  770  ff 

central  limit  theorem,  774 
combination  of  measurements, 

772  ff 

confidence  interval,  probable 
error,  774 
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and  experimental  measurements, 

770  ff 

population  mean  and  variance, 

771 

standard  deviation  of  the  mean  772 
steady  state,  426 
steady  state  temperature 
in  a  cylinder,  638  ff 
in  a  finite  plate,  625 
insulated  boundaries,  631 
insulated  edges,  627  (Problem  14) 
problems,  626  ff 

in  a  semi-infinite  rectangular  plate,  621 
in  a  sphere,  648  ff 
stereographic  projection,  703 
Stirling’s  formula,  552 
error  in,  553 

in  statistical  mechanics,  554 
stochastic  process,  746 
Stokes’  theorem,  313,  324  ff,  327  ff 
and  Ampere’s  law,  329 
strain,  strain  tensor,  519 
stream  function,  713 
streamlines,  325,  714.  See  also  flow  of 
water 

stress  tensor,  496,  519,  520 
stretch,  152 

string,  vibrating.  See  vibrating  string 
Sturm-Liouville  equation,  617 
subgroup,  173 
subscripts 
on  D ,  568 
on  A,  298 

in  partial  differentiation,  189-190,  205 
in  tensor  notation,  502  ff,  530 
on  vectors,  97  ff 
subspace,  143  ff 

substituting  one  series  in  another,  29 
subtraction  of  vectors,  98 
summation  convention,  502  ff 
summation  sign,  4 
summing  numerical  series: 
by  computer,  41,  44,  45 
using  Fourier  series,  358,  377 
power  series  of  a  function,  37 
sum  of 

alternating  harmonic  series,  37 
complex  numbers,  51 
conditionally  convergent  series,  18 


geometric  progression,  2 
infinite  series,  2,  7,  10  ff,  37 
matrices,  115 
power  series,  23 
two  series,  18,  23 
vectors,  97 

superposition  principle,  425 
superscripts,  530 
surface 

area,  271  (see  also  area) 
distance  to,  216 
integral,  270  ff 
level,  290,  293 
of  minimum  area,  479 
normal  to,  220,  271,  293,  317,  327 
one-sided  and  two-sided,  327 
of  revolution,  254,  479 
tangent  plane  to,  194,  293 
symmetric 

equations  of  line,  107-108 
matrix,  138 
tensor,  503 

symmetry  groups,  174,  178 
systems  of  masses  and  springs,  165-172 

tables 

approximate  formulas  for  Bessel 
functions,  604 

Laplace  transforms,  469-471 
matrices,  137-138 
orthogonal  polynomials: 

Hermite  and  Laguerre,  804 
Legendre,  803 
vector  identities,  339 
tangent  approximation,  193 
tangent  line,  203 
tangent  plane,  293 
tangent,  series  for,  28 
Taylor  series,  24,  30,  671.  See  also 
power  series 
temperature: 

in  a  bar,  629  ff,  659 
on  boundary,  720 
in  a  cylinder,  638  ff 
gradient,  292-293 
in  a  half  cylinder,  712 
in  a  hemisphere,  650 
maximum  and  minimum  in  a  plate, 
224-225 
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in  a  plate,  621,  625 
scale,  650 

in  a  semicircular  plate,  711 
in  a  sphere,  648  ff 
tensor  product.  See  direct  product 
tensor(s),  Chapter  10 

angular  momentum,  505  ff 
antisymmetric,  503 
applications  of,  505,  518  ff 
associated,  533 
Cartesian,  498  ff 
combining,  504 

components  and  basis  vectors,  530 

contraction,  502  ff 

contravariant,  530 

covariant,  530 

cross  product,  516 

definition,  499 

direct  product,  501 

dual,  512 

electric  dipole,  quadrupole, 

574  (Problem  15) 
field,  520 

in  general  coordinates,  531 
inertia,  505  ff 
isotropic,  509 
Kronecker  5,  508  ff 
Levi-Civita  symbol,  508  ff 
and  matrices,  503 
metric,  524,  532 
non-Cartesian,  529  ff 
notation,  502 
order  (see  rank) 

pseudo- vectors,  pseudo-tensors,  514  ff 
quotient  rule,  504,  532 
raising  and  lowering  tensor  indices,  532 
rank,  496 

rotation,  535  (Problem  9) 

2nd-rank,  496,  501,  506 
strain,  519 
stress,  496,  519 
summation  convention,  502 
symmetric,  503 
vector  identities,  511 
vector  operators,  525 
and  vectors,  496  ff 
tent,  212 

term-by-term  addition  of  series,  19,  23 
terminal  speed,  400 


thermodynamics,  190,  211.  See  also 
heat  flow 

Legendre  transformation,  231,  233 
reciprocity  relations,  190,  231,  306 
specific  heats,  211 
time  constant,  399 
torque  about  a  line,  277 
torque  about  a  point,  281 
torus,  257 

total  differentials,  193  ff 
tower 

of  books,  44 
circular,  43 
trace,  140 

and  group  character,  176 
of  a  product  of  matrices,  140 
of  a  rotation  matrix,  157 
transfer  function,  444 
transformations.  See  also  linear  trans¬ 
formations;  rotation 
active,  127 

conformal  mapping,  705  ff 
general,  520 

Jacobian  of,  261  ff,  283,  536  (Prob¬ 
lem  12),  710  (Problems  12,  13) 
Legendre,  231,  233 
linear,  125 
orthogonal,  126  ff 
passive,  127 
similarity,  150 

transforms,  integral,  378  ff,  437  ff 
Fourier,  378  ff  (see  also  Fourier 
transforms) 

Hilbert,  437 

Laplace,  437  ff  (see  also  Laplace  trans¬ 
forms) 

transients,  426 

translation  or  shifting  theorems,  439 
transpose  of  a  matrix,  84,  137 
transpose  of  product  of  matrices,  139 
trapezoid,  100 
triangle  inequality,  148 
trigonometric  functions 
of  complex  numbers,  67  ff 
identities,  69,  70 
integrals  of,  349,  351 
inverse  of,  74 
series  for,  23  ff 
triple  integrals,  242  ff 
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triple  scalar  product,  278 
triple  vector  product,  280 
trivial  solution,  134 
true  vector  (polar  vector),  515 
tuning  fork,  345 
two-sided  surface,  327 
two-variable  power  series,  191 

unbounded  region,  659 
underdamped  motion,  413,  414 
undetermined  coefficients,  421 
undetermined  multipliers,  216 
uniformly  distributed,  751 
uniform  sample  space,  725 
unitary 

matrix,  138 

similarity  transformation,  154 
transformation,  154 
unit  basis  vectors,  100,  144,  288,  522. 
See  also  basis  vectors 
cross  product,  104, 
in  curvilinear  coordinates,  522 
in  cylindrical  coordinates,  522 
derivatives,  288,  523 
dot  product,  102 

in  general  curvilinear  coordinates,  523 
in  polar  coordinates,  288 
unit  circle,  687 
unit  eigenvectors,  149,  164 
unit  element  of  a  group,  172 
unit  impulse,  449 
unit  matrix,  118 
unit  step,  470 

unit  vector,  98.  See  also  unit  basis 
vectors 

in  polar  coordinates,  287,  288 
universe,  770 

variables.  See  also  change  of  variables; 
coordinates;  dependent  variable; 
independent  variable 
complex,  Chapters  2  and  14 
dummy,  380,  541 
random,  744,  747 

variance,  747,  753,  763,  771.  See  also 
standard  deviation 
variational  notation,  493 
variation  of  parameters,  464 
varied  curves,  475 


vector  coordinate,  286 
vector  field,  289  ff,  332,  393,  394 
vector  identities,  table  339 

polar,  cylindrical,  spherical,  298 
proved  in  tensor  form,  511 
vector  integral  theorems,  325 
vector  operators,  296  ff,  525  ff 
curl,  296,  324,  511,  527 
in  curvilinear  coordinates,  525  ff 
in  cylindrical  coordinates,  525  ff 
divergence,  296,  314,  526 
gradient,  290  ff,  525 
Laplacian,  297,  298,  527 
in  tensor  notation,  533 
vector  potential,  332,  336  (Problem  3) 
vector  product,  276 
not  commutative,  103 
of  parallel  vectors,  103 
triple,  280 

vectors,  Chapters  3,  6  and  10 
acceleration,  286 
complex,  56 
addition  of,  97 
angle  between,  102 
associated,  533 

associative  law  for  addition,  97 
axial,  515 

basis,  287  ( see  also  basis  vectors) 
Cartesian,  498  ff 
characteristic,  148  ff 
column,  114 

commutative  law  for  addition,  97 
components,  82,  97,  100,  114,  497, 
530 

covariant  and  contravariant,  530 
cross  product,  103  ff,  276,  516 
definition,  82,  497 
derivatives  of,  285  ff 
displacement,  286 
dot  product,  101,  102,  276 
length,  96 

linearly  independent,  132 
magnitude,  96 
in  matrix  notation,  114 
multiplication  of,  100  ff 
in  n  dimensions,  143 
negative  of,  97 
norm  of,  96 
notation,  96 
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orthogonal,  144 
parallel,  102 
perpendicular,  102 
polar,  515 

in  polar  coordinates,  287,  288 

position,  286 

products  of,  276 

pseudo-,  514 

row,  114 

scalar  product,  276 
subtraction  of,  98 
transformation  law,  498 
triple  products  of,  278  ff,  511,  516 
unit,  100  (see  also  unit  basis  vec¬ 
tors) 
zero,  98 

vector  space.  See  linear  vector  space 
velocity 

amplitude,  343 

angular,  278,  324,  340 

complex,  56,  717 

in  cylindrical  coordinates,  523 

of  electrons,  42 

of  escape,  435 

of  light,  474 

in  orthogonal  coordinates,  524 
potential,  303,  620,  713 
in  spherical  coordinates,  266 
vector,  286 
wave,  342,  620,  634 
vibrating  string,  633  ff 

characteristic  frequencies,  636 
in  elastic  medium,  665 
with  free  end,  637 
Green  function  solution,  462 
plucked,  634 
standing  waves,  636 
struck,  635 

of  variable  density,  600 
vibrations.  See  also  pendulum;  sound 
waves;  waves;  vibrating  string 
characteristic  frequencies,  636 
of  circular  membrane  (drum) ,  644  ff 
damped,  413-414 
electrical,  340,  426  ff,  444 
forced,  417  ff,  426  ff 
free,  80,  417 
due  to  impulse,  449 
of  lengthening  pendulum,  598 


normal  modes,  645,  646 
of  pendulum,  38,  344,  545  ff,  557 
period  of,  341 
resonance,  427 

simple  harmonic,  38,  341,  412 
problems,  344,  416 
in  a  sound  wave,  345,  373 
steady  state,  426 
of  string,  633 
tuning  fork,  345 
violin,  645 

voltage,  346,  353,  378 
volume,  242  ff 

element,  253,  261,  524 
finding  by  integration,  242,  253 
integral,  253  ff,  318 
of  revolution,  253 
vortex,  714 

water  flow,  325,  713-714.  See  also 
flow  of  water 
water  waves,  342 

wave  equation,  297,  620,  633  ff,  644  ff 
for  vibrating  drum,  644 
for  vibrating  string,  633  ff 
wavelength,  342 
wave  number,  634 
waves,  342,  620,  634 
light,  346,  378 
notation,  344  (Problem  17) 
radio,  343 
sound, 345,  372 
velocity  of,  342,  620,  634 
Weber  function,  591 
weighted  coin,  727,  734 
weight  function,  602 
wire,  vertical  stability  of,  600 
work,  277 

independent  of  the  path,  303 
Wronskian,  133,  136  (Problem  16) 

yo-yo,  489 

zero  factorial,  4 
zero  matrix,  117 
zero-rank  tensor,  496 
zeros  of  Bessel  functions,  591 
zeros  of  f(z ),  694 
zeta  (£)  function,  41 
z-score,  764 


